#
datawrangling
Here are 84 public repositories matching this topic...
A terminal spreadsheet multitool for discovering and arranging data
python
cli
tsv
json
csv
reconciliation
sqlite
opendata
loader
tabular-data
spreadsheet
eda
pandas
tui
unix-toolkit
hdf5
datawrangling
devops-tools
datajournalism
-
Updated
Jun 28, 2021 - Python
This project analyzes and visualizes the Used Car Prices from the Automobile dataset in order to predict the most probable car price
numpy
linear-regression
exploratory-data-analysis
pandas
data-visualization
seaborn
data-analysis
matplotlib
datawrangling
polynomial-regression
model-evaluation
model-development
multiple-linear-regression
datascience-machinelearning
-
Updated
Jun 13, 2021 - Jupyter Notebook
Advanced data wrangling for python
-
Updated
Jul 10, 2020 - Python
Understand the relationships between various features in relation with the sale price of a house using exploratory data analysis and statistical analysis. Applied ML algorithms such as Multiple Linear Regression, Ridge Regression and Lasso Regression in combination with cross validation. Performed parameter tuning, compared the test scores and suggested a best model to predict the final sale price of a house. Seaborn is used to plot graphs and scikit learn package is used for statistical analysis.
python
machine-learning
correlation
linear-regression
cross-validation
data-visualization
data-extraction
data-analysis
regularization
standardization
datawrangling
predictive-modeling
ridge-regression
data-exploration
k-fold
lasso-regression
encoding-library
parameter-tuning
root-mean-squared-error-metric
regression-analysis
-
Updated
Jan 19, 2018 - Jupyter Notebook
DataTutor: Data Science Code Comprehension Tools.
-
Updated
Apr 11, 2021 - JavaScript
Python Data Analytics, Machine Learning & Natural Language Processing
machine-learning
natural-language-processing
sentiment-analysis
pandas-dataframe
sklearn
pandas
credit-card-fraud
spacy
datawrangling
movie-reviews
nlp-machine-learning
classification-algorithm
alogrithms
algorithms-and-data-structures
breastcancer-classification
dataanalytics
sklearn-library
codemaker
loan-prediction-analysis
dataanalysisandmlusingpython
-
Updated
Apr 4, 2020 - Jupyter Notebook
machine-learning
pandas-dataframe
svm
scikit-learn
python-script
regression
data-analysis
datawrangling
uci-machine-learning
svr
-
Updated
Nov 29, 2017 - Jupyter Notebook
course website for data science tools 1
git
sklearn
jupyter-notebook
eda
pandas
seaborn
datascience
matplotlib
datawrangling
bash-script
dataingestion
-
Updated
Jun 8, 2021 - Jupyter Notebook
A collection of projects as part of the Python for Data Science program at GreyAtom EduTech Pvt Ltd
python
data-science
data
dictionary
datascience
data-analysis
datawrangling
pythonprojects
datascienceproject
greyatom
-
Updated
Jun 7, 2020 - Python
-
Updated
Mar 14, 2019 - Jupyter Notebook
Udacity Data Analyst NanoDegree Data Wrangling course project
-
Updated
Sep 12, 2017 - Jupyter Notebook
My collection of visualizing different datasets using (Matplotlib, Seaborn, and Folium) packages for Python
-
Updated
Dec 30, 2020 - Jupyter Notebook
This repository is a collection of all the solutions of tasks that were assigned to me during my Data Analytics Virtual Internship Experience Program at Quantium. 💻 📚 📊
python
data
presentation
excel
jupyter-notebook
python3
powerpoint
datascience
datawrangling
internship-task
datacleaning
dataanalytics
-
Updated
Mar 1, 2021 - Jupyter Notebook
python
json
data-science
machine-learning
csv
sql
numpy
pandas
data-structures
datawrangling
miniprojects
object-oriented-programming
dataanalysis
dataanalytics
wqu
worldquantuniversity
-
Updated
May 10, 2021 - Jupyter Notebook
This repository is demonstration of Pandas library of Python's super powers.
python
data
pandas-dataframe
python-library
python-script
pandas
python3
datascience
datawrangling
pandas-dataframes
dataengineering
pandas-tutorial
pandas-library
dataexploration
datamanipulation
pandas-python
-
Updated
Jun 1, 2020 - Jupyter Notebook
Why data analysis? , How to understand the problem, what to do for data analysis, and how clean the data for building Machine Learning models
python
correlation
exploratory-data-analysis
binning
datawrangling
descriptive-statistics
normalization
grouping
dataanalysis
categorical-features
missing-values
anova-test
pre-processing-data
-
Updated
Feb 20, 2021 - Jupyter Notebook
Data Analysis On Demand course
-
Updated
Jan 17, 2021
This is to wrangle and analyze data from WeRateDogs Twitter archive.
-
Updated
May 12, 2020 - Jupyter Notebook
This is an exercise on the use of python for data wrangling based on the book "Python for Data Analysis" by Wes McKinney
-
Updated
Jul 21, 2018 - Jupyter Notebook
Aggregate data in R using simple SQL commands
-
Updated
Dec 2, 2018 - R
Gather data from Twitter API and provide insights
-
Updated
Mar 17, 2019 - Jupyter Notebook
This assignment is based on a dataset of credit card transactions that you can download from Link to download the dataset Each row in the dataset refers to one credit card purchase by a cardholder. Each cardholder belongs to an agency. Each transaction has information about the amount, date, merchant and merchant category, as well as a description of the expense.
programming
python3
segmentation
kmeans
datawrangling
decision-trees
kmeans-clustering
dataanalysis
sklearn-library
-
Updated
Mar 6, 2019 - Jupyter Notebook
-
Updated
Mar 25, 2019 - Jupyter Notebook
-
Updated
Aug 20, 2017 - Jupyter Notebook
python
portfolio
data
udacity
analytics
exploratory-data-analysis
jupyter-notebook
eda
seaborn
data-analysis
nanodegree
datawrangling
fifa
datavisualization
prosper
prosper-loan-data
datacleaning
explanatory-data-analysis
-
Updated
Jul 2, 2019 - Jupyter Notebook
Repository containing self built case studies and analysis leveraging most of the phases in the entire executable data science chain end to end, ranging from harvesting of resources to building cutting edge deep dive data models.
json
collection
nosql
metrics
exploratory-data-analysis
evaluation
inference
feature-selection
statistical-analysis
documents
datawrangling
feature-engineering
datamining
statistical-models
datascraping
dataanalysis
exploratory-data-visualizations
datacleaning
machienlearning
-
Updated
Mar 7, 2021 - Jupyter Notebook
Improve this page
Add a description, image, and links to the datawrangling topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the datawrangling topic, visit your repo's landing page and select "manage topics."
After loading a new file and coming to the Configure Parsing Options page, you have the ability to modify the parsing rules, which I do pretty frequently with the kinds of (mostly tsv-like) files that I get. Most frequently I need to change the "Ignore first" and "Discard initial" parameters and since I usually need both of these and since my files are large and complex enough for the preview upda