OpenRefine is a free, open source power tool for working with messy data and improving it
-
Updated
Mar 29, 2023 - Java
OpenRefine is a free, open source power tool for working with messy data and improving it
A Scalable Data Cleaning Library for PySpark.
Table Enforcer is my attempt to apply a sort of "test driven development" workflow to data cleaning and validation. A python package to facilitate the iterative process of developing and using schema-like representations of DataFrames in pandas for recoding and validating instances of these data.
Examples for Optimus a Data Cleansing Library for Big Data.
Data visualisations in Power BI
GitHub Repo of our Tidyverse workshop organized on Sep 8, 2022
Data cleansing and validation for Data Science Master degree
Advance Guide Of Cleaning & 20+ ways of cleaning data with python
This is the curated pile of notebooks/small projects which contains linear and non-linear regression models.
Cars24 is an online second handle cars selling company, in this project Data analysis was done on the cars for sale.
-This project targets the textual analysis of Egyptian movie plot summaries that were curated from online sources, covering the four golden decades of Egyptian Cinema.
sales_analysis
we use keras and tensorflow and sklearn to classify health level of student by using Nursey UCI Dataset
Data cleaning, analysing in excel and finally creating a dashboard in Tableau as part of the KPMG virtual internship.
Google Cloud Data Fusion - Data Transformation Logics using CDAP Wrangler Directives.
LECR EDA & Fine Tuning
cleaning bookellar data using tableau
Add a description, image, and links to the datacleansing topic page so that developers can more easily learn about it.
To associate your repository with the datacleansing topic, visit your repo's landing page and select "manage topics."