Learn how to design, develop, deploy and iterate on production-grade ML applications.
-
Updated
Jul 26, 2023 - Jupyter Notebook
Learn how to design, develop, deploy and iterate on production-grade ML applications.
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
Always know what to expect from your data.
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Feature Store for Machine Learning
Kestra is an infinitely scalable orchestration and scheduling platform, creating, running, scheduling, and monitoring millions of complex pipelines.
lakeFS - Data version control for your data lake | Git for data
Compare tables within or across databases
Learn how to design, develop, deploy and iterate on production-grade ML applications.
An open-source data logging library for machine learning models and data pipelines.
Feathr – A scalable, unified data and AI engineering platform for enterprise
The Virtual Feature Store. Turn your existing data infrastructure into a feature store.
re_data - fix data issues before your users & CEO would discover them
First open-source data discovery and observability platform. We make a life for data practitioners easy so you can focus on your business.
Data quality assessment and metadata reporting for data frames and database tables
A curated, but incomplete, list of data-centric AI resources.
Automatically find issues in image datasets and practice data-centric computer vision.
Qualitis is a one-stop data quality management platform that supports quality verification, notification, and management for various datasource. It is used to solve various data quality problems caused by data processing. https://github.com/WeBankFinTech/Qualitis
Add a description, image, and links to the data-quality topic page so that developers can more easily learn about it.
To associate your repository with the data-quality topic, visit your repo's landing page and select "manage topics."