Apache Superset is a Data Visualization and Data Exploration Platform
-
Updated
Dec 20, 2023 - TypeScript
Apache Superset is a Data Visualization and Data Exploration Platform
Learn how to design, develop, deploy and iterate on production-grade ML applications.
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
Workflow Engine for Kubernetes
Prefect is a workflow orchestration tool empowering developers to build, observe, and react to data pipelines
Data integration platform for ELT pipelines from APIs, databases & files to warehouses & lakes.
The Data Engineering Cookbook
Roadmap to becoming a data engineer in 2021
An orchestration platform for the development, production, and observation of data assets.
Always know what to expect from your data.
Fancy stream processing made operationally mundane
🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.
The streaming database: redefining stream processing 🌊. PostgreSQL-compatible, highly performant, scalable, elastic, and reliable ☁️.
The open source high performance data integration platform built for developers.
Open Source Feature Flagging and A/B Testing Platform
Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.
Feature Store for Machine Learning
lakeFS - Data version control for your data lake | Git for data
SQL Translator is a tool for converting natural language queries into SQL code using artificial intelligence. This project is 100% free and open source.
Add a description, image, and links to the data-engineering topic page so that developers can more easily learn about it.
To associate your repository with the data-engineering topic, visit your repo's landing page and select "manage topics."