Apache Superset is a Data Visualization and Data Exploration Platform
-
Updated
Nov 10, 2023 - TypeScript
Apache Superset is a Data Visualization and Data Exploration Platform
Learn how to design, develop, deploy and iterate on production-grade ML applications.
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
Workflow Engine for Kubernetes
Prefect is a workflow orchestration tool empowering developers to build, observe, and react to data pipelines
The Data Engineering Cookbook
Data integration platform for ELT pipelines from APIs, databases & files to warehouses & lakes.
Roadmap to becoming a data engineer in 2021
Always know what to expect from your data.
An orchestration platform for the development, production, and observation of data assets.
Fancy stream processing made operationally mundane
🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.
The open source high performance data integration platform built for developers.
Open Source Feature Flagging and A/B Testing Platform
Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.
Feature Store for Machine Learning
lakeFS - Data version control for your data lake | Git for data
SQL Translator is a tool for converting natural language queries into SQL code using artificial intelligence. This project is 100% free and open source.
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
Add a description, image, and links to the data-engineering topic page so that developers can more easily learn about it.
To associate your repository with the data-engineering topic, visit your repo's landing page and select "manage topics."