Alluxio, data orchestration for analytics and machine learning in the cloud
-
Updated
Feb 21, 2023 - Java
Alluxio, data orchestration for analytics and machine learning in the cloud
Kestra is an infinitely scalable orchestration and scheduling platform, creating, running, scheduling, and monitoring millions of complex pipelines.
A full data warehouse infrastructure with ETL pipelines running inside docker on Apache Airflow for data orchestration, AWS Redshift for cloud data warehouse and Metabase to serve the needs of data visualizations such as analytical dashboards.
An open source, standard data file format for graph data storage and retrieval
This repo contains a dataset, exercises, and sample code for an end-to-end SAP BTP data-to-value bootcamp covering SAP HANA Cloud, SAP Data Warehouse Cloud, SAP Data Intelligence Cloud, and SAP Analytics Cloud.
Data-aware orchestration with dagster, dbt, and airbyte
Working with SCD Type (Change Data Capture) and need a Data Vault model to test Azure Data Factory v2? - This Code with Help!
A simple pipeline infrastructure with ETL pipeline contained in a Docker environment on Apache Airflow for orchestration and Postgres for data warehousing
Combine all four layers of analytics (descriptive, diagnostic, predictive, and prescriptive) on crypto data in a dashboard format as a web application.
Prefect - Data orchestration tool practice & learning
Add a description, image, and links to the data-orchestration topic page so that developers can more easily learn about it.
To associate your repository with the data-orchestration topic, visit your repo's landing page and select "manage topics."