Data integration platform for ELT pipelines from APIs, databases & files to warehouses & lakes.
-
Updated
Mar 22, 2023 - Python
Data integration platform for ELT pipelines from APIs, databases & files to warehouses & lakes.
An orchestration platform for the development, production, and observation of data assets.
Fancy stream processing made operationally mundane
The open source high performance data integration platform built for developers.
Build data pipelines, the easy way
Privacy and Security focused Segment-alternative, in Golang and React
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
Kestra is an infinitely scalable orchestration and scheduling platform, creating, running, scheduling, and monitoring millions of complex pipelines.
Python scripts for ETL (extract, transform and load) jobs for Ethereum blocks, transactions, ERC20 / ERC721 tokens, transfers, receipts, logs, contracts, internal transactions. Data is available in Google BigQuery https://goo.gl/oY5BCQ
A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow
Apache DevLake is an open-source dev data platform to ingest, analyze, and visualize the fragmented data from DevOps tools, extracting insights for engineering excellence, developer experience, and community growth.
Actively curated list of awesome BI tools. PRs welcome!
Data processing & ETL framework for Ruby
A Python stream processing engine modeled after Yahoo! Pipes
Sync data between persistence engines, like ETL only not stodgy
A lightweight stream processing library for Go
A curated list with resources about node-based UIs
Add a description, image, and links to the etl topic page so that developers can more easily learn about it.
To associate your repository with the etl topic, visit your repo's landing page and select "manage topics."