Logstash - transport and process your logs, events, or other data
-
Updated
Dec 12, 2023 - Java
Logstash - transport and process your logs, events, or other data
The open source high performance data integration platform built for developers.
Flow-based programming for JavaScript
This repository is a getting started guide to Singer.
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
Your single tool to express data, ML, and LLM pipelines with simple python functions. Runs anywhere that python runs, E.G. spark, airflow, jupyter, fastapi, etc. Incrementally adoptable. Use Hamilton to build testable, reusable, and self-documenting dataflows with lineage and metadata out of the box.
Making data lake work for time series
A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton
A simplified, lightweight ETL Framework based on Apache Spark
Extract, Transform, Load: Any SQL Database in 4 lines of Code.
SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).
Knowledge Graph Toolkit
Flow PHP - strongly typed data processing framework
A tool for building feature stores.
A lightweight ETL (extract, transform, load) library and data integration toolbox for .NET.
A modern data marketplace that makes collaboration among diverse users (like business, analysts and engineers) easier, increasing efficiency and agility in data projects on AWS.
Bender - Serverless ETL Framework
Configurable Extract, Transform, and Load
Add a description, image, and links to the etl-framework topic page so that developers can more easily learn about it.
To associate your repository with the etl-framework topic, visit your repo's landing page and select "manage topics."