The easiest way to orchestrate and observe your data pipelines
-
Updated
Apr 21, 2023 - Python
DataOps is an automated, process-oriented methodology, used by analytic and data teams, to improve the quality and reduce the cycle time of data analytics. While DataOps began as a set of best practices, it has now matured to become a new and independent approach to data analytics. DataOps applies to the entire data lifecycle from data preparation to reporting, and recognizes the interconnected nature of the data analytics team and information technology operations.
The easiest way to orchestrate and observe your data pipelines
Fancy stream processing made operationally mundane
Collect, aggregate, and visualize a data ecosystem's metadata
Polyglot workflows without leaving the comfort of your technology stack.
funsies is a lightweight workflow engine
Open source data infrastructure platform. Designed for developers, built for speed.
An end-to-end data lineage tool, detects table dependencies from SQL statements.
Manage Redshift/Postgres privileges in GitOps style written in Rust
Data analytics library for Python and suite of open source, command line based data ops tools.
Efficient streaming data ingestion, transformation & activation
A next-generation open source orchestration platform for the development, production, and observation of data assets.
ysv: clean and transform CSV data along your rules encoded in YAML format, lightning fast
Raccogliamo qui tutti i link alle risorse menzionate durante i nostri QShare
Repositorch is a VCS repository analysis engine written in C#.
Turns Data and AI algorithms into full web apps in no time.
simple data platform to study which country indicators are relevant for olympics sports outcome