Repositories
-
pulsar
Apache Pulsar - distributed pub-sub messaging system
-
superset
Apache Superset is a Data Visualization and Data Exploration Platform
-
-
arrow-datafusion
Apache Arrow DataFusion and Ballista query engines
-
hadoop-thirdparty
Apache Hadoop Thirdparty
-
-
comdev-site
Website sources for the Apache Community Development Website
-
-
shardingsphere
Distributed Database Ecosphere
-
dolphinscheduler
Apache DolphinScheduler is a distributed and extensible workflow scheduler platform with powerful DAG visual interfaces, dedicated to solving complex job dependencies in the data pipeline and providing various types of jobs available `out of the box`.
-
spark
Apache Spark - A unified analytics engine for large-scale data processing
-
gobblin
A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.
-
-
airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
-
-
-
submarine
Submarine is Cloud Native Machine Learning Platform.
-
-
echarts
Apache ECharts is a powerful, interactive charting and data visualization library for browser
-
incubator-sedona
A cluster computing framework for processing large-scale geospatial data
-
-
-
camel-k
Apache Camel K is a lightweight integration platform, born on Kubernetes, with serverless superpowers
-
tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators