Grow your team on GitHub
GitHub is home to over 50 million developers working together. Join them to grow your own development teams, manage permissions, and collaborate on projects.
Sign upRepositories
hadoop-ozone
Scalable, redundant, and distributed object store for Apache Hadoop
trafficserver
Apache Traffic Server™ is a fast, scalable and extensible HTTP/1.1 and HTTP/2 compliant caching proxy server.
hudi
Upserts, Deletes And Incremental Processing on Big Data.
arrow
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication…
camel
Apache Camel is an open source integration framework that empowers you to quickly and easily integrate various systems consuming or producing data.
airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
shardingsphere-benchmark
Apache shardingsphere
incubator-tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
spark
Apache Spark - A unified analytics engine for large-scale data processing
beam
Apache Beam is a unified programming model for Batch and Streaming
incubator-dolphinscheduler
Dolphin Scheduler is a distributed and easy-to-extend visual workflow scheduling platform, dedicated to solving the complex dependencies in data processing, making the scheduling system out of the box for data processing.(分布式易扩展的可视化工作流任务调度)
incubator-yunikorn-k8shim
Apache YuniKorn K8shim
pulsar
Apache Pulsar - distributed pub-sub messaging system
camel-kafka-connector
Camel Kafka Connector allows you to use all Camel components as Kafka Connect connectors