Repositories
-
pulsar
Apache Pulsar - distributed pub-sub messaging system
-
arrow
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication…
-
ozone
Scalable, redundant, and distributed object store for Apache Hadoop
-
lucene-solr
Apache Lucene and Solr open-source search software
-
beam
Apache Beam is a unified programming model for Batch and Streaming
-
airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
-
lucene-solr-operator
Apache Lucene and Solr open-source search software
-
-
camel
Apache Camel is an open source integration framework that empowers you to quickly and easily integrate various systems consuming or producing data.
-
-
guacamole-server
Mirror of Apache Guacamole Server
-
-
superset
Apache Superset is a Data Visualization and Data Exploration Platform
-
incubator-mxnet
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
-
-
-
-
incubator-gobblin
A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.
-
tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
-
cassandra-website
Apache cassandra