Repositories
-
dagli
Framework for defining machine learning models, including feature generation and transformations, as directed acyclic graphs (DAGs).
-
detext
DeText: A Deep Neural Text Understanding Framework for Ranking and Classification Tasks
-
ambry
Distributed object store
-
datahub
A Generalized Metadata Search & Discovery Tool
-
Hakawai
A powerful, extensible UITextView.
-
spark
Forked from apache/spark. Apache Spark - A unified analytics engine for large-scale data processing
-
cruise-control-ui
Cruise Control Frontend (CCFE): Single Page Web Application to Manage Kafka Clusters at Large Scale
-
TonY
TonY is a framework to natively run deep learning frameworks on Apache Hadoop.
-
rest.li
Rest.li is a REST+JSON framework for building robust, scalable service architectures using dynamic discovery and simple asynchronous APIs.
-
gdmix
A deep ranking personalization framework
-
cruise-control
Cruise-control is the first of its kind to fully automate the dynamic workload rebalance and self-healing of a Kafka cluster. It provides great value to Kafka users by simplifying the operation of Kafka clusters.
-
brooklin
An extensible distributed system for reliable nearline data streaming at scale
-
test-butler
Reliable Android Testing, at your service
-
kafka-monitor
Xinfra Monitor monitors the availability of Kafka clusters by producing synthetic workloads using end-to-end pipelines to obtain derived vital statistics - E2E latency, service produce/consume availability, offsets commit availability & latency, message loss rate and more.
-
transport
A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Apache Hive, and Presto.
-
coral
Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.
-
smart-arg
Smart Arguments Suite (smart-arg) is a slim and handy Python library that helps one work safely and conveniently with command line arguments.
-
Burrow
Kafka Consumer Lag Checking
-
avro-util
Collection of utilities to allow writing Java code that operates across a wide range of Avro versions.
-
oncall
Oncall is a calendar tool designed for scheduling and managing on-call shifts. It can be used as a source of dynamic ownership info for paging systems like http://iris.claims.
-
photon-ml
A scalable machine learning library on Apache Spark
-
iceberg
A temporary home for LinkedIn's changes to Apache Iceberg (incubating)
-
parseq
Asynchronous Java made easier
-
qark
Tool to look for several security-related Android application vulnerabilities