The Patterns of Scalable, Reliable, and Performant Large-Scale Systems
Updated Jun 14, 2019
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spar…
Python
Updated Apr 10, 2019
💫 Industrial-strength Natural Language Processing (NLP) with Python and Cython
#3822 opened 10 days ago by ophiry
1
#3698 opened about 1 month ago by Arakkun
2
#3702 opened about 1 month ago by hmswaffles
3
Python
Updated Jun 14, 2019
A realtime, decentralized, offline-first, mutable graph protocol to sync the web.
JavaScript
Updated Jun 14, 2019
The official home of the Presto distributed SQL query engine for big data
#12887 opened 15 days ago by wenleix
#12829 opened 29 days ago by highker
2
#12817 opened about 1 month ago by arhimondr
2
Java
Updated Jun 15, 2019
A tool for managing Apache Kafka.
Scala
Updated May 22, 2019
ClickHouse is a free analytic DBMS for big data.
#5335 opened 28 days ago by alexey-milovidov
#5310 opened about 1 month ago by alex-zaitsev
1
#5293 opened about 1 month ago by heyciao
2
C++
Updated Jun 15, 2019
Shell
Updated Jun 1, 2019
Alluxio, data orchestration for analytics and machine learning in the cloud
#8758 opened 2 months ago by apc999
Java
Updated Jun 15, 2019
The most widely used Python to C compiler
#2912 opened 2 months ago by MisterKeefe
1
#2886 opened 3 months ago by JonasT
3
#2876 opened 4 months ago by david-cortes
2
Python
Updated Jun 14, 2019
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, reg…
#858 opened 11 days ago by ancasarb
3
#833 opened 29 days ago by annaveronika
1
#825 opened about 1 month ago by cramen
1
C++
Updated Jun 15, 2019
Open Source Fast Scalable Machine Learning Platform For Smarter Applications: Deep Learning, Gradient Boosting & XGBo…
Java
Updated Jun 15, 2019
Stream Framework is a Python library, which allows you to build news feed, activity streams and notification systems …
Python
Updated Jun 13, 2019
Reproducible Data Science at Scale!
#3792 opened 10 days ago by robfras
2
#3776 opened 16 days ago by gabrielgrant
3
#3771 opened 18 days ago by pappasilenus
Go
Updated Jun 15, 2019
Moloch is an open source, large scale, full packet capturing, indexing, and database system.
#1061 opened about 1 month ago by 31453
JavaScript
Updated Jun 14, 2019
Open Source In-Memory Data Grid
Java
Updated Jun 14, 2019
BigDL: Distributed Deep Learning Library for Apache Spark
Scala
Updated Jun 14, 2019
Vespa is an engine for low-latency computation over large data sets.
#9552 opened 22 days ago by pinankg
#9158 opened about 2 months ago by lesters
#8258 opened 5 months ago by jobergum
2
Java
Updated Jun 15, 2019
Bare bone examples of machine learning in TensorFlow
Python
Updated Mar 14, 2017
An easy to use, self-service open BI reporting and BI dashboard platform.
TSQL
Updated Jun 10, 2019
A large-scale entity and relation database supporting aggregation of properties
#2152 opened about 2 months ago by m316257
#2066 opened 4 months ago by d47853
4
#1793 opened about 1 year ago by m55624
3
Java
Updated Jun 11, 2019
📊 📋 Dashboards using YAML or JSON files
JavaScript
Updated Jun 10, 2019
A search engine which can hold 100 trillion lines of log data.
Go
Updated May 22, 2017
MySQL performance monitoring and analysis.
Java
Updated Jan 9, 2019
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
Scala
Updated Jun 14, 2019
Distributed Big Data Orchestration Service
Java
Updated Jun 12, 2019
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Jupyter Notebook
Updated Sep 6, 2017
Datumbox is an open-source Machine Learning framework written in Java which allows the rapid development of Machine L…
Java
Updated Dec 26, 2018
⚡️ A vue component support big amount data list with high scroll performance.
JavaScript
Updated Jun 4, 2019
TrailDB is an efficient tool for storing and querying series of events
C
Updated Oct 31, 2018