apache / spark
Apache Spark - A unified analytics engine for large-scale data processing
See what the GitHub community is most excited about today.
Chisel 3: A Modern Hardware Design Language
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
Spark RAPIDS plugin - accelerate Apache Spark with GPUs
Microsoft Machine Learning for Apache Spark
♞ lichess.org: the forever free, adless and open source chess server ♞
CMAK is a tool for managing Apache Kafka clusters
State of the Art Natural Language Processing
The Scala 3 compiler, also known as Dotty.
sbt Native Packager
Rocket Chip Generator
A Spark plugin for reading Excel files via Apache POI
A STAC/OGC API Features Web Service
A fault tolerant, protocol-agnostic RPC system
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Declarative, type-safe web endpoints library
Build highly concurrent, distributed, and resilient message-driven applications on the JVM
Scientific workflow engine designed for simplicity & scalability. Trivially transition from one-off use cases to massive-scale production environments
A minimal, idiomatic Scala interface for HTTP
DataStax Spark Cassandra Connector
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
ZIO — A type-safe, composable library for async and concurrent programming in Scala
The enterprise-grade behavioral data engine (web, mobile, server-side, webhooks), running cloud-natively on AWS and GCP
Bitcoin Implementation in Scala