apache / spark
Apache Spark - A unified analytics engine for large-scale data processing
See what the GitHub community is most excited about today.
Apache Spark - A unified analytics engine for large-scale data processing
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
Build highly concurrent, distributed, and resilient message-driven applications on the JVM
Chisel 3: A Modern Hardware Design Language
CMAK is a tool for managing Apache Kafka clusters
Modern Load Testing as Code
Cortex: a Powerful Observable Analysis and Active Response Engine
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Apache Kyuubi is a distributed multi-tenant JDBC server for large-scale data processing and analytics, built on top of Apache Spark
ZIO — A type-safe, composable library for async and concurrent programming in Scala
Mirror of Apache livy (Incubating)
A data access library for Scala + Postgres.
A Scala API for Apache Beam and Google Cloud Dataflow.
Alpakka is a Reactive Enterprise Integration library for Java and Scala, based on Reactive Streams and Akka.
Scientific workflow engine designed for simplicity & scalability. Trivially transition between one off use cases to massive scale production environments
Rocket Chip Generator
Spark RAPIDS plugin - accelerate Apache Spark with GPUs
Repository for chisel3 testers2 open alpha
Redshift data source for Apache Spark
Greek symbols plugin for IntelliJ IDEA
Connectors for Delta Lake
Code, exercises, answers, and hints to go along with the book "Functional Programming in Scala"
Hybrid visual and textual functional programming.
Scala Language Integrated Connection Kit. Slick is a modern database query and access library for Scala
Microsoft Machine Learning for Apache Spark