hdfs
Here are 258 public repositories matching this topic...
Divolte Collector
-
Updated
Aug 16, 2021 - Java
Quick start: pip install jsoniq ⛈️ RumbleDB 2.0.0 "Lemon Ironwood" 🌳 for Apache Spark | Run queries on your large-scale, messy datasets (JSON, text, CSV, Parquet, Delta...) | Data Lakehouse with Updates, Scripting, Declarative Machine Learning and more
-
Updated
Apr 4, 2026 - Java
A hybrid Big Data pipeline architecture that combines a real-time streaming layer with a batch layer to process large datasets(Lambda Architecture)
-
Updated
Mar 17, 2026 - Java
Exports Hadoop HDFS content statistics to Prometheus
-
Updated
Mar 8, 2026 - Java
DC/OS SDK is a collection of tools, libraries, and documentation for easy integration of technologies such as Kafka, Cassandra, HDFS, Spark, and TensorFlow with DC/OS.
-
Updated
Oct 16, 2024 - Java
HDFS Shell is a HDFS manipulation tool to work with functions integrated in Hadoop DFS
-
Updated
Sep 11, 2023 - Java
A tool for scale and performance testing of HDFS with a specific focus on the NameNode.
-
Updated
Jan 11, 2024 - Java
Kafka Connect FileSystem Connector
-
Updated
Nov 7, 2022 - Java
Euphoria is an open source Java API for creating unified big-data processing flows. It provides an engine independent programming model which can express both batch and stream transformations.
-
Updated
Nov 15, 2022 - Java
基于Hadoop的分布式云存储系统 🌴
-
Updated
Jan 10, 2025 - Java
A data layout optimization framework for wide tables stored on HDFS. See rainbow's webpage
-
Updated
Jun 19, 2018 - Java
Improve this page
Add a description, image, and links to the hdfs topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the hdfs topic, visit your repo's landing page and select "manage topics."