Here are
96 public repositories
matching this topic...
Semi-official Apache CouchDB Docker images
Updated
Jun 30, 2023
Shell
Apache Flink shaded artifacts repository
Updated
May 8, 2023
Shell
大数据组件 All-in-One 的 Dockerfile
Updated
Feb 1, 2021
Shell
A Hadoop cluster based on Docker, including Hive and Spark.
Updated
Nov 13, 2022
Shell
Alpine Linux based Kafka Docker Image
Updated
Dec 7, 2022
Shell
A Spark cluster setup running on Docker containers
Updated
Dec 26, 2019
Shell
Official Dockerfile for Apache Spark
Updated
Jun 29, 2023
Shell
Updated
Jul 11, 2023
Shell
Apache CouchDB Packaging support files
Updated
May 26, 2023
Shell
Ambari service for Azkaban
Updated
Aug 29, 2021
Shell
A Project where one can fetch and read tweets and show the analysis like who is most influential
Updated
Apr 21, 2022
Shell
Apache Arrow Ballista Python bindings
Updated
Jul 10, 2023
Shell
A complete (distributed) BigData stack, running in containers
Updated
Mar 22, 2017
Shell
Life-cycle: Internal working of HDFS, SQOOP, HIVE, SPARK, HBASE, KAFKA with code.
Updated
Sep 10, 2019
Shell
Setup hadoop in linux for big data analysis
Updated
Oct 18, 2022
Shell
Containerized version of Panoptes for testing and experimentation.
Updated
May 1, 2023
Shell
A Cassandra Architecture for GDELT Database 🌍
Updated
Mar 7, 2019
Shell
A simple portable file cataloging tool for bash
Updated
Jun 14, 2022
Shell
Updated
May 24, 2023
Shell
📚 Open source documentation written during @droxey 's tenure as an instructor at UC Berkeley. No copyrighted in-class materials are provided here.
Updated
May 18, 2018
Shell
Improve this page
Add a description, image, and links to the
big-data
topic page so that developers can more easily learn about it.
Curate this topic
Add this topic to your repo
To associate your repository with the
big-data
topic, visit your repo's landing page and select "manage topics."
Learn more
You can’t perform that action at this time.