Here are
276 public repositories
matching this topic...
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀
-
Updated
Jan 23, 2023
-
Python
Python clone of Spark, a MapReduce alike framework in Python
-
Updated
Dec 25, 2020
-
Python
🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
-
Updated
Jan 16, 2023
-
Python
Google, Naver multiprocess image web crawler (Selenium)
-
Updated
Jul 25, 2022
-
Python
学习记录的一些笔记,以及所看得一些电子书eBooks、视频资源和平常收纳的一些自己认为比较好的博客、网站、工具。涉及大数据几大组件、Python机器学习和数据分析、Linux、操作系统、算法、网络等
-
Updated
Dec 8, 2022
-
Python
Datafaker is a large-scale test data and flow test data generation tool. Datafaker fakes data and inserts to varied data sources. 测试数据生成工具
-
Updated
Aug 14, 2021
-
Python
ROOT I/O in pure Python and NumPy.
-
Updated
Feb 19, 2021
-
Python
A serverless architecture for orchestrating ETL jobs in arbitrarily-complex workflows using AWS Step Functions and AWS Lambda.
-
Updated
Feb 9, 2022
-
Python
Simple-IT-English: smart wordbook from community for community
-
Updated
Dec 13, 2019
-
Python
A collection of pentest tools and resources targeting Hadoop environments
-
Updated
Sep 9, 2021
-
Python
-
Updated
Nov 4, 2020
-
Python
AthenaCLI is a CLI tool for AWS Athena service that can do auto-completion and syntax highlighting.
-
Updated
May 16, 2022
-
Python
ROOT I/O in pure Python and NumPy.
-
Updated
Jan 9, 2023
-
Python
Scalable Bloom Filter implemented in Python
-
Updated
Oct 11, 2022
-
Python
O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian
-
Updated
Jan 12, 2023
-
Python
🌲 Configuration flaws detector for Hadoop, MongoDB, MySQL, and more!
-
Updated
Jun 21, 2020
-
Python
Educational notes,Hands on problems w/ solutions for hadoop ecosystem
-
Updated
Jan 22, 2019
-
Python
Apache Spark 3 - Structured Streaming Course Material
-
Updated
Sep 26, 2020
-
Python
-
Updated
Nov 7, 2022
-
Python
Amas is recursive acronym for “Amas, monitor alert system”.
-
Updated
Apr 8, 2018
-
Python
Improve this page
Add a description, image, and links to the
bigdata
topic page so that developers can more easily learn about it.
Curate this topic
Add this topic to your repo
To associate your repository with the
bigdata
topic, visit your repo's landing page and select "manage topics."
Learn more
You can’t perform that action at this time.
You signed in with another tab or window. Reload to refresh your session.
You signed out in another tab or window. Reload to refresh your session.