massive-datasets

Here are 14 public repositories matching this topic...

ApsaraDB / galaxysql

Star

PolarDB-X is a cloud native distributed SQL Database designed for high concurrency, massive storage, complex querying scenarios.

mysql distributed-transactions cloud-native high-availability relational-database high-concurrency massive-datasets htap horizontal-scaling enterprise-class

Updated Oct 21, 2021
Java

ApsaraDB / PolarDB-X

Star

PolarDB-X is a cloud native distributed SQL Database designed for high concurrency, massive storage, complex querying scenarios.

mysql distributed-transactions cloud-native high-availability relational-databases high-concurrency massive-datasets htap horizontal-scaling enterprise-class

Updated Oct 20, 2021

joshuaboud / gen-dataset

Star

Command line tool to quickly generate a lot of files in a lot of directories

linux benchmarking evaluation multithreading dataset dataset-generation massive-datasets cli-tool dataset-generator

Updated May 7, 2021
C++

manuparra / hadoop-statistics

Star

Calculate statistical measures of one column in big data Datasets with these simply Hadoop Application

java hadoop bigdata max avg min standardeviation massive-datasets

Updated Feb 24, 2017
Java

rajeshidumalla / node2vec

Star

Building node2vec algorithm

python data-science machine-learning numpy pandas data-analysis matplotlib massive-datasets node2vec networkx-graph

Updated Oct 7, 2021
Jupyter Notebook

gmalik9 / floating_point_data_compressor

Star

gipa -- compression/decompression tool to package compress and encode massive archive files with floating-point data

compression data-visualization autoencoder compressor data-compression representation representation-learning floating-point massive-datasets

Updated Sep 14, 2017
Python

rajeshidumalla / Bloom-Filter

Star

Building a Bloom Filter on English dictionary words

python data-science machine-learning bloom-filter data-analysis nltk-library massive-datasets

Updated Oct 7, 2021
Jupyter Notebook

miguel-kjh / Machine-Translation

Star

language translation massive-datasets

Updated Dec 11, 2020
Jupyter Notebook

diem-ai / google-bigquery

Star

Series of SQL exercise working with databases, using Google BigQuery to scale to massive datasets taught by educators in Kaggle.com

python bigquery sql analytics kaggle massive-datasets

Updated Jul 9, 2019
Jupyter Notebook

rajeshidumalla / Wordcount-in-Spark

Star

word count in Spark

python spark python-library pandas wordcount massive-datasets

Updated Oct 6, 2021
Jupyter Notebook

rajeshidumalla / PageRank

Star

Building PageRank algorithm on Web Graph around Stanford.edu using NetworkX python library

python data-science machine-learning spark numpy pagerank-algorithm pandas data-analysis massive-datasets networkx-library

Updated Oct 7, 2021
Jupyter Notebook

dhruv3 / MRbasedFriendRecommender

Star

Map Reduce program to suggest new friends based on count of mutual friends

java mapreduce datamining massive-datasets

Updated Mar 2, 2018
Java

pero5ar / FER.AVSP

Star

Lab assignments for the Analysis of Massive Data Sets course @ FER, University of Zagreb

homework data-analysis university-course massive-datasets fer laboratory-exercises

Updated Jun 30, 2018
C#

INFJakZda / Processing-Massive-Data-Sets

Star

University lab exercises with processing big data.

data-processing massive-datasets star-schema

Updated Nov 19, 2018
Python

Improve this page

Add a description, image, and links to the massive-datasets topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the massive-datasets topic, visit your repo's landing page and select "manage topics."

Learn more