Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
MOA is an open source framework for Big Data stream mining. It includes a collection of machine learning algorithms (classification, regression, clustering, outlier detection, concept drift detection and recommender systems) and tools for evaluation.
C++ LINQ-like library of higher-order functions for data manipulation
Estimating the k-mer coverage histogram of genomics data
t-digest module for Redis
Performant implementations of various streaming algorithms, including Count–min sketch, top-k, HyperLogLog, and reservoir sampling.
Dynatrace hash library for Java
Federated Principal Component Analysis Revisited!
A Set of Streaming Algorithms in C++, Python, and Go
Streaming, Memory-Limited, r-truncated SVD Revisited!
An online statistics library, written in Go
This is the codebase for Faucet, described in our manuscript: https://academic.oup.com/bioinformatics/article/34/1/147/4004871, by Roye Rozov, Gil Goldshlager, Eran Halperin, and Ron Shamir
RiverText is a framework that standardizes the incremental word embeddings proposed in the state of the art. Please feel welcome to open an issue if you have any questions, or a pull request if you want to contribute to the project!
Efficient Sequential and Batch Estimation of Univariate and Bivariate Probability Density Functions and Cumulative Distribution Functions along with Quantiles (Univariate) and Spearman's Correlation (Bivariate)
A simple, time-tested family of random hash functions in Python, based on CRC32 and xxHash, affine transformations, and the Mersenne Twister.
Create MPEG2-TS encapsulated stream-segments.
This repository contains all the assignment solutions, starter files, and other materials related to this specialization.
Simulates an HTTP Adaptive Streaming (HAS) session based on a throughput pattern and video segment sizes.
DynoGraph benchmark suite, implemented using the STINGER graph engine