OpenRefine is a free, open source power tool for working with messy data and improving it
-
Updated
Jul 15, 2023 - Java
Data science is an inter-disciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.
OpenRefine is a free, open source power tool for working with messy data and improving it
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Statistical Machine Intelligence & Learning Engine
Java dataframe and visualization library
Datumbox is an open-source Machine Learning framework written in Java which allows the rapid development of Machine Learning and Statistical applications.
Hopsworks - Data-Intensive AI platform with a Feature Store
First open-source data discovery and observability platform. We make a life for data practitioners easy so you can focus on your business.
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
ELKI Data Mining Toolkit
The premier open source Data Quality solution
Categorical Query Language IDE
Ultimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction
Blockchain2graph extracts blockchain data (bitcoin) and insert them into a graph database (neo4j).
Know your data better!Datavines is Next-gen Data Observability Platform, support metadata manage and data quality.
A Java Toolbox for Scalable Probabilistic Machine Learning
Una introduccion al analisis de datos con R y R Studio
A point-and-click tool for creating and analyzing topic models produced by MALLET.
A tool of detecting anomaly points from data