A full pipeline AutoML tool for tabular data
-
Updated
Feb 1, 2023 - Python
A full pipeline AutoML tool for tabular data
AGATHA: Automatic Graph-mining And Transformer based Hypothesis generation Approach
Unified Distributed Execution
Evaluation Tool for Anomaly Detection Algorithms on Time Series
Parallel Lammps Python interface - control a mpi4py parallel LAMMPS instance from a serial python process or an Jupyter notebook
Test LightGBM's Dask integration on different cluster types
A Python library for creating, fitting, and applying predictive data modeling pipelines.
Code for "Training models when data doesn't fit in memory" post
Python library to query and transform genomic data from indexed files
Open Data Profiling, Quality and Analysis on NYC OpenData dataset with semantic profiling using fuzzy ratio, Levenshtein distance and regex
Procurement: Dask Cluster as a Process.
Perform I/O intensive workloads on high-volume data sparsely located across multiple AWS regions through the use of Dask.
Scalable Cytometry Image Processing (SCIP) is an open-source tool that implements an image processing pipeline on top of Dask, a distributed computing framework written in Python. SCIP performs projection, illumination correction, image segmentation and masking, and feature extraction.
Launch a Dask cluster from a Poetry environment
Preserve all necessary runtime data of a Dask client in order to "replay" and analyze the performance and behavior of the client after the fact
Fraud detection ML pipeline and serving POC using Dask and hopeit.engine. Project created with nbdev: https://www.fast.ai/2019/12/02/nbdev/
HPC cluster deployment and management for the Hetzner Cloud
Magic commands to support running MPI python code as well as multi-node Dask workloads on Jupyter notebooks.
Python 3 tools for distributed analysis and visualisation of big climate data on HPC systems.
Parallelized pipeline in Python for analyzing and cleaning data scraped from the online catalogue of major Italian book publishers with limited computational resources.
Add a description, image, and links to the dask-distributed topic page so that developers can more easily learn about it.
To associate your repository with the dask-distributed topic, visit your repo's landing page and select "manage topics."