Skip to content
#

data-centric

Here are 31 public repositories matching this topic...

Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, with more integrations coming..

  • Updated Jun 17, 2023
  • Rust

The toolkit to test, validate, and evaluate your models and surface, curate, and prioritize the most valuable data for labeling to supercharge model performance.

  • Updated Jun 18, 2023
  • Python

Improve this page

Add a description, image, and links to the data-centric topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the data-centric topic, visit your repo's landing page and select "manage topics."

Learn more