data-cleaning

Currently the X argument of CleanLearning.fit() does not seem to support non-array data.
Perhaps this is due to the sklearn function check_X_y() called inside CleanLearning, which we could replace.
Or perhaps it's due to how the cross-validation is currently being implemented.

However, both are easy to change, which would remove the restriction that only array data is supported.
Seems e
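
As a rough illustration of the cross-validation point above, the sketch below splits data by row position only, so the data never has to be converted to an array. It is not cleanlab's implementation: the helper names `kfold_indices` and `take_rows` are hypothetical, and the `.iloc` check is an assumption about how pandas-like objects would be handled.

```python
from typing import Any, List, Sequence, Tuple


def kfold_indices(n_samples: int, n_splits: int) -> List[Tuple[List[int], List[int]]]:
    """Return (train_idx, holdout_idx) pairs for contiguous k-fold splits."""
    if not 2 <= n_splits <= n_samples:
        raise ValueError("n_splits must be between 2 and n_samples")
    # Spread the remainder over the first folds so sizes differ by at most one.
    base, extra = divmod(n_samples, n_splits)
    splits = []
    start = 0
    for fold in range(n_splits):
        size = base + (1 if fold < extra else 0)
        holdout = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n_samples))
        splits.append((train, holdout))
        start += size
    return splits


def take_rows(X: Any, indices: Sequence[int]) -> Any:
    """Subset X by row position without converting it to an array."""
    if hasattr(X, "iloc"):
        # Assumption: pandas-like objects expose positional indexing via .iloc.
        return X.iloc[list(indices)]
    # Fall back to plain indexing, which covers lists of text, dicts, and so on.
    return [X[i] for i in indices]


# Usage with non-array data: a plain list of strings.
texts = ["a", "b", "c", "d", "e"]
splits = kfold_indices(len(texts), n_splits=2)
train_idx, holdout_idx = splits[0]
train_texts = take_rows(texts, train_idx)
holdout_texts = take_rows(texts, holdout_idx)
```

Because only integer positions are passed around, the same splitting logic would apply to any container that supports positional lookup.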

Describe the bug
pa.errors.SchemaErrors.failure_cases only returns the first 10 failure_cases

I have checked that this issue has not already been reported.
I have confirmed this bug exists on the latest version of pandera (0.6.5).
(optional) I have confirmed this bug exists on the master branch of pandera.

Note: Please read [this guide](https://matthewrocklin.c

A note from Uwe Ligges of CRAN:

For the future: Is there some reference about the method you can add in the Description field in the form Authors (year) doi:.....?

I don't know about DOIs. Anyone have a thought on this? Is it only appropriate for packages associated with a research paper?

Write unit test coverage for SafeDataset and SafeDataLoader, along with the functions in utils.py.

Context

At the moment, both the Data type and the Column header are links, although they are color-coded differently; this causes some confusion.

Idea

Move the Data type switching menu into the Column Edit menu:
which would mean there

Here are 1,244 public repositories matching this topic...

johnkerl / miller

cleanlab / cleanlab

justmarkham / pandas-videos

justmarkham / DAT8

pandera-dev / pandera

sfirke / janitor

data-forge / data-forge-ts

schema-inspector / schema-inspector

dirty-cat / dirty_cat

msamogh / nonechucks

data-cleaning / validate

jim-schwoebel / voicebook

akanz1 / klib

rasgointelligence / feature-engineering-tutorials

probcomp / PClean

ekstroem / dataMaid

ajaymache / data-analysis-using-python

hi-primus / bumblebee

iam-mhaseeb / Skytrax-Data-Warehouse

ChrisMuir / refinr

jim-schwoebel / allie

HoloClean / HoloClean-Legacy-deprecated

LoLei / redditcleaner

akvo / akvo-lumen

msberends / clean

scottythered / gratefuldata

sharmaroshan / Drugs-Recommendation-using-Reviews

ropensci / taxa

dssg / pgdedupe

ammsa / DTCleaner
