Skip to content
#

data

Here are 2,119 public repositories matching this topic...

flyte
glom
mahmoud
mahmoud commented May 9, 2018

Right now the tutorial is coherently designed, tested, and even documented. However, it doesn't build up in a way that's very beginner friendly. It establishes glom's value and then immediately uses it at an intermediate level.

I'd like it if it was a bit more drawn out to use basic features first and then add a multi-line Coalesce as the

help wanted good first issue
colour
anks7190
anks7190 commented Jan 27, 2021

Hi ,

I am using some basic functions from pyjanitor such as - clean_names() , collapse_levels() in one of my code which I want to productionise.
And there are limitations on the size of the production code base.
Currently ,if I just look at the requirements.txt for just "pyjanitor" , its huge .
I don't think I require all the dependencies in my code.
How can I remove the unnecessary ones ?

help wanted good first issue available for hacking infrastructure
naterush
naterush commented Mar 17, 2022

Describe the bug

Columns move order in the export formatting section. That is obviously not intended. Given that it's one of our main pro features, we should def fix it up!

To Reproduce

  1. Go pro.
  2. Go to Excel export. Add a few columns to the formatting.
  3. Change the columns selected using their selects. Watch them switch order.

Expected behavior

They shouldn't ch

good first issue type: mitosheet effort: 3 priority: high
pdpipe
yarkhinephyo
yarkhinephyo commented Nov 28, 2021

For pipeline stages provided by the pdpipe.basic_stages, supplying conditions to the prec and post keyword arguments may not return the correct error messages.

Example Code

import pandas as pd; import pdpipe as pdp;
df = pd.DataFrame([[1,4],[4,5],[1,11]], [1,2,3], ['a','b'])
pline = pdp.PdPipeline([
  pdp.FreqDrop(2, 'a', prec=pdp.cond.HasAllColumns(['x']))
])
pline.apply(
tlento
tlento commented Apr 16, 2022

Describe the bug
We have validation methods that return objects containing Optional collections of issues and things of that nature. In particular, the ModelBuildResult has an optional "issues" tuple which gets populated with a tuple of validation errors whenever validation runs. Making this optional leads to callsite shenanigans like this:

errors = ModelValidator.validate_model.issue
bug good first issue

Improve this page

Add a description, image, and links to the data topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the data topic, visit your repo's landing page and select "manage topics."

Learn more