data-engineering

Here are 860 public repositories matching this topic...

marvin-robot commented May 17, 2021

Opened from the Prefect Public Slack Community

elliot: Hey folks, is anyone else getting a deprecation warning on marshmallow? Something like:

...
/home/rof/.pyenv/versions/3.9.1/lib/python3.9/site-packages/marshmallow/fields.py:198
/home/rof/.pyenv/versions/3.9.1/lib/python3.9/site-packages/marshmallow/fields.py:198: RemovedInMarshmallow4Warning: Passing fi
gardnerdev commented Jan 16, 2021

Describe the bug
When running the scaffolding (profiling) command, it fails because of commas in the columns.

To Reproduce
Steps to reproduce the behavior:

  1. Run great_expectations suite scaffold scaffold-name on a datasource whose columns contain commas
  2. The command fails with pandas.errors.ParserError: Error tokenizing data. C error: Expected 1 fields in line 5323 saw 2
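
The ParserError quoted in step 2 is the kind pandas raises when a row contains more delimiter-separated fields than the rows before it. The sketch below reproduces that class of error with plain pandas on made-up in-memory data, then shows that quoting the field lets the same content parse. It is only an illustration under those assumptions: the sample strings are hypothetical, and it makes no claim about how great_expectations reads the datasource internally.

```python
import io

import pandas as pd

# Hypothetical one-column data; the last row has an unquoted comma,
# so the parser sees two fields where it expects one.
raw = "name\nalpha\nbeta, gamma\n"

try:
    pd.read_csv(io.StringIO(raw))
    error_message = None
except pd.errors.ParserError as exc:
    error_message = str(exc)

print(error_message)

# Same content with the comma-bearing field wrapped in double quotes.
quoted = 'name\nalpha\n"beta, gamma"\n'
df = pd.read_csv(io.StringIO(quoted))
print(df)
```

If the source file cannot be re-exported with quoting, read_csv also accepts a different sep, which may help when the real delimiter is not a comma.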

Expected behavior
D

A comprehensive list of 160+ YouTube Channels for Data Science, Data Engineering, Machine Learning, Deep learning, Computer Science, programming, software engineering, etc.

  • Updated Jun 22, 2021
anks7190 commented Jan 27, 2021

Hi,

I am using some basic functions from pyjanitor, such as clean_names() and collapse_levels(), in code that I want to productionise.
There are limits on the size of the production code base.
Currently, the requirements.txt for pyjanitor alone is huge.
I don't think my code requires all of those dependencies.
How can I remove the unnecessary ones?
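
One possible workaround, when only a couple of helpers are needed, is to write small pandas-only stand-ins instead of installing the full package. The sketch below is a rough approximation, not pyjanitor's implementation: the names simple_clean_names and simple_collapse_levels are hypothetical, and the cleaning rules (lowercase, non-alphanumeric runs become underscores) are an assumption about what "basic" use requires.

```python
import re

import pandas as pd


def simple_clean_names(df: pd.DataFrame) -> pd.DataFrame:
    """Return a copy with lowercase, underscore-separated column names."""
    def _clean(name):
        # Lowercase, then collapse every run of non-alphanumerics into "_".
        cleaned = re.sub(r"[^0-9a-z]+", "_", str(name).strip().lower())
        return cleaned.strip("_")
    return df.rename(columns=_clean)


def simple_collapse_levels(df: pd.DataFrame, sep: str = "_") -> pd.DataFrame:
    """Flatten MultiIndex columns into single strings joined by sep."""
    if not isinstance(df.columns, pd.MultiIndex):
        return df
    out = df.copy()
    # Skip empty level values so ("b", "") becomes "b" rather than "b_".
    out.columns = [
        sep.join(str(part) for part in col if str(part) != "")
        for col in df.columns
    ]
    return out


frame = pd.DataFrame({"First Name": [1], "Total ($)": [2]})
cleaned = simple_clean_names(frame)

multi = pd.DataFrame([[1, 2]], columns=pd.MultiIndex.from_tuples([("a", "x"), ("b", "")]))
flat = simple_collapse_levels(multi)
```

These stand-ins depend only on pandas and the standard library, at the cost of not matching pyjanitor's behaviour in edge cases.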
