data-engineering
Here are 860 public repositories matching this topic...
Opened from the Prefect Public Slack Community
elliot: Hey folks is anyone else getting a deprecation warning on marshmallow? Something like:
...
/home/rof/.pyenv/versions/3.9.1/lib/python3.9/site-packages/marshmallow/fields.py:198
/home/rof/.pyenv/versions/3.9.1/lib/python3.9/site-packages/marshmallow/fields.py:198: RemovedInMarshmallow4Warning: Passing fi
Describe the bug
Running the scaffolding (profiling) command fails when column values contain commas.
To Reproduce
Steps to reproduce the behavior:
- Run `great_expectations suite scaffold scaffold-name` on a datasource where commas are in a column
- Bug: `pandas.errors.ParserError: Error tokenizing data. C error: Expected 1 fields in line 5323 saw 2`
Expected behavior
D
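One plausible mechanism behind an "Expected 1 fields ... saw 2" error is an unquoted comma inside a value, which the tokenizer reads as a field separator. The sketch below illustrates that with Python's standard `csv` module rather than pandas or Great Expectations; the sample data and the `field_counts` helper are hypothetical and are not taken from the report.

```python
import csv
import io


def field_counts(text):
    """Return the number of fields the CSV reader sees on each line."""
    return [len(row) for row in csv.reader(io.StringIO(text))]


# Hypothetical single-column data: the last value contains a comma.
unquoted = "name\nAlice\nSmith, Bob\n"
# Same data, with the comma-bearing value wrapped in double quotes.
quoted = 'name\nAlice\n"Smith, Bob"\n'

print(field_counts(unquoted))  # the last line splits into two fields
print(field_counts(quoted))    # every line stays a single field
```

If this is indeed the cause, quoting the affected values when the file is written (or choosing a delimiter that does not occur in the data) keeps the field count consistent across lines.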
If they are not class methods, the method is invoked for every test and a session is created for each of those tests.
`class PySparkTest(unittest.TestCase):
    @classmethod
    def suppress_py4j_logging(cls):
        logger = logging.getLogger('py4j')
        logger.setLevel(logging.WARN)

    @classmethod
    def create_testing_pyspark_session(cls):
        return Sp
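The difference between per-class and per-test setup can be checked without Spark. The following is a minimal sketch using only `unittest`; `FakeSession` is a hypothetical stand-in for an expensive object such as a Spark session, and `count_sessions` is a helper introduced here purely for illustration.

```python
import io
import unittest


class FakeSession:
    """Hypothetical stand-in for an expensive resource (e.g. a Spark session)."""

    created = 0  # how many instances have been constructed

    def __init__(self):
        FakeSession.created += 1


class PerClassSessionTest(unittest.TestCase):
    @classmethod
    def setUpClass(cls):
        # Class-level fixture: runs once for the whole test class.
        cls.session = FakeSession()

    def test_one(self):
        self.assertIsNotNone(self.session)

    def test_two(self):
        self.assertIsNotNone(self.session)


class PerTestSessionTest(unittest.TestCase):
    def setUp(self):
        # Instance-level fixture: runs before every test method.
        self.session = FakeSession()

    def test_one(self):
        self.assertIsNotNone(self.session)

    def test_two(self):
        self.assertIsNotNone(self.session)


def count_sessions(test_case):
    """Run one TestCase class quietly and report how many sessions it built."""
    FakeSession.created = 0
    suite = unittest.defaultTestLoader.loadTestsFromTestCase(test_case)
    unittest.TextTestRunner(stream=io.StringIO()).run(suite)
    return FakeSession.created


if __name__ == "__main__":
    print(count_sessions(PerClassSessionTest))  # 1
    print(count_sessions(PerTestSessionTest))   # 2
```

With two test methods each, the class-level fixture builds one session while the per-test fixture builds two, which is the cost the comment above is describing.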
Hi,
I am using some basic functions from pyjanitor, such as clean_names() and collapse_levels(), in code that I want to productionise.
There are limits on the size of the production code base.
Currently, if I look at the requirements.txt for "pyjanitor" alone, it is huge.
I don't think I require all the dependencies in my code.
How can I remove the unnecessary ones?


Screenshot
Description
The chart's three-dot menu is hidden behind the chart title panel in chart maximize mode.