Skip to content
#

data-engineering

Here are 751 public repositories matching this topic...

superset
kvnkho
kvnkho commented Apr 9, 2021

Current behavior

LocalStorage saves flows by default in a local that looks like Users/username. LocalStorage and LocalRun work well together because Prefect will able to find the Flow in that default location. The issue is when we combine LocalStorage with other RunConfigs. Users will get an error that looks like `Failed to load and execute Flow's environment: ModuleNotFoundError("No module

gardnerdev
gardnerdev commented Jan 16, 2021

Describe the bug
When trying to run scaffolding (profiling) command, it fails because of commas in columns.

To Reproduce
Steps to reproduce the behavior:

  1. Run great_expectations suite scaffold scaffold-name on datasource where commas are in column
  2. Bug pandas.errors.ParserError: Error tokenizing data. C error: Expected 1 fields in line 5323 saw 2

Expected behavior
D

anks7190
anks7190 commented Jan 27, 2021

Hi ,

I am using some basic functions from pyjanitor such as - clean_names() , collapse_levels() in one of my code which I want to productionise.
And there are limitations on the size of the production code base.
Currently ,if I just look at the requirements.txt for just "pyjanitor" , its huge .
I don't think I require all the dependencies in my code.
How can I remove the unnecessary ones ?

mandy-chessell
mandy-chessell commented Feb 9, 2021

Egeria's open metadata labs use python notebooks to drive sequences of REST API calls to Egeria's runtime platform called the OMAG Server Platform. There is one function called printAssetUniverse that needs work. This function is designed to provide a data scientist with detailed information about an Asset (such as a file or a database). This includes name, description, its location, content,

Improve this page

Add a description, image, and links to the data-engineering topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the data-engineering topic, visit your repo's landing page and select "manage topics."

Learn more