data-engineering
Here are 550 public repositories matching this topic...
AWS Batch Task
Created by Dylan Hughes via monday.com integration.
Describe the bug
Using a data source with umlauts in the column names causes the Jupyter Notebook used to edit the suite to throw an error on startup. The notebook then fails to load.
This might be a Jupyter Notebook bug; I am not sure.
To Reproduce
Steps to reproduce the behavior:
- Initialize a suite with this xlsx file as a data source (nothing fancy: Just two columns, the
Problem description
When I concatenate multiple columns with concatenate_columns, null values are not handled as expected.
This is the current output:
df.concatenate_columns(["cat_1","cat_2","cat_3"],"cat",sep=",")
| cat_1 | cat_2 |
|---|
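The expected output is cut off in the report above, so the following is only a sketch of one plausible expectation: that null values are skipped when the columns are joined. It uses plain pandas rather than `concatenate_columns`, and the helper name `concat_skip_nulls` and the sample data are assumptions made for illustration.

```python
import pandas as pd


def concat_skip_nulls(df, column_names, new_column_name, sep=","):
    """Hypothetical helper: join the non-null values of the given columns row by row."""
    joined = df[column_names].apply(
        # Keep only the values that are not null, then join them with the separator.
        lambda row: sep.join(str(v) for v in row if pd.notna(v)),
        axis=1,
    )
    return df.assign(**{new_column_name: joined})


# Sample data (assumed, not taken from the report).
df = pd.DataFrame(
    {
        "cat_1": ["a", "b", None],
        "cat_2": ["c", None, None],
        "cat_3": ["d", "e", None],
    }
)
result = concat_skip_nulls(df, ["cat_1", "cat_2", "cat_3"], "cat", sep=",")
print(result["cat"].tolist())
```

With this approach a row whose values are all null yields an empty string; whether that or a null is the desired result is a design choice the report does not settle.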
If they are not class methods, the method is invoked for every test and a session is created for each of those tests.
class PySparkTest(unittest.TestCase):
    @classmethod
    def suppress_py4j_logging(cls):
        logger = logging.getLogger('py4j')
        logger.setLevel(logging.WARN)

    @classmethod
    def create_testing_pyspark_session(cls):
        return Sp
SubjectAreaRESTServicesInstance hard-codes the default page size as 0, which is not correct:
public static final String PAGE_SIZE_DEFAULT_VALUE = "0";
It should be changed to
public static final String PAGE_SIZE_DEFAULT_VALUE = "1000";
so that it is consistent with the OMAGServerConfig default:
private static final int defaultMaxPageSize = 1000;
Pivot missing categories breaks FeatureSet/AggregatedFeatureSet
Summary
When defining a feature set, the pivot is expected to produce all categories, so that the resulting Source dataframe is suitable to be transformed. When some categories are missing, FeatureSet and AggregatedFeatureSet break.
Feature related:
Age: legacy
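The failure mode can be sketched in pandas, which is used here only as an illustration and is not the library from the report. The data, the column names, and the `expected_categories` list are all assumptions. A pivot only produces columns for categories present in the data, so one possible remedy is to reindex the result against the full list of expected categories.

```python
import pandas as pd

# Hypothetical full list of categories the downstream transform expects.
expected_categories = ["a", "b", "c"]

# Sample data in which category "c" never occurs (assumed for illustration).
df = pd.DataFrame(
    {
        "id": [1, 1, 2],
        "category": ["a", "b", "a"],
        "value": [10, 20, 30],
    }
)

# The pivot yields columns only for the categories seen in the data: "a" and "b".
pivoted = df.pivot_table(index="id", columns="category", values="value", aggfunc="sum")

# Reindexing adds the missing "c" column, filled with NaN.
pivoted = pivoted.reindex(columns=expected_categories)
print(pivoted)
```

After the reindex, the set of columns no longer depends on which categories happen to appear in a given batch of data.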
A clear and concise description of what the bug is.
Expected results
The cursor will move to the left
Actual results
New Tab is created
Screenshots
If applicable, add screenshots to help explain your problem.
How to reproduce the bug