apache-spark

Here are 935 public repositories matching this topic...

PhilipMay commented Jun 12, 2020

MLflow seems to have a length limit of 5000 when setting tags (see below).

[...]
  File "/home/smay/miniconda3/envs/py38/lib/python3.8/site-packages/mlflow/utils/validation.py", line 136, in _validate_length_limit
    raise MlflowException(
mlflow.exceptions.MlflowException: Tag value '[0.8562690322984875, 0.8544098885636596, 0.8544098885636596, 0.8544098885636596, 0.85440988856365
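One possible workaround for the limit described above is to shorten the value before it is logged as a tag. The following is only a minimal sketch: `truncate_tag_value`, `MAX_TAG_LENGTH`, and the `"scores"` tag key are hypothetical names, and the value 5000 is taken from the comment above rather than from the MLflow source, so it may differ between versions.

```python
# Limit as reported in the comment above; an assumption, not read from MLflow itself.
MAX_TAG_LENGTH = 5000


def truncate_tag_value(value, limit=MAX_TAG_LENGTH, marker="..."):
    """Return str(value), cut to at most `limit` characters.

    A trailing marker signals that the value was shortened.
    """
    text = str(value)
    if len(text) <= limit:
        return text
    return text[: limit - len(marker)] + marker


# Example: a long list of scores, similar in shape to the one in the traceback.
scores = [0.8562690322984875] * 1000
tag_value = truncate_tag_value(scores)

# With MLflow installed and a run active, the shortened value could then be logged:
# import mlflow
# mlflow.set_tag("scores", tag_value)
```

Truncation loses data, so for long lists it may be more appropriate to log the values as metrics or as an artifact and keep only a short summary in the tag.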
thrixton commented Jul 13, 2020

This is more a question than a feature request.

When parsing JSON files, I need to sanitize the field names so that, for example, "field with spaces" becomes "field_with_spaces".
I also want to preserve the original name as metadata about the column, if you like :)

There is a metadata field on StructField, but it is internal.
Why is it internal? Would it be possible or desirable to expose it?
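The renaming described above could be sketched as follows. This is an illustration under assumptions, not an answer from the Spark maintainers: `sanitize_field_name`, `plan_renames`, and the `"original_name"` metadata key are hypothetical, and the commented PySpark lines assume a version in which `Column.alias` accepts a `metadata` keyword.

```python
import re


def sanitize_field_name(name):
    """Replace each run of non-word characters (such as spaces) with one underscore."""
    return re.sub(r"\W+", "_", name.strip())


def plan_renames(field_names):
    """Return (original, sanitized, metadata) triples, one per field.

    The metadata dict keeps the original name so it can travel with the column.
    """
    return [
        (original, sanitize_field_name(original), {"original_name": original})
        for original in field_names
    ]


plan = plan_renames(["field with spaces", "id"])

# With PySpark available, the plan might be applied like this (untested sketch):
# from pyspark.sql import functions as F
# df = df.select(
#     [F.col(f"`{orig}`").alias(new, metadata=meta) for orig, new, meta in plan]
# )
# df.schema["field_with_spaces"].metadata  # expected to hold the original name
```

Keeping the name mapping as plain data makes it easy to inspect or store separately, whether or not column metadata turns out to be usable in a given Spark version.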
