data-processing

🚨🚨 Feature Request

If your feature will improve `HUB`

To explore the structure of a dataset it is convenient to have nicer and more informative prints of dataset objects and samples

Description of the possible solution

1) show ds

now

> ds
Dataset(path='hub://activeloop/abalone_full_dataset', tensors=['length', 'diameter', 'height', 'weight'])

Hi, I was wondering, is DALI currently compatible with tensorcom? And if so, are there any examples showing how to use DALI with a tensorcom data server, for example, like this?

Describe the bug
pa.errors.SchemaErrors.failure_cases only returns the first 10 failure_cases

I have checked that this issue has not already been reported.
I have confirmed this bug exists on the latest version of pandera. 0.6.5
(optional) I have confirmed this bug exists on the master branch of pandera.

Note: Please read [this guide](https://matthewrocklin.c

(1) Add docstrings to methods
(2) Covert .format() methods to f strings for readability
(3) Make sure we are using Python 3.8 throughout
(4) zip extract_all() in ingest_flights.py can be simplified with a Path parameter

setting pretrained_model_name will not only define the model arch but also load the pre-trained checkpoint. We should have another hparam to control whether to load pre-trained checkpoint or not.

Hello Benito,

For a specific task I need a "bitwise exclusive or"-function, but I realized xidel doesn't have one. So I created a function for that.

I was wondering if, in addition to the EXPath File Module, you'd be interested in integrating the EXPath Binary Module as well. Then I can use bin:xor() instead (although for

Write unit test coverage for SafeDataset and SafeDataLoader, along with the functions in utils.py.

The exception in subject is thrown by the following code:

from datetime import date
from pysparkling.sql.session import SparkSession
from pysparkling.sql.functions import collect_set

spark = SparkSession.Builder().getOrCreate()

dataset_usage = [
    ('steven', 'UUID1', date(2019, 7, 22)),
]
dataset_usage_schema = 'id: string, datauid: string, access_date: date'

df = spa

data-processing

Here are 597 public repositories matching this topic...

johnkerl / miller

lorien / awesome-web-scraping

activeloopai / Hub

🚨🚨 Feature Request

If your feature will improve HUB

Description of the possible solution

1) show ds

NVIDIA / DALI

TomWright / dasel

asyml / texar

dashbitco / broadway

onceupon / Bash-Oneliner

microsoft / DialoGPT

python-bonobo / bonobo

pandera-dev / pandera

GoogleCloudPlatform / data-science-on-gcp

GoogleCloudPlatform / DataflowJavaSDK

asyml / texar-pytorch

infoslack / awesome-kafka

benibela / xidel

polyaxon / traceml

constellation-rs / amadeus

kousun12 / eternal

msamogh / nonechucks

SebKrantz / collapse

alttch / rapidtables

maykulkarni / Machine-Learning-Notebooks

Yord / pxi

streamnative / pulsar-flink

svenkreiss / pysparkling

PytLab / VASPy

lithops-cloud / lithops

matousc89 / padasip

iTechArt / convtools-ita

Improve this page

Add this topic to your repo

If your feature will improve `HUB`