evaluation

Describe the bug

Streaming Datasets can't be pickled, so any interaction between them and multiprocessing results in a crash.
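The failure mode can be illustrated without the datasets library. The sketch below is a simplified stand-in, not the actual implementation: `LazyStream` and `try_pickle` are hypothetical names introduced here, and the class only assumes that a streaming dataset holds some live, lazily evaluated state (modelled as a Python generator). Multiprocessing typically hands objects to worker processes by pickling them, so an object that fails to pickle fails before any worker code runs.

```python
import pickle


class LazyStream:
    """Hypothetical stand-in for a streaming dataset (not the datasets API)."""

    def __init__(self, n):
        # A live generator object; Python generators cannot be pickled.
        self._items = (i * i for i in range(n))

    def __iter__(self):
        return self._items


def try_pickle(obj):
    """Return None if obj pickles cleanly, otherwise the exception raised."""
    try:
        pickle.dumps(obj)
    except (TypeError, AttributeError, pickle.PicklingError) as exc:
        return exc
    return None


# The lazy object fails to serialize, while an in-memory list does not.
stream_error = try_pickle(LazyStream(3))
list_error = try_pickle([0, 1, 4])
print(type(stream_error).__name__, list_error)
```

Checking picklability up front like this gives a clearer error than a crash inside a worker process, although it does not by itself make the dataset usable with multiple workers.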

Steps to reproduce the bug

import transformers
from transformers import Trainer, AutoModelForCausalLM, TrainingArguments
import datasets

ds = datasets.load_dataset('oscar', "unshuffled_deduplicated_en", split='train', streaming=True).with_format("

Description

Currently, when a challenge link from EvalAI is shared, users see a generic view of the EvalAI homepage. We want the details specific to that challenge to be shown when its link is shared. Here's how it looks currently

Expected behavior:

T

Discussed in ContinualAI/avalanche#900

Originally posted by sivomke January 30, 2022
Hi, everyone!

I have created a benchmark with dataset_benchmark that contains 3 experiences, and then have added a validation set to this benchmark with benchmark_with_validation_stream. I am trying to use Early St

Here are 692 public repositories matching this topic...

huggingface / datasets

Streaming Datasets don't work with Transformers Trainer when dataloader_num_workers>1

mrgloom / awesome-semantic-segmentation

sdiehl / write-you-a-haskell

viebel / klipse

zzw922cn / Automatic_Speech_Recognition

Knetic / govaluate

MichaelGrupp / evo

Cloud-CV / EvalAI

Add metadata for challenges when a challenge link is shared

Use custom email template for reset password feature

Add documentation for challenge hosts on how to make a submission a baseline submission

xinshuoweng / AB3DMOT

sepandhaghighi / pycm

Maluuba / nlg-eval

abo-abo / lispy

ContinualAI / avalanche

Using validation set for early stopping

Examples in the documentation

Make some packages optional

EthicalML / xai

google / fuzzbench

bochinski / iou-tracker

dbolya / tide

tecnickcom / tcexam

PaesslerAG / gval

PRBonn / semantic-kitti-api

caserec / CaseRecommender

toshas / torch-fidelity

votchallenge / toolkit-legacy

zzzprojects / Eval-Expression.NET

jkkummerfeld / text2sql-data

davidstutz / superpixel-benchmark

StrangerZhang / pysot-toolkit

codingseb / ExpressionEvaluator

CBLUEbenchmark / CBLUE

danthedeckie / simpleeval
