gpt
Here are 120 public repositories matching this topic...
The Split class accepts a SplitDelimiterBehavior, which is really useful. Punctuation, however, always uses SplitDelimiterBehavior::Isolated (while Whitespace, on the other hand, behaves like SplitDelimiterBehavior::Removed).
impl PreTokenizer for Punctuation {
    fn pre_tokenize(&self, pretokenized: &mut PreTokenizedString) -> Result<()> {
        pretokenized.split(|_, s| s.split(is_punc, SplitDelimiterBehavior::Isolated))
    }
}
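To make the distinction concrete, here is a minimal pure-Python sketch of what the two behaviors mentioned above do when splitting on punctuation. The function name and the behavior strings are hypothetical; this only illustrates the semantics of the Rust enum variants, not the library's implementation.

```python
import re

PUNCT = r"([.,!?;:])"  # capturing group so re.split keeps the delimiters

def split_on_punctuation(text, behavior):
    """Illustrate SplitDelimiterBehavior semantics.

    behavior="isolated": each punctuation mark becomes its own piece.
    behavior="removed":  punctuation marks are dropped from the output.
    """
    pieces = [p for p in re.split(PUNCT, text) if p]
    if behavior == "removed":
        pieces = [p for p in pieces if not re.fullmatch(PUNCT, p)]
    return pieces

print(split_on_punctuation("Hello, world!", "isolated"))  # ['Hello', ',', ' world', '!']
print(split_on_punctuation("Hello, world!", "removed"))   # ['Hello', ' world']
```

Exposing the behavior as a parameter, as the issue suggests, would let Punctuation offer the same flexibility that Split already has.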
I'm playing around with this wonderful code, but I'm running into a curious issue when I try to train the model with my own data.
I replicated the personachat_self_original.json file structure and added my own data. I deleted the dataset_cache_OpenAIGPTTokenizer file, but when I try to train I get this error:
INFO:train.py:Pad inputs and convert to Tensor
Traceback (most recent call last):
huggingface/transformers#12276 introduced a new --log_level feature, which now allows users to set their desired log level via CLI or TrainingArguments. run_translation.py was used as a "model" for the other examples. Now we need to replicate this in all other Trainer-based examples under examples/pytorch/; the 3 changes are
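The core idea can be sketched with the standard library alone: parse a --log_level flag and map it onto Python's logging levels. This is an illustration of the mechanism, not transformers' actual implementation (which lives in TrainingArguments); the function and logger names here are hypothetical.

```python
import argparse
import logging

def get_logger(argv):
    # Hypothetical sketch: wire a --log_level CLI flag into the stdlib
    # logging module, the way the examples expose it to users.
    parser = argparse.ArgumentParser()
    parser.add_argument(
        "--log_level",
        default="info",
        choices=["debug", "info", "warning", "error", "critical"],
    )
    args = parser.parse_args(argv)
    logger = logging.getLogger("example")
    # Level names in the logging module are uppercase constants.
    logger.setLevel(getattr(logging, args.log_level.upper()))
    return logger

logger = get_logger(["--log_level", "debug"])
print(logger.level)  # 10, i.e. logging.DEBUG
```

Porting the change to the other Trainer-based scripts then amounts to repeating the same small wiring in each one.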