nlg

Description

While using tokenizers.create with the model and vocab file for a custom corpus, the code throws an error and is not able to generate the BERT vocab file

Error Message

ValueError: Mismatch vocabulary! All special tokens specified must be control tokens in the sentencepiece vocabulary.

To Reproduce

from gluonnlp.data import tokenizers
tokenizers.create('spm', model_p

nlg

Here are 195 public repositories matching this topic...

spro / practical-pytorch

dmlc / gluon-nlp

[Error Message] Improve error message in SentencepieceTokenizer when arguments are not expected.

Description

Error Message

To Reproduce

Use official MXNet batchify to implement the batchify functions

NMT Inference: Chunk overlength sequences and translate in sequence

Maluuba / nlg-eval

charlesXu86 / Chatbot_CN

MiuLab / TC-Bot

rodrigopivi / Chatito

simplenlg / simplenlg

patil-suraj / question_generation

accelerated-text / accelerated-text

wyu97 / KENLG-Reading

santhoshkolloju / Abstractive-Summarization-With-Transfer-Learning

semiosis / pen.el

accelerated-text / awesome-nlg

yongzhuo / nlg-yongzhuo

SimGus / Chatette

coteries / cedille-ai

gyunggyung / NLP-Papers

AMontgomerie / question_generator

CZWin32768 / XNLG

cdjhz / multigen

devjwsong / gpt2-dialogue-generation-pytorch

agaralabs / transformer-drg-style-transfer

naver / gdc

KaijuML / data-to-text-hierarchical

BSlience / xbot

google / abstracttext

MiuLab / DuaLUG

Eulring / Text-Generation-Papers

spro / nalgene

Yngie-C / JasoAI

Improve this page

Add this topic to your repo