[Bug/Question] Write With Transformers Implementation vs. Custom Implementation
#7273 opened Sep 20, 2020 by krrishdholakia
Changing learning rate for BertModelforTokenClassification
#7264 opened Sep 20, 2020 by YojanaGadiya
When I updated my transformers to the latest, the previously trained model loaded with an error
#7262 opened Sep 20, 2020 by wulaoshi (0 of 4 tasks)
LXMERT visual feature extraction during training/fine-tuning phase
#7261 opened Sep 20, 2020 by mmiakashs
very poor performance of Longformer on SQuAD-like question-answering tasks
#7249 opened Sep 19, 2020 by xixiaoyao (2 of 4 tasks)
[example/glue] run_glue compute metrics fail for bart like models
#7247 opened Sep 19, 2020 by patil-suraj
How to get cross attention weights of decoder when using 'encoderdecodermodel'
#7246 opened Sep 19, 2020 by kimmo1019
trainer.evaluate() aggregates predictions on GPU and causes CUDA out of memory issues for large datasets
#7232 opened Sep 18, 2020 by eugeneware (2 of 4 tasks)
a possible hack for FSMT's SinusoidalPositionalEmbedding peculiarity
#7229 opened Sep 18, 2020 by stas00