language-model
Here are 653 public repositories matching this topic...
-
Updated
Oct 22, 2020
-
Updated
Feb 10, 2021 - Rust
chooses 15% of token
From paper, it mentioned
Instead, the training data generator chooses 15% of tokens at random, e.g., in the sentence my
dog is hairy it chooses hairy.
It means that 15% of token will be choose for sure.
From https://github.com/codertimo/BERT-pytorch/blob/master/bert_pytorch/dataset/dataset.py#L68,
for every single token, it has 15% of chance that go though the followup procedure.
PositionalEmbedding
-
Updated
Feb 15, 2021 - Python
-
Updated
Jul 28, 2020 - Python
-
Updated
Nov 11, 2020 - Python
-
Updated
May 7, 2020 - Python
-
Updated
Dec 9, 2020 - Python
-
Updated
Feb 5, 2021
-
Updated
Nov 6, 2020 - Python
Is your feature request related to a problem? Please describe.
To follow up on deepset-ai/haystack#664 ("Build simple Streamlit UI #664"): it would be great to have an option to upload files to the DocumentStore using the UI.
Describe the solution you'd like
Current UI allows the user to do a query against "some indexed Game of Thrones articles." The orig
-
Updated
Jan 14, 2021 - Python
-
Updated
Feb 7, 2019 - Python
-
Updated
Feb 15, 2021 - Python
-
Updated
Aug 5, 2020
-
Updated
Jan 1, 2019 - Python
-
Updated
Feb 15, 2021 - Python
-
Updated
Feb 15, 2021 - Go
-
Updated
Dec 13, 2020 - Python
-
Updated
Oct 29, 2020 - Python
-
Updated
Jan 12, 2021 - Python
-
Updated
Dec 14, 2020 - Python
-
Updated
Dec 18, 2017 - Python
-
Updated
Feb 15, 2021 - TeX
-
Updated
Feb 10, 2021 - Jupyter Notebook
-
Updated
Jan 19, 2021
-
Updated
Nov 15, 2018 - Jupyter Notebook
Improve this page
Add a description, image, and links to the language-model topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the language-model topic, visit your repo's landing page and select "manage topics."
If someone wants to solve a puzzle, this test:
works on its own, but fails if it's run in the group with other tests:
it doesn't learn anything - eval_blue remains 0.0
The only small issue is that the t