nlp
go
natural-language-processing
language-detection
language-modeling
golang-library
text-processing
nlp-machine-learning
language-recognition
language-processing
language-identification
language-classification
Updated Dec 28, 2021 - Go
Is your feature request related to a problem? Please describe.
Because OSCAR relies on the fastText language classifier, which was trained on Wikipedia, its per-language datasets also contain sentences in other languages. For instance, the Tajik file (tg.txt) contains large chunks of Uzbek sentences in Cyrillic script.
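To make the contamination easier to spot, here is a rough sketch of a character-based check. It is only a heuristic and not part of OSCAR or fastText. It assumes that ў occurs in Uzbek Cyrillic but not in Tajik, and that ӣ, ӯ and ҷ occur in Tajik but not in Uzbek Cyrillic. The names `countAny` and `looksUzbek` and the two sample sentences are made up for illustration.

```go
package main

import (
	"fmt"
	"strings"
)

// Assumption of this sketch: these letters separate the two alphabets.
// ў is used in Uzbek Cyrillic only; ӣ, ӯ and ҷ are used in Tajik only.
// Letters shared by both (қ, ғ, ҳ) carry no signal and are ignored.
const (
	uzbekOnly = "ўЎ"
	tajikOnly = "ӣӢӯӮҷҶ"
)

// countAny returns how many runes of s occur in set.
func countAny(s, set string) int {
	n := 0
	for _, r := range s {
		if strings.ContainsRune(set, r) {
			n++
		}
	}
	return n
}

// looksUzbek reports whether a line has more Uzbek-only letters
// than Tajik-only letters. Lines with neither are reported as false.
func looksUzbek(line string) bool {
	return countAny(line, uzbekOnly) > countAny(line, tajikOnly)
}

func main() {
	// Illustrative sentences, not taken from tg.txt.
	lines := []string{
		"Ман китоб мехонам ва ҷавоб медиҳам.",
		"Мен китоб ўқияпман.",
	}
	for _, l := range lines {
		fmt.Printf("%t\t%s\n", looksUzbek(l), l)
	}
}
```

A check like this can only flag lines that happen to contain one of the distinguishing letters, so it would miss many sentences. It may still help estimate how much of tg.txt is affected before any model is retrained.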
Describe the solution you'd like
Train new models using other data othe