Here are 17 public repositories matching this topic...
Unsupervised Word Segmentation for Neural Machine Translation and Text Generation
Updated May 22, 2020 · Python
Unsupervised text tokenizer focused on computational efficiency
Explains NLP building blocks in a simple manner.
Updated Sep 23, 2019 · Jupyter Notebook
Byte Pair Encoding for Python!
Updated Jan 21, 2020 · Python
Fast and customizable text tokenization library with BPE and SentencePiece support
Subword Encoding in Lattice LSTM for Chinese Word Segmentation
Updated Apr 25, 2019 · Python
Machine Learning for Phishing Website Detection
Subword-augmented Embedding for Cloze Reading Comprehension (COLING 2018)
Updated Nov 6, 2018 · Python
R package for Byte Pair Encoding based on YouTokenToMe
High performance unsupervised text tokenization for Ruby
Updated Apr 23, 2020 · Ruby
Learning BPE embeddings by first learning a segmentation model and then training word2vec
Updated Mar 25, 2020 · Python
Updated Feb 25, 2019 · Python
Low-resource language machine translation (az, be, tr -> en).
Updated Nov 10, 2018 · Python
A Python package that builds a corpus vocabulary using the byte pair methodology and provides a tokenizer that tokenizes input texts based on the built vocabulary.
Updated May 21, 2020 · Python
Central repository with pretrained models for transfer learning, BPE subword-tokenization, mono/multilingual embeddings, and everything in between.
Updated Jun 2, 2019 · Python
💡 RPG server for SA:MP Brasil
Generating new titles for movie posters using a combination of image features and pre-trained subword embeddings
Updated Jun 5, 2020 · Jupyter Notebook