# tokenize
Here are 61 public repositories matching this topic...
Developer friendly Natural Language Processing ✨
visualization
nlp
natural-language-processing
sentiment-analysis
pattern-matching
chatbot
vectorizer
ner
wink
hacktoberfest
pos-tagging
tokenize
bm25
sentence-boundary-detection
word-vectors
sbd
named-entity-extraction
negation-handling
custom-entity-detection
wink-nlp
Updated Apr 3, 2022 - JavaScript
A pythonic wrapper for Stanford CoreNLP.
nlp
parser
wrapper
natural-language-processing
sentiment-analysis
named-entity-recognition
stanford
stanford-corenlp
dependency-parser
lemmatizer
part-of-speech-tagger
tokenize
corenlp
coreference-resolution
core-nlp
ssplit
Updated Jun 29, 2018 - Python
NLP Functions for amplifying negations, managing elisions, creating ngrams, stems, phonetic codes to tokens and more.
nlp
natural-language-processing
bag-of-words
ngrams
stem
tokenize
sentence-boundary-detection
stop-words
phonetize
Updated Jan 31, 2022 - JavaScript
Tokenize2 is a plugin that lets your users select multiple items from a predefined list or an Ajax source, using autocompletion as they type to find each item. You may have seen a similar type of text entry when filling in the recipients field of a message on Facebook or adding tags on Tumblr.
Updated Feb 22, 2022 - JavaScript
mdast utility to parse markdown
Updated Apr 1, 2022 - JavaScript
Example scripts that showcase how to use Private AI Text to de-identify, redact, hash, tokenize, mask, and synthesize PII in text.
redaction
hipaa
deidentification
tokenize
dlp
gdpr
anonymization
masking
privacy-tools
synthetic-data
data-anonymization
data-loss-prevention
redact
de-identification
synthetic-dataset-generation
de-identify
data-masking
synthetic-data-generator
text-anonymization
cpra
Updated Dec 21, 2021 - Python
Extract JavaScript code comments from a string or glob of files.
Updated Nov 24, 2018 - JavaScript
Lexers, tokenizers, parsers, compilers, renderers, stringifiers... What's the difference, and how do they work?
Updated Apr 26, 2017
Uses babel to extract JavaScript code comments from a string. Returns an array of comment objects, with line, column, index, comment type and comment string.
Updated May 22, 2018 - JavaScript
Korean text data preprocessing toolkit for NLP
Updated Jun 11, 2019 - Python
Uses snapdragon to tokenize a single JavaScript block comment into an object, with description, tags, and code example sections that can be passed to any other comment parsers for further parsing.
nodejs
javascript
gfm
node
parse
jsdoc
js
code
comment
javadoc
tokenize
code-comment
jonschlinkert
comment-parser
parse-comments
Updated Nov 26, 2018 - JavaScript
A token-based HTML document parser and minifier written in PHP. Extract attribute values and text using CSS selectors.
svg
html
php
html5
minify
tokenizer
html-parser
html-dom-parser
minification
simplehtmldom
tokenize
minify-html
Updated Apr 8, 2022 - PHP
Implementation of the transformer NN block for machine translation, text classification, natural language inference, and machine reading comprehension models.
text-classification
transformer
jieba
natural-language-inference
tokenize
nmt-model
sentence-entailment
Updated Feb 10, 2022 - Python
More detailed documentation for the Python tokenize module
Updated Mar 5, 2021
A Python toolkit to generate a tokenized dump of Wikipedia for NLP
Updated Oct 12, 2020 - Python
Transforms tokens into original source code (while preserving whitespace)
Updated May 24, 2019 - Python
Python 3 module to tokenize English sentences.
Updated Apr 24, 2019 - Python
NFTSwaps is a cross-chain and permissionless platform to tokenize NFTs and make them tradable on AMMs such as PancakeSwap or BakerySwap through the NFTSwaps UI.
Updated Jan 24, 2022 - JavaScript
Sentiment analysis for Amazon product reviews using NLTK, Scikit-Learn, and Keras. Using hyperparameter search and an LSTM, our best model achieves ~96% accuracy.
machine-learning
natural-language-processing
sentiment-analysis
scikit-learn
keras
lstm
nltk
bag-of-words
binary-classification
tokenize
sequence-models
hyperparameter-search
Updated Sep 24, 2019 - Python
A PHP library to extract n-grams from text. Simple preprocessing tools (cleaning, tokenizing) are included.
nlp
php
natural-language-processing
php7
php-library
tokenizer
ngram
ngrams
tokenize
tokenization
ngram-analysis
tokenized-sentences
Updated Dec 5, 2017 - PHP
Simple regex for correcting punctuation
Updated Apr 28, 2018 - Python