Here are
117 public repositories
matching this topic...
A language detection library for PHP. Detects the language from a given text string.
A simple/fast/accurate accent prediction for non-accented Vietnamese text
Updated
Oct 20, 2017
Java
Exercises, data, and more for our 2017 summer workshop (funded by the Estes Fund and in partnership with Project Jupyter and Berkeley's D-Lab)
Updated
Mar 21, 2019
HTML
Updated
Mar 20, 2017
Python
Part-of-Speech Tagging Models in Python
Updated
Oct 7, 2019
Python
🍰 A library for creating n-grams, skip-grams, bag of words, bag of n-grams, bag of skip-grams.
Term 1 Project 3 Design a Sign Language Recognition System by Luke Schoen for Udacity Artificial Intelligence Nanodegree (AIND)
Updated
May 2, 2017
Jupyter Notebook
Sentiment Analysis of Twitter Data (saotd)
Package to detect the language of a given text (focusing on short "sms" type text used on tweets, facebook, WhatsApp, etc)
Updated
Sep 20, 2018
JavaScript
Data from a corpus of written Hawaiian
New spell(1) implementation for NetBSD
KiloGram algorithm for finding the top-k most frequent n-grams for large values of n quickly with fixed memory.
Updated
Oct 13, 2020
Java
Classifier that identifies Greek text as Cypriot Greek or Standard Modern Greek
Updated
Oct 4, 2019
Jupyter Notebook
Predict the composition year of a given MIDI piece - Classical Music Hack Day 2013 @ Vienna. Live at:
Updated
Oct 16, 2013
JavaScript
Updated
Jul 27, 2018
Python
Code written as a part of assignments for CSE556 Natural Language Processing taught by Dr. Tanmoy Chakraborty at IIIT Delhi in Monsoon 2018
Updated
May 10, 2019
Jupyter Notebook
A web service that exposes semantic similarity search via a web GUI and a RESTful API.
Updated
Dec 9, 2018
Python
Implementation of language model for parallel n-gram extraction from large text corpora
🏰 Mapping British place names and other analysis
Updated
Nov 4, 2017
Python
AdWords Script:-Find Your Best And Worst Search Queries Using N-Grams Python Version
Updated
Jul 4, 2018
Python
Device detection for the web by the device's User-Agent string as defined in the HTTP/1.1
Use different orders of N-gram model to play Hangman game.
Updated
Jun 26, 2021
Python
N-Gram language model that learns n-gram probabilities from a given corpus and generates new sentences from it based on the conditional probabilities from the generated words and phrases.
Updated
Feb 8, 2018
Python
English language article analyzer using some NLP techniques
Updated
Mar 22, 2016
Java
Language detection using n-gram model for PyCon PL'2020 lightning talk
Updated
Feb 18, 2021
Jupyter Notebook
Springboard Foundations of Data Science - Capstone Project Repository
Generates random text in the style of a given corpus
Updated
May 10, 2019
Python
Model Generator for Firestore
Improve this page
Add a description, image, and links to the
n-grams
topic page so that developers can more easily learn about it.
Curate this topic
Add this topic to your repo
To associate your repository with the
n-grams
topic, visit your repo's landing page and select "manage topics."
Learn more
You can’t perform that action at this time.
You signed in with another tab or window. Reload to refresh your session.
You signed out in another tab or window. Reload to refresh your session.
Need to implement a smarter method of tokenization which takes into account languages that traditionally does not use spaces between words (currently resulting in full-sentence tokens not suitable for the current method of cosine similarity comparisons).
Some of these languages include: