This is now the official location of the Kaldi project.
#
speech
Repositories 425
JavaScript
Updated Mar 10, 2019
Code examples for new APIs of iOS 10.
ios
ios10
swift-3
swift-4
speech
metal
cnn
image-recognition
convolutional-neural-networks
demo
metal-performance-shaders
metal-cnn
uiviewpropertyanimator
Swift
Updated Sep 15, 2018
A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model
Python
Updated Mar 19, 2018
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
Lingvo
speech-recognition
translation
speech-to-text
machine-translation
mnist
seq2seq
language-model
tts
asr
lm
nlp
tensorflow
speech
research
distributed
gpu-computing
speech-synthesis
Python
Updated Mar 22, 2019
WaveNet vocoder
Python
Updated Dec 30, 2018
Python library and CLI tool to interface with Google Translate's text-to-speech API
Python
Updated Feb 20, 2019
Open-Source Large Vocabulary Continuous Speech Recognition Engine
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is…
speech-recognition
gru
dnn
kaldi
rnn-model
pytorch
timit
deep-learning
deep-neural-networks
recurrent-neural-networks
multilayer-perceptron-network
lstm
lstm-neural-networks
speech
asr
rnn
dnn-hmm
Perl
Updated Mar 20, 2019
Free, easy, portable audio engine for games
The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to pro…
Java
Updated Nov 25, 2018
speech
speech-recognition
speech-to-text
voice-control
stt
node
hotword-detection
keyword-spotting
alexa
voice-recognition
JavaScript
Updated Mar 8, 2019
simple audio I/O for pytorch
Python
Updated Mar 20, 2019
Speech Enhancement Generative Adversarial Network in TensorFlow
speech
gan
tensorflow
deep-learning
deep-neural-networks
generative-model
generative-adversarial-networks
Python
Updated Aug 18, 2018
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
Python
Updated Jun 7, 2018
AAC communication board with text-to-speech for the browser
Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.
HTML
Updated Nov 27, 2018
Praat: Doing Phonetics By Computer
C
Updated Mar 21, 2019
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly rec…
vad
dnn
lstm
bdnn
acam
attention
speech
data
voice-detection
speech-recognition
voice-activity-detection
speech-activity-detection
MATLAB
Updated Feb 10, 2019
A Python wrapper for Kaldi
python
wrapper
kaldi
openfst
asr
speech-recognition
speech
language-model
feature-extraction
clif
numpy
Python
Updated Mar 12, 2019
Node.js client for Google Cloud Speech: Speech to text conversion powered by machine learning.
JavaScript
Updated Mar 21, 2019
A neural network for end-to-end speech denoising
machine-learning
deep-learning
neural-networks
speech-denoising
speech
wavenet
end-to-end
speech-processing
Python
Updated Jun 5, 2018
Android speech recognition and text to speech made easy
Java
Updated Feb 2, 2019
Python interface to CMU Sphinxbase and Pocketsphinx libraries
Python
Updated Jun 3, 2018
Gender recognition by voice and speech analysis
gender-recognition
gender
machine-learning
data-science
artificial-intelligence
neural-network
logistic-regression
vocal
voice
speech
acoustic-properties
signal
ai
R
Updated Jul 3, 2018
A Keras CTC implementation of Baidu's DeepSpeech for model experimentation
keras
deepspeech
asr
ctc
coreml
speechrecognition
speech-to-text
deep-learning
machine-learning
neural-networks
baidu
speech
deeplearning
neural-network
nn
Python
Updated Mar 17, 2018
SDK & Sample to do speech recognition using websockets in Javascript
microsoft
speech
speechtotext
sdk
javascript
typescript
ts
js
browser
websocket
cognitive-services
speech-recognition
websockets
microsoft-speech-service
recognition
speechrecognition
TypeScript
Updated Feb 25, 2019
A python wrapper for Speech Signal Processing Toolkit (SPTK).
Python
Updated Feb 16, 2019
Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.
C#
Updated Mar 16, 2019