-
Updated
Jul 31, 2020 - Python
#
asr
Here are 299 public repositories matching this topic...
alexa
ai
amazon-echo
muse
tts
google-home
unit
bci
speaker
homeassistant
snowboy
asr
anyq
raspeberry-pi
Lingvo
nlp
research
translation
tensorflow
machine-translation
speech
distributed
tts
speech-synthesis
mnist
speech-recognition
lm
seq2seq
speech-to-text
gpu-computing
language-model
asr
-
Updated
Aug 29, 2020 - Python
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
deep-neural-networks
deep-learning
speech
dnn
pytorch
recurrent-neural-networks
lstm
gru
speech-recognition
rnn
kaldi
rnn-model
asr
lstm-neural-networks
multilayer-perceptron-network
timit
dnn-hmm
-
Updated
Jun 11, 2020 - Python
DELTA is a deep learning based natural language and speech processing platform.
nlp
front-end
ops
deep-learning
text-classification
tensorflow
nlu
speech
inference
text-generation
speech-recognition
seq2seq
sequence-to-sequence
speaker-verification
asr
tensorflow-serving
emotion-recognition
custom-ops
serving
tensorflow-lite
-
Updated
Aug 28, 2020 - Python
A Chinese Deep Speech Recognition System 包括基于深度学习的声学模型和基于深度学习的语言模型
-
Updated
Mar 26, 2019 - Python
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
-
Updated
Aug 29, 2020 - Python
The official repository of the Eesen project
-
Updated
May 23, 2019 - C++
A Python wrapper for Kaldi
python
wrapper
numpy
speech
feature-extraction
speech-recognition
kaldi
language-model
asr
openfst
clif
-
Updated
Aug 17, 2020 - Python
SincNet is a neural architecture for efficiently processing raw audio samples.
audio
python
deep-learning
signal-processing
waveform
cnn
pytorch
artificial-intelligence
speech-recognition
neural-networks
convolutional-neural-networks
digital-signal-processing
filtering
speaker-recognition
speaker-verification
speech-processing
audio-processing
asr
timit
speaker-identification
-
Updated
Dec 23, 2019 - Python
Open STT
-
Updated
Aug 18, 2020 - Python
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
-
Updated
May 7, 2020 - Python
Sequence-to-Sequence Framework in PyTorch
deep-learning
cnn
pytorch
speech-recognition
seq2seq
neural-machine-translation
nmt
multimodality
asr
-
Updated
Aug 25, 2020 - Jupyter Notebook
On-device streaming speech-to-text engine powered by deep learning
android
python
c
raspberry-pi
iot
ios
machine-learning
arm
deep-learning
offline
webassembly
voice-recognition
speech-recognition
speech-to-text
stt
asr
-
Updated
Apr 28, 2020 - Python
Open tools and data for cloudless automatic speech recognition
-
Updated
Aug 28, 2020 - Python
End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)
tensorflow
end-to-end
speech-recognition
beam-search
automatic-speech-recognition
speech-to-text
attention-mechanism
asr
timit-dataset
ctc
timit
end-to-end-learning
csj
librispeech
joint-ctc-attention
-
Updated
Jan 23, 2018 - Python
End-to-end ASR/LM implementation with PyTorch
streaming
speech
language-modeling
pytorch
transformer
speech-recognition
seq2seq
attention
automatic-speech-recognition
sequence-to-sequence
language-model
attention-mechanism
asr
ctc
rnn-transducer
transformer-xl
-
Updated
Aug 23, 2020 - Python
Dockerfile for kaldi-gstreamer-server.
-
Updated
Jul 19, 2020 - Dockerfile
an open-source implementation of sequence-to-sequence based speech processing engine
deployment
tensorflow
tts
transformer
speech-recognition
sequence-to-sequence
unsupervised-learning
speaker-recognition
asr
ctc
wfst
-
Updated
Aug 28, 2020 - Python
A Keras CTC implementation of Baidu's DeepSpeech for model experimentation
machine-learning
deep-learning
neural-network
keras
nn
speech
neural-networks
baidu
deeplearning
speech-to-text
asr
ctc
speechrecognition
coreml
deepspeech
-
Updated
Mar 17, 2018 - Python
Kaldi-based Korean ASR (한국어 음성인식) open-source project
open-source
speech-recognition
lexicon
audio-data
korean
kaldi
language-model
data-augmentation
asr
tdnn
fastcampus
zeroth
-
Updated
Jul 28, 2019 - Shell
SenYan1999
commented
Aug 18, 2020
I am a newcomer to the audio field. I have some questions when use this project to generate the audio embedding for my multimodality model (text and audio)
I want to use Mockingjay, and run `python preprocess_any.py --feature_type=mel' but get 80 dim features, I just simply change num_mel in utility/audio.py from 80 to 160(I see this model need 160dim mel features in README), is it right?
Th
Python module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).
-
Updated
Aug 23, 2020 - Python
python
speech-recognition
nlp-library
asr
pypi-packages
nlp-tool
arabic-numbers
arabic-numerals
chinese-numbers
python-tools
chinese-numerals
-
Updated
Aug 20, 2020 - Python
Chinese text normalization for speech processing
-
Updated
Jun 9, 2020 - Python
An opensource speech-to-text software written in tensorflow
-
Updated
Jun 12, 2019 - Python
使用FreeSWITCH接受用户手机呼叫,通过UniMRCP Server集成讯飞开放平台(xfyun)插件将用户语音进行语音识别(ASR),并根据自定义业务逻辑调用语音合成(TTS),构建简单的端到端语音呼叫中心。
-
Updated
Dec 27, 2018
Improve this page
Add a description, image, and links to the asr topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the asr topic, visit your repo's landing page and select "manage topics."
One can use https://github.com/s-yata/marisa-trie to save a lot of space for symbols.