#

speech

Here are 872 public repositories matching this topic...

kaldi-asr / kaldi

Star

kaldi-asr/kaldi is the official location of the Kaldi project.

shell c-plus-plus cuda speech speech-recognition speech-to-text kaldi speaker-verification speaker-id

Updated Apr 14, 2021
Shell

TalAter / annyang

Star

💬 Speech recognition for your site

demo gui tutorial voice speech speech-recognition speech-to-text hacktoberfest

Updated Mar 26, 2021
JavaScript

PaddlePaddle / models

Star

Pre-trained and Reproduced Deep Learning Models （『飞桨』官方模型库，包含多种学术前沿和工业场景验证的深度学习模型）

natural-language-processing computer-vision deep-learning neural-network speech recommendation paddlepaddle

Updated Apr 15, 2021
Python

mozilla / TTS

Star

🤖

💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

python text-to-speech deep-learning speech pytorch tts vocoder tacotron tensorflow2 tacotron2 melgan speaker-encoder dataset-analysis glow-tts multiband-melgan gantts

Updated Apr 13, 2021
Jupyter Notebook

shu223 / iOS-10-Sampler

Sponsor Star

Code examples for new APIs of iOS 10.

ios demo metal speech cnn swift-3 image-recognition convolutional-neural-networks ios10 uiviewpropertyanimator swift-4 metal-performance-shaders metal-cnn

Updated Apr 1, 2020
Swift

tensorflow / lingvo

Star

Lingvo

nlp research translation tensorflow machine-translation speech distributed tts speech-synthesis mnist speech-recognition lm seq2seq speech-to-text gpu-computing language-model asr

Updated Apr 15, 2021
Python

pytorch-kaldi

mravanelli / pytorch-kaldi

Star

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

deep-neural-networks deep-learning speech dnn pytorch recurrent-neural-networks lstm gru speech-recognition rnn kaldi rnn-model asr lstm-neural-networks multilayer-perceptron-network timit dnn-hmm

Updated Mar 15, 2021
Python

readbeyond / aeneas

Star

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)

Updated Dec 28, 2020
Python

r9y9 / wavenet_vocoder

Sponsor Star

WaveNet vocoder

python speech pytorch speech-synthesis wavenet speech-processing wavenet-vocoder neural-vocoder

Updated Nov 2, 2020
Python

Kyubyong / tacotron

Star

A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model

tensorflow speech tts speech-synthesis-model

Updated Mar 19, 2018
Python

Delta-ML / delta

Star

DELTA is a deep learning based natural language and speech processing platform.

Updated Feb 6, 2021
Python

pndurette / gTTS

Star

Python library and CLI tool to interface with Google Translate's text-to-speech API

python cli text-to-speech python-library pypi speech tts gtts speech-api

Updated Mar 18, 2021
Python

pytorch / audio

Star

Open

Autograd tests for Transforms

6

mthrok commented Apr 1, 2021

Until recent, we have been assumed that ops provided in torchaudio support autograd just because they are implemented with PyTorch. However, this assumption was not correct all the time. For example, in #704, it was pointed out that lfitler does not support autograd, and this was resolved in #1310 with proper unit tests by community contribution. Similarly, as a part of #1337, I have added autog

Read more

contributions welcome good first issue help wanted testing

Open

Better structure dataset implementations

22

julius-speech / julius

Star

Open-Source Large Vocabulary Continuous Speech Recognition Engine

recognition speech speech-recognition audio-processing

Updated Sep 24, 2020
C

PaddlePaddle / DeepSpeech

Star

A PaddlePaddle implementation of ASR.

speech speech-recognition speech-to-text deep-speech

Updated Apr 15, 2021
Python

soloud

jarikomppa / soloud

Star

Open

more filters should be implemented

1

brightening-eyes commented Feb 20, 2018

hi,
as you know, in SoLoud, the number of filters are limited
we should implement more like different reverbs, fir and irr filters, (these could be used to implement HRTF support), Chorus, One Poll, One Zero, Pole Zero, Two Pole, Two Zero, etc
a library exists called stk under zlib license which already implemented these maybe we can implement some of these out

Read more

good first issue help wanted

Open

Seek performance

5

Kyubyong / dc_tts

Star

A TensorFlow Implementation of DC-TTS: yet another text-to-speech model

speech tts speech-to-text

Updated Jun 7, 2018
Python

coqui-ai / TTS

Star

🐸

💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

python text-to-speech deep-learning speech pytorch tts vocoder tacotron speaker-encodings tensorflow2 melgan speaker-encoder melgan-stft glow-tts hifigan align-tts

Updated Apr 15, 2021
Python

snakers4 / silero-models

Star

Silero Models: pre-trained speech-to-text, text-to-speech models and benchmarks made embarrassingly simple

text-to-speech german speech pytorch english speech-recognition spanish colab speech-to-text pretrained-models stt asr pretrained onnx stt-benchmark enterprise-grade-stt silero-models tts-models torch-hub

Updated Apr 15, 2021
Jupyter Notebook

pykaldi / pykaldi

Star

A Python wrapper for Kaldi

python wrapper numpy speech feature-extraction speech-recognition kaldi language-model asr openfst clif

Updated Apr 7, 2021
Python

praat / praat

Star

Praat: Doing Phonetics By Computer

speech phonetics acoustics

Updated Apr 15, 2021
C

santi-pdp / segan

Star

Speech Enhancement Generative Adversarial Network in TensorFlow

deep-neural-networks deep-learning tensorflow speech gan generative-model generative-adversarial-networks

Updated Jan 14, 2021
Python

MITESHPUTHRANNEU / Speech-Emotion-Analyzer

Star

The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)

data-science natural-language-processing deep-neural-networks deep-learning neural-network keras voice speech emotion python3 audio-files speech-recognition emotion-recognition natural-language-understanding speech-emotion-recognition

Updated Mar 18, 2021
Jupyter Notebook

jtkim-kaist / VAD

Star

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

data speech dnn lstm speech-recognition attention vad voice-detection voice-activity-detection bdnn acam speech-activity-detection

Updated Jun 22, 2020
MATLAB

googleapis / nodejs-speech

Star

Node.js client for Google Cloud Speech: Speech to text conversion powered by machine learning.

nodejs machine-learning speech speech-to-text

Updated Apr 15, 2021
TypeScript

evancohen / sonus

Star

💬 /so.nus/ STT (speech to text) for Node with offline hotword detection

alexa node speech voice-recognition speech-recognition speech-to-text voice-control stt hotword-detection keyword-spotting

Updated Mar 28, 2021
JavaScript

drethage / speech-denoising-wavenet

Star

A neural network for end-to-end speech denoising

machine-learning deep-learning end-to-end speech neural-networks wavenet speech-processing speech-denoising

Updated Jul 24, 2019
Python

google / tacotron

Star

Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.

audio machine-learning speech tts prosody tacotron

Updated Apr 5, 2021
HTML

lkuza2 / java-speech-api

Star

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

java api google recognition speech speech-synthesis speech-recognition speech-to-text jarvis

Updated May 2, 2019
Java

goxr3plus / XR3Player

Star

🎧

🎼 Advanced JavaFX Media Player

javafx mp3 speech audio-visualizer audio-player audio-recorder spectrum-analyzer audio-formats web-browser audio-processing dropbox-client stream-player java-speech java-stream-player

Updated Apr 14, 2021
Java

Improve this page

Add a description, image, and links to the speech topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speech topic, visit your repo's landing page and select "manage topics."