tts

Star

Here are 1,584 public repositories matching this topic...

CorentinJ / Real-Time-Voice-Cloning

Star

Clone a voice in 5 seconds to generate arbitrary speech in real-time

python deep-learning tensorflow pytorch tts voice-cloning

Updated May 5, 2023
Python

coqui-ai / TTS

Star

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

python text-to-speech deep-learning speech pytorch tts speech-synthesis vocoder voice-synthesis tacotron voice-cloning speaker-encodings melgan speaker-encoder multi-speaker-tts glow-tts hifigan tts-model

Updated May 16, 2023
Python

mozilla / TTS

Star

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

python text-to-speech deep-learning speech pytorch tts vocoder tacotron tensorflow2 tacotron2 melgan speaker-encoder dataset-analysis glow-tts multiband-melgan gantts

Updated Jan 9, 2023
Jupyter Notebook

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Updated May 15, 2023
Python

NVIDIA / NeMo

Star

NeMo: a toolkit for conversational AI

nlp text-to-speech deep-learning neural-network machine-translation tts speech-synthesis speech-recognition speech-to-text nmt language-model speaker-recognition nlp-machine-learning asr speaker-diarization text-normalization

Updated May 17, 2023
Python

wzpan / wukong-robot

Sponsor

Star

🤖 wukong-robot 是一个简单、灵活、优雅的中文语音对话机器人/智能音箱项目，支持ChatGPT多轮对话能力，还可能是首个支持脑机交互的开源智能音箱项目。

alexa ai amazon-echo muse tts openai google-home unit bci speaker homeassistant snowboy asr anyq raspeberry-pi gpt3 chatgpt

Updated May 7, 2023
Python

jaywalnut310 / vits

Star

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

text-to-speech deep-learning pytorch tts speech-synthesis

Updated Apr 24, 2023
Python

snakers4 / silero-models

Star

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

Updated May 6, 2023
Jupyter Notebook

TensorSpeech / TensorFlowTTS

Star

😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

Updated Nov 25, 2022
Python

MoonInTheRiver / DiffSinger

Star

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

text-to-speech midi tts speech-synthesis diffusion-model singing-voice singing-synthesis singing-voice-synthesis singing-voice-database aaai2022 diffusion-speedup

Updated May 2, 2023
Python

keithito / tacotron

Star

A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)

python machine-learning tensorflow tts speech-synthesis tacotron

Updated Jun 21, 2022
Python

tensorflow / lingvo

Star

Lingvo

nlp research translation tensorflow machine-translation speech distributed tts speech-synthesis mnist speech-recognition lm seq2seq speech-to-text gpu-computing language-model asr

Updated May 12, 2023
Python

zzw922cn / awesome-speech-recognition-speech-synthesis-papers

Star

Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)