Clone a voice in 5 seconds to generate arbitrary speech in real-time
-
Updated
May 5, 2023 - Python
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
NeMo: a toolkit for conversational AI
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
Lingvo
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure java
WaveRNN Vocoder + TTS
Python library and CLI tool to interface with Google Translate's text-to-speech API
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
Add a description, image, and links to the tts topic page so that developers can more easily learn about it.
To associate your repository with the tts topic, visit your repo's landing page and select "manage topics."