Clone a voice in 5 seconds to generate arbitrary speech in real-time
Updated Nov 11, 2023 - Python
🚀 AI voice cloning: clone your voice in 5 seconds and generate arbitrary speech content in real time
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
🤖 The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs ggml, gguf, GPTQ, onnx, TF compatible models: llama, llama2, rwkv, whisper, vicuna, koala, cerebras, falcon, dolly, starcoder, and many others
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
NeMo: a toolkit for conversational AI
🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
An open-source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
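🤖 wukong-robot is a simple, flexible, and elegant Chinese voice dialogue robot and smart speaker project. It supports ChatGPT multi-turn conversation and may be the first open-source smart speaker project to support brain-computer interaction.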
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
🎤 A Microsoft speech synthesis tool, built with Electron + Vue + ElementPlus + Vite.
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
🤖 Lobe Chat - an open-source, high-performance chatbot framework that supports speech synthesis, multimodal, and extensible Function Call plugin system. Supports one-click free deployment of your private ChatGPT/LLM web application.
😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supports English, French, Korean, Chinese, and German, and is easy to adapt to other languages)
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
Lingvo