tts

🤖 The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs ggml, gguf, GPTQ, onnx, TF compatible models: llama, llama2, rwkv, whisper, vicuna, koala, cerebras, falcon, dolly, starcoder, and many others

api kubernetes bloom ai containers falcon tts api-rest llama alpaca vicuna guanaco gpt-neox llm stable-diffusion rwkv gpt4all

Updated Nov 22, 2023
C++

PaddlePaddle / PaddleSpeech

Star

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Updated Nov 22, 2023
Python

NVIDIA / NeMo

Star

NeMo: a toolkit for conversational AI

nlp text-to-speech deep-learning neural-network machine-translation tts speech-synthesis speech-recognition speech-to-text nmt language-model speaker-recognition nlp-machine-learning asr speaker-diarization text-normalization

Updated Nov 22, 2023
Python

mozilla / TTS

Star

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

python text-to-speech deep-learning speech pytorch tts vocoder tacotron tensorflow2 tacotron2 melgan speaker-encoder dataset-analysis glow-tts multiband-melgan gantts

Updated Nov 9, 2023
Jupyter Notebook

Plachtaa / VALL-E-X

Star

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

text-to-speech tts gpt transformer-architecture emotional-speech voice-clone vall-e

Updated Nov 3, 2023
Python

pot-app / pot-desktop

Star

🌈一个跨平台的划词翻译和OCR软件 | A cross-platform software for text translation and recognize.

windows macos linux ocr translation tts translate pot recognize tauri pot-app

Updated Nov 22, 2023
JavaScript

jaywalnut310 / vits

Star

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

text-to-speech deep-learning pytorch tts speech-synthesis

Updated Jul 4, 2023
Python

wzpan / wukong-robot

Sponsor

Star

🤖 wukong-robot 是一个简单、灵活、优雅的中文语音对话机器人/智能音箱项目，支持ChatGPT多轮对话能力，还可能是首个支持脑机交互的开源智能音箱项目。

alexa ai amazon-echo muse tts openai google-home unit bci speaker homeassistant snowboy asr anyq raspeberry-pi gpt3 chatgpt

Updated Nov 21, 2023
Python

netease-youdao / EmotiVoice

Star

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

python text-to-speech ai deep-learning style prompt speech emotion pytorch tts speech-synthesis multi-speaker emotivoice

Updated Nov 22, 2023
Python

LokerL / tts-vue

Star

🎤 微软语音合成工具，使用 Electron + Vue + ElementPlus + Vite 构建。

electron vue tts element-plus

Updated Nov 16, 2023
TypeScript

snakers4 / silero-models

Star

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

Updated Oct 18, 2023
Jupyter Notebook

MoonInTheRiver / DiffSinger

Star

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

text-to-speech midi tts speech-synthesis diffusion-model singing-voice singing-synthesis singing-voice-synthesis singing-voice-database aaai2022 diffusion-speedup

Updated May 2, 2023
Python

lobehub / lobe-chat

Star

🤖 Lobe Chat - an open-source, high-performance chatbot framework that supports speech synthesis, multimodal, and extensible Function Call plugin system. Supports one-click free deployment of your private ChatGPT/LLM web application.

chat agent ai nextjs chatbot tts openai gpt stt gpt-4 azure-openai chatgpt langchangjs lobehub function-calling lobe-chat dalle-3 gpt-4-vision

Updated Nov 22, 2023
TypeScript

TensorSpeech / TensorFlowTTS

Star

😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)