End-to-End Speech Processing Toolkit
Updated Mar 17, 2023 - Python
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
This is now the official location of the Merlin project.
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
Voice Conversion Tool Kit
Unsupervised Speech Decomposition Via Triple Information Bottleneck
PyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC)
Voice Converter Using CycleGAN and Non-Parallel Data
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
A simple JavaFX-based desktop application for subtitle processing, with integrated online translation and voice conversion
A PyTorch implementation of the paper: StarGAN-VC: Non-parallel many-to-many voice conversion with star generative adversarial networks
Voice Conversion by CycleGAN (voice cloning/voice conversion): CycleGAN-VC2
Library to build speech synthesis systems designed for easy and fast prototyping.
Audio style transfer using a shallow CNN with random parameters. Result: https://soundcloud.com/mazzzystar/sets/speech-conversion-sample
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
Deep learning for audio processing
StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion
Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!
A full TensorFlow implementation of the paper: StarGAN-VC: Non-parallel many-to-many voice conversion with star generative adversarial networks (https://arxiv.org/abs/1806.02169)