VITS2 backbone with multilingual-bert
Mice speech to text with MX Cinnamon OS ISO
Best practice TTS based on BERT and VITS
Unofficial Parallel WaveGAN
Open source implementation of Microsoft's VALL-E X zero-shot TTS model
SoftVC VITS Singing Voice Conversion
Multimodal AI Story Teller, built with Stable Diffusion, GPT, etc.
Chinese voice dialogue robot/smart speaker project
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
A walk along memory lane
Singing Voice Synthesis via Shallow Diffusion Mechanism
WaveRNN Vocoder + TTS
Clone a voice in 5 seconds to generate arbitrary speech in real-time
PAddle PARAllel text-to-speech toolKIT
Implementation of a Transformer-based neural network
Real-Time State-of-the-art Speech Synthesis for TensorFlow 2
Conditional Variational Autoencoder with Adversarial Learning
Generative Adversarial Networks for Efficient and High Fidelity Speech
An implementation of Tacotron 2 that supports multilingual experiments
Bangla text-to-speech synthesis in Python
Toolkit for efficient experimentation with Speech Recognition
TensorFlow Implementation of DC-TTS: yet another text-to-speech model
A cross-platform wrapper for common text-to-speech engines in Python