Use Microsoft Edge's online text-to-speech service from Python
State-of-the-art TTS model under 25MB
A TTS that fits in your CPU (and pocket)
A fast TTS architecture with conditional flow matching
Qwen3-TTS is an open-source series of TTS models
Controllable & emotion-expressive zero-shot TTS
Towards Human-Sounding Speech
Spark-TTS Inference Code
A lightweight text-to-speech model with zero-shot voice cloning
Converts text to speech in realtime
Comprehensive Gradio WebUI for audio processing
1 min voice data can also be used to train a good TTS model
Free, high-quality text-to-speech API endpoint to replace OpenAI
SOTA Open Source TTS
TTS with kokoro and onnx runtime
EPUB to audiobook converter, optimized for Audiobookshelf
Real-time voice interactive digital human
Foundational model for human-like, expressive TTS
Industrial-level controllable zero-shot text-to-speech system
Generate audiobooks from e-books, voice cloning & 1107+ languages
Virtual AI anchor that combines state-of-the-art technology
Speech-AI-Forge is a project developed around TTS generation model
Open-source multi-speaker long-form text-to-speech model
Bailing is a voice dialogue robot similar to GPT-4o
Automatically translates the text of a video based on a subtitle file