Qwen3-TTS is an open-source series of TTS models
Qwen3 is the large language model series developed by Qwen team
Qwen3-Coder is the code version of Qwen3
Qwen3-ASR is an open-source series of ASR models
Designed for text embedding and ranking tasks
Qwen3-omni is a natively end-to-end, omni-modal LLM
Use Microsoft Edge's online text-to-speech service from Python
State-of-the-art TTS model under 25MB
A fast TTS architecture with conditional flow matching
A TTS that fits in your CPU (and pocket)
Controllable & emotion-expressive zero-shot TTS
Towards Human-Sounding Speech
Spark-TTS Inference Code
A lightweight text-to-speech model with zero-shot voice cloning
Multimodal embedding and reranking models built on Qwen3-VL
Converts text to speech in realtime
SoTA open-source TTS
Comprehensive Gradio WebUI for audio processing
Free, high-quality text-to-speech API endpoint to replace OpenAI
1 min voice data can also be used to train a good TTS model
SOTA Open Source TTS
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
EPUB to audiobook converter, optimized for Audiobookshelf
Foundational model for human-like, expressive TTS
Virtual AI anchor that combines state-of-the-art technology