Qwen3-TTS is an open-source series of TTS models
Qwen3 is the large language model series developed by the Qwen team
Qwen3-Coder is the code version of Qwen3
Qwen3-VL, the multimodal large language model series by Alibaba Cloud
Qwen3-ASR is an open-source series of ASR models
Designed for text embedding and ranking tasks
Qwen3-Omni is a natively end-to-end, omni-modal LLM
Use Microsoft Edge's online text-to-speech service from Python
State-of-the-art TTS model under 25MB
A fast TTS architecture with conditional flow matching
A TTS that fits in your CPU (and pocket)
A single Gradio + React WebUI with extensions for ACE-Step
Controllable & emotion-expressive zero-shot TTS
Spark-TTS Inference Code
A lightweight text-to-speech model with zero-shot voice cloning
Towards Human-Sounding Speech
Multimodal embedding and reranking models built on Qwen3-VL
Speech to Text to Speech, sends text as OSC messages
Qwen3.5 is the large language model series developed by the Qwen team
The open-source voice synthesis studio powered by Qwen3-TTS
Converts text to speech in real time
SoTA open-source TTS
Comprehensive Gradio WebUI for audio processing
One minute of voice data is enough to train a good TTS model
Free, high-quality text-to-speech API endpoint to replace OpenAI