A high-quality rapid TTS voice cloning model
Capable of understanding text, audio, vision, video
PersonaPlex code
TTS with kokoro and onnx runtime
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
State-of-the-art TTS model under 25MB
A TTS model capable of generating ultra-realistic dialogue
Towards Human-Sounding Speech
Synchronized Translation for Videos
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Instant voice cloning by MIT and MyShell. Audio foundation model
High-quality multi-lingual text-to-speech library by MyShell.ai
A simple, high-quality voice conversion tool focused on ease of use
Open-source model for program synthesis
LLM Large Model of Selling Anchor
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Interface for OuteTTS models
A sound cloning tool with a web interface, using your voice
Inference code for CodeLlama models
Clone a voice in 5 seconds to generate arbitrary speech in real-time
NLP Cloud serves high performance pre-trained or custom models for NER
A simple native web interface that uses ChatTTS to synthesize text
Repo of Qwen2-Audio chat & pretrained large audio language model
Large Audio Language Model built for natural interactions
From Images to High-Fidelity 3D Assets