Curated collection of Amazing Python scripts
Multi-lingual large voice generation model, providing inference
Spark-TTS Inference Code
Open-source framework for intelligent speech interaction
Large Audio Language Model built for natural interactions
Open speech-to-speech models and pipelines by Hugging Face toolkit AI
In-App assistant SDK to build a multimodal conversational UX websites
A simple, high-quality voice conversion tool focused on ease of use
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Long-form streaming TTS system for multi-speaker dialogue generation
A text-to-speech, speech-to-text and speech-to-speech library
Realtime AI Voice Agents with SoTA Multimodal AI models on Arduino ESP
Conversational voice AI agents
TTS with kokoro and onnx runtime
Production ready toolkit to run AI locally
Real-time voice interactive digital human
Assistant SDK to build a multimodal conversational UX for Android
In-App assistant SDK to build a multimodal conversational UX for iOS
A robust, efficient, low-latency speech-to-text library
Build your own AI friend
Adds support for Yandex Smart Home (Alice voice assistant)
AI tool for automatic batch short video creation and editing
Focus on prompting and generating
Open Source Speech Language Model
A simple native web interface that uses ChatTTS to synthesize text