A single Gradio + React WebUI with extensions for ACE-Step
One-click deployment (including offline integration package)
Generate audiobooks from e-books
Speech recognition for your site
Transcribe on your own
C++ inference library for multiple SVC/TTS
A speech-text foundation model for real time dialogue
SoTA open-source TTS
Free & Easy AI Voice Accounting Software For Blind & Speechless People
Framework for building AI-powered interactive digital humans and agents
Interface for OuteTTS models
Easy-to-use Speech Toolkit including Self-Supervised Learning models
Build and run agents you can see, understand and trust
A Conversational Speech Generation Model
Multi-modal large language model designed for audio understanding
Natural speech programming assistant for software developers
A Web UI for easy subtitle generation using the Whisper model
Convert VoIP calls to text and analyze them with AI
LLM-based Reinforcement Learning audio editing model
Powerful Android AI agent with tools, automation, and Linux shell
Build Vision Agents quickly with any model or video provider
Meow Voice Changer is a lightweight, real-time voice modulation tool
Large language model for sales livestream hosts
Open-source industrial-grade ASR models
Olares: An Open-Source Sovereign Cloud OS for Local AI