Tokenizer-Free TTS for Multilingual Speech Generation
TTS with kokoro and onnx runtime
The official Python SDK for the ElevenLabs API
Long-form streaming TTS system for multi-speaker dialogue generation
Free, open-source, and offline speech-to-text & voice control app.
Run local LLMs like llama, deepseek, kokoro etc. inside your browser
A simple, high-quality voice conversion tool focused on ease of use
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Controllable & emotion-expressive zero-shot TTS
Free, high-quality text-to-speech API endpoint to replace OpenAI
The python library for real-time communication
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Foundational model for human-like, expressive TTS
Capable of understanding text, audio, vision, video
Faster Whisper transcription with CTranslate2
Offline inference engine for art, real-time voice conversations
State-of-the-art TTS model under 25MB
Gp.nvim (GPT prompt) Neovim AI plugin
A single Gradio + React WebUI with extensions for ACE-Step
A speech-text foundation model for real time dialogue
Video translation and dubbing tool powered by LLMs
Lightning-fast, on-device TTS, running natively via ONNX
Stanford CoreNLP, a Java suite of core NLP tools
Open-source multi-speaker long-form text-to-speech model
Speakr is a personal, self-hosted web application