Audio foundation model excelling in audio understanding
Persian NLP Toolkit
State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX
The media player for language learning, with dual subtitles
Multi-lingual large voice generation model, providing inference
Efficient, internationalized text renderer for GameMaker
Speech recognition for your site
A simple native web interface that uses ChatTTS to synthesize text
SOTA discrete acoustic codec models with 40/75 tokens per second
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Interface for OuteTTS models
Open-source framework for intelligent speech interaction
A TTS model capable of generating ultra-realistic dialogue
LLM-based reinforcement learning audio editing model
Real-time voice interactive digital human
Transcribe on your own
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
Bailing is a voice dialogue robot similar to GPT-4o
Ito, smart dictation in every application
A sound cloning tool with a web interface, using your voice
Open-source industrial-grade ASR models
Framework for building neural networks
Underthesea - Vietnamese NLP Toolkit
Anki flashcards on Android
Repo of Qwen2-Audio chat & pretrained large audio language model