Audio Plugin for Audio to MIDI transcription using deep learning
Automatic Speech Recognition with Word-level Timestamps
Self-hosted AI audio transcription
Open Source AI Dictation App
Faster Whisper transcription with CTranslate2
A free, open source, and extensible speech-to-text application
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
A private, local meeting notes assistant
Crowdsourcing platform for full text transcription and tagging
A Web UI for easy subtitle using whisper model
Fast and accurate automatic speech recognition (ASR) for edge devices
Comprehensive Gradio WebUI for audio processing
Qwen3-ASR is an open-source series of ASR models
A lightweight audio-to-MIDI converter with pitch bend detection
Generate blog articles from video or audio
Privacy first, AI meeting assistant with 4x faster Parakeet/Whisper
A Family of Open Sourced Music Foundation Models
Voice Recognition to Text Tool
Convert files and web content into clean, usable Markdown easily
AI-powered tool for generating, optimizing, and translating subtitles
Synchronized Translation for Videos
AI tool converting video/audio into structured documents instantly
PageLM is a community driven version of NotebookLM
A nearly-live implementation of OpenAI's Whisper
Cut videos with a text editor