Offline speech recognition API for Android, iOS, Raspberry Pi
Open speech-to-speech models and pipelines by Hugging Face
A PyTorch-based Speech Toolkit
Multilingual Automatic Speech Recognition with word-level timestamps
OpenVINO™ Toolkit repository
Training data (data labeling, annotation, workflow) for all data types
Interactive Machine Learning experiments
State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX
Toolkit for conversational AI
A ranked list of awesome machine learning Python libraries
Python Audio Analysis Library: Feature Extraction, Classification
Port of OpenAI's Whisper model in C/C++
A Lightweight Face Recognition and Facial Attribute Analysis
Statistical machine intelligence and learning engine
Data manipulation and transformation for audio signal processing
Open Source Computer Vision Library
Handwritten Text Recognition (HTR) system implemented with TensorFlow
Models for the spaCy Natural Language Processing (NLP) library
Towards Studio-Grade Character Animation via In-Context Learning of 3D
Translate a video from one language to another and embed dubbing
Han Language Processing
High-Performance Face Recognition Library on PaddlePaddle & PyTorch
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
Library for OCR-related tasks powered by Deep Learning
C++ library for high performance inference on NVIDIA GPUs