Handwritten Text Recognition (HTR) system implemented with TensorFlow
OCR software, free and offline
Awesome multilingual OCR toolkits based on PaddlePaddle
Contexts Optical Compression
State-of-the-art 2D and 3D Face Analysis Project
Accurate × Fast × Comprehensive
OCR expert VLM powered by Hunyuan's native multimodal architecture
Visual Causal Flow
Crowdsourcing platform for full text transcription and tagging
A framework to enable multimodal models to operate a computer
OCRmyPDF adds an OCR text layer to scanned PDF files
Enhances Tesseract OCR output using LLMs (local or API)
Open source AI VTuber platform with voice chat and Live2D avatars
Towards Studio-Grade Character Animation via In-Context Learning of 3D
Formula recognition based on LaTeX-OCR and ONNXRuntime
AI Agent Application Development Framework
Replace OpenAI GPT with another LLM in your app
Omnilingual ASR Open-Source Multilingual SpeechRecognition
A simple tool for reading in poorly redacted documents
Library for OCR-related tasks powered by Deep Learning
Repo of Qwen2-Audio chat & pretrained large audio language model
NLP Cloud serves high performance pre-trained or custom models for NER
An on-premises, OCR-free unstructured data extraction
A ranked list of awesome machine learning Python libraries
Qwen3-Coder is the code version of Qwen3