Source code of PyGAD, Python 3 library for building genetic algorithms
Python inference and LoRA trainer package for the LTX-2 audio–video
Video-based AI memory library. Store millions of text chunks in MP4
Stable Diffusion web UI
Image polygonal annotation with Python
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
A high-throughput and memory-efficient inference and serving engine
Official inference repo for FLUX.1 models
Personal AI, On Personal Devices
OCRmyPDF adds an OCR text layer to scanned PDF files
Robust Speech Recognition via Large-Scale Weak Supervision
1 min voice data can also be used to train a good TTS model
Seamlessly integrate LLMs as Python functions
Python Optimal Transport
Public repository for Agent Skills
The official Python client for the Huggingface Hub
Reverse-engineered Python API for Google Gemini web app
Awesome multilingual OCR toolkits based on PaddlePaddle
Code for running inference and finetuning with SAM 3 model
State-of-the-art TTS model under 25MB
3D reconstruction software
The highest-scoring AI memory system ever benchmarked
Ready-to-use OCR with 80+ supported languages
Industrial-strength Natural Language Processing (NLP)
A Model Context Protocol server for Excel file manipulation