InvokeAI is a leading creative engine for Stable Diffusion models
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Instant voice cloning by MIT and MyShell. Audio foundation model
Open-source multi-speaker long-form text-to-speech model
Python bindings for llama.cpp
The ultimate RAG for your monorepo
AIMET is a library that provides advanced quantization and compression
Python-based neural networks API
A high-quality tool for convert PDF to Markdown and JSON
Open Source Document Management System for Digital Archives
The largest open-source medical AI skills library for OpenClaw
Qwen3-TTS is an open-source series of TTS models
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
A Web UI for easy subtitle using whisper model
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML
AI-powered tool for generating, optimizing, and translating subtitles
A text-to-speech, speech-to-text and speech-to-speech library
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
High-performance inference framework for large language models
Text and image to video generation: CogVideoX and CogVideo
A natural language interface for computers
The official Python client for the Huggingface Hub
A collaboration friendly studio for NeRFs
lightweight package to simplify LLM API calls
The Multi-Agent Framework