The subtitle editor
Unofficial (Golang) Go bindings for the Hugging Face Inference API
Collection of Gemma 3 variants that are trained for performance
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Cross-platform AI language practice app
Framework for building realtime multimodal voice AI agents apps
Implementation of a U-net complete with efficient attention
AI-Generated Tags and Summaries for Telegram Messages
Secure open source cloud runtime for AI apps & AI agents
Video Object and Interaction Deletion
The AI-Native Search Database
All-in-one LLM CLI tool featuring Shell Assistant
Diffusion model(SD,Flux,Wan,Qwen Image,Z-Image,...) inference
Official SeedVR2 Video Upscaler for ComfyUI
Vim plugin for LLM-assisted code/text completion
Examples and guides for using the Gemini API
Self-hosted AI audio transcription
Code and models for ICML 2024 paper, NExT-GPT
Video, Image and GIF upscale/enlarge(Super-Resolution)
Modular AI image and video generation web UI with extensible tools
AI teacher that lives as a buddy next to your cursor
Open source text-to-speech tool, supports extra-long text
Toolkit for conversational AI
Olares: An Open-Source Sovereign Cloud OS for Local AI