A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Convert AI papers to GUI
Image polygonal annotation with Python
The most powerful and modular diffusion model GUI, api and backend
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Qwen3-VL, the multimodal large language model series by Alibaba Cloud
GUI/CLI tool for downloading Xiaohongshu
Uncommon Objects in 3D dataset
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Python hands on tutorial with 50+ Python Application
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Free Viewer Bot Supporting Twitch | YouTube | Kick And 5+ Other Plats
Free Streaming Bot: Compatible with Twitch, YouTube and Facebook
AI Suite for upscaling, interpolating & restoring images/videos
Meta-Datenbank-Anwendung für die Audio- und TV-Sendungen des CC2.TV
Leading free and open-source liveliness check &face recognition system
Video automatic transcribe and translated subtitle generator
A repository of trained models
Windows-GUI
Based on the Disco Diffusion, version of the AI art creation software
Easy-OCR solution and Tesseract trainer for GNU/Linux
Computer vision and image processing library for Qt.