AI-based photo editing website for changing image backgrounds
Translate a video from one language to another and embed dubbing
Offline inference engine for art, real-time voice conversations
Industry-leading face manipulation platform
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
Towards Human-Sounding Speech
All-in-one WebUI for AI generative image and video creation
Free, high-quality text-to-speech API endpoint to replace OpenAI
Large language model for a product-selling livestream anchor
Reverse engineering Gemini's SynthID detection
Virtual AI anchor that combines state-of-the-art technology
Python Audio Analysis Library: Feature Extraction, Classification
SUPIR upscaling wrapper for ComfyUI
HivisionIDPhotos: a lightweight and efficient AI ID photo tool
Build Vision Agents quickly with any model or video provider
Software that uses AI to perform real-time voice conversion
Advanced AI Explainability for computer vision
Refine and quantize messy AI pixel art into clean, perfect pixels
AI-powered tool to quickly remove watermarks from images flawlessly
Multi-Voice and Prompt-Controlled TTS Engine
A walk along memory lane
Real-ESRGAN aims at developing Practical Algorithms
We provide a PyTorch implementation of the paper Voice Separation
Separate audio recordings into individual sources