HY-Motion model for 3D character animation generation
A Production-ready Reinforcement Learning AI Agent Library
Official implementation of DreamCraft3D
A Customizable Image-to-Video Model based on HunyuanVideo
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Phi-3.5 for Mac: Locally-run Vision and Language Models
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
A series of math-specific large language models of our Qwen2 series
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
Tiny vision language model
LLM-based Reinforcement Learning audio edit model
The official PyTorch implementation of Google's Gemma models
Qwen3-omni is a natively end-to-end, omni-modal LLM
CodeGeeX2: A More Powerful Multilingual Code Generation Model
A state-of-the-art open visual language model
Open Source Speech Language Model
Open-source industrial-grade ASR models
Fast-stable-diffusion + DreamBooth
A Pragmatic VLA Foundation Model
Hunyuan Translation Model Version 1.5
Multimodal embedding and reranking models built on Qwen3-VL
Collection of Gemma 3 variants that are trained for performance
Implementation of "MobileCLIP" CVPR 2024