Generating Immersive, Explorable, and Interactive 3D Worlds
Repo for SeedVR2 & SeedVR
Fast, Sharp & Reliable Agentic Intelligence
Video Object and Interaction Deletion
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Contexts Optical Compression
Controllable & emotion-expressive zero-shot TTS
An implementation of model parallel GPT-2 and GPT-3-style models
Foundation Models for Time Series
HY-Motion model for 3D character animation generation
Qwen3-ASR is an open-source series of ASR models
LLM-based Reinforcement Learning audio edit model
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Open source large language model by Alibaba
My personal Claude Code configuration
Block Diffusion for Ultra-Fast Speculative Decoding
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)
Sharp Monocular Metric Depth in Less Than a Second
GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image
Global weather forecasting model using graph neural networks and JAX
Large-scale autoregressive pixel model for image generation by OpenAI
Scaling Reinforcement Learning with LLMs
Collection of Gemma 3 variants that are trained for performance
MiniMax-M2, a model built for Max coding & agentic workflows
Real-time behaviour synthesis with MuJoCo, using Predictive Control