GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Reference PyTorch implementation and models for DINOv3
Code for running inference with the SAM 3D Body Model 3DB
Text and image to video generation: CogVideoX and CogVideo
An experimental version of DeepSeek model
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Open-source multi-speaker long-form text-to-speech model
A Family of Open Sourced Music Foundation Models
Official repository for LTX-Video
Models for object and human mesh reconstruction
Stable Diffusion with Core ML on Apple Silicon
Accurate × Fast × Comprehensive
A Systematic Framework for Interactive World Modeling
Visual Causal Flow
Towards Real-World Vision-Language Understanding
Recovering the Visual Space from Any Views
The official repo of Qwen chat & pretrained large language model
Lets make video diffusion practical
Foundation Models for Time Series
Hackable and optimized Transformers building blocks
AlphaFold 3 inference pipeline
Revolutionizing Database Interactions with Private LLM Technology
Qwen3-TTS is an open-source series of TTS models
Generating Immersive, Explorable, and Interactive 3D Worlds
Uncommon Objects in 3D dataset