Block Diffusion for Ultra-Fast Speculative Decoding
Official implementation of Watermark Anything with Localized Messages
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Qwen-Image-Layered: Layered Decomposition for Inherent Editablity
ChatGPT interface with better UI
DeepSeek Coder: Let the Code Write Itself
Open-source framework for intelligent speech interaction
FAIR Sequence Modeling Toolkit 2
PyTorch code and models for the DINOv2 self-supervised learning
GLM-4-Voice | End-to-End Chinese-English Conversational Model
CogView4, CogView3-Plus and CogView3(ECCV 2024)
Inference code for scalable emulation of protein equilibrium ensembles
Generate Any 3D Scene in Seconds
Advancing Open-source World Models
Controllable & emotion-expressive zero-shot TTS
The Clay Foundation Model - An open source AI model and interface
Qwen3-ASR is an open-source series of ASR models
VMZ: Model Zoo for Video Modeling
Video understanding codebase from FAIR for reproducing video models
Tool for exploring and debugging transformer model behaviors
CLIP, Predict the most relevant text snippet given an image
A Unified Framework for Text-to-3D and Image-to-3D Generation
GLM-4 series: Open Multilingual Multimodal Chat LMs
Easy Docker setup for Stable Diffusion with user-friendly UI
Inference script for Oasis 500M