CLIP ViT-bigG/14: Zero-shot image-text model trained on LAION-2B
Tooling for the Common Objects In 3D dataset
Fine-tuning ChatGLM-6B with PEFT
The ChatGPT Retrieval Plugin lets you easily find personal documents
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
Chinese LLaMA & Alpaca large language model + local CPU/GPU training
Claude Code mirror, a one-stop open-source relay service
Research code artifacts for Code World Model (CWM)
GPT4V-level open-source multi-modal model based on Llama3-8B
CogView4, CogView3-Plus and CogView3 (ECCV 2024)
Official repo for consistency models
Code release for ConvNeXt V2 model
Learning Continuous Signed Distance Functions for Shape Representation
Towards Ultimate Expert Specialization in Mixture-of-Experts Language
Advancing Formal Mathematical Reasoning via Reinforcement Learning
685B model with improved agents and consistency
High-efficiency reasoning and agentic intelligence model
High-compute ultra-reasoning model surpassing GPT-5
Official DeiT repository
Agentic 123B coding model optimized for large-scale engineering
Lightweight 24B agentic coding model with vision and long context
Official PyTorch Implementation of "Scalable Diffusion Models"
Dia-1.6B generates lifelike English dialogue and vocal expressions
Official implementation of DreamCraft3D
Text-to-image model optimized for artistic quality and safe generation