The Clay Foundation Model - An open source AI model and interface
Z80-μLM is a 2-bit quantized language model
Research code artifacts for Code World Model (CWM)
CodeGeeX2: A More Powerful Multilingual Code Generation Model
RGBD video generation model conditioned on camera input
Audio foundation model excelling in audio understanding
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Easy Docker setup for Stable Diffusion with user-friendly UI
CogView4, CogView3-Plus and CogView3(ECCV 2024)
Open-source large language model family from Tencent Hunyuan
Recovering the Visual Space from Any Views
Industrial-level controllable zero-shot text-to-speech system
Project Lyra: Open Generative 3D World Models
Video Object and Interaction Deletion
HY-Motion model for 3D character animation generation
A Systematic Framework for Interactive World Modeling
Pokee Deep Research Model Open Source Repo
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
A Customizable Image-to-Video Model based on HunyuanVideo
Open Source Speech Language Model
Open-source industrial-grade ASR models
Ling is a MoE LLM provided and open-sourced by InclusionAI
Multimodal-Driven Architecture for Customized Video Generation
Foundation Models for Time Series
tiktoken is a fast BPE tokeniser for use with OpenAI's models