Official inference framework for 1-bit LLMs
BitNet: Scaling 1-bit Transformers for Large Language Models
High-performance Inference and Deployment Toolkit for LLMs and VLMs
100–200× Acceleration for Video Diffusion Models
[NeurIPS2025 Spotlight] Quantized Attention
Generative Adversarial Networks for Efficient and High Fidelity Speech
32-bit VIRGO Linux Kernel