A lightweight vLLM implementation built from scratch
Port of OpenAI's Whisper model in C/C++
Fast Multimodal LLM on Mobile Devices
TVM Documentation in Chinese Simplified
TinyML AI inference library
LL model providing reasoning and conversational capabilities
Large language model developed and released by NVIDIA
Jan-v1-edge: efficient 1.7B reasoning model optimized for edge devices