GLM-4 series: Open Multilingual Multimodal Chat LMs
A course on LLM inference serving on Apple Silicon
Capable of understanding text, audio, vision, and video
DeepSeek LLM: Let there be answers
QwQ-32B is a reasoning-focused language model for complex tasks
Powerful 14B LLM with strong instruction following and long-text handling
Qwen3-Next: 80B instruct LLM with ultra-long context up to 1M tokens
Lightweight 24B agentic coding model with vision and long context