Automatic question answering for local knowledge bases based on LLMs
AI Agent Evaluator & Red Team Platform
Collect, organize, use, and share, all in OmniBox
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Repo for the Qwen2-Audio chat & pretrained large audio language models
SQL-Driven RAG Engine
AI-powered code assistant for Vim. OpenAI and ChatGPT plugin for Vim
Towards Efficient Self-Evolving Agent System
Unified KV Cache Compression Methods for Auto-Regressive Models
A tension reasoning engine over 131 S-class problems
An LLM Compiler for Parallel Function Calling
AI Powered Knowledge Graph Generator
DepGraph: Towards Any Structural Pruning
Performance-optimized AI inference on your GPUs
Retrieval and Retrieval-augmented LLMs
slime is an LLM post-training framework for RL Scaling
In-depth tutorials on LLMs, RAGs and real-world AI agent applications
95% token savings. 155x faster queries. 16 languages
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
Research code artifacts for Code World Model (CWM)
OpenCompass is an LLM evaluation platform
Search all of YouTube from the command line
Deploy your agentic workflows to production
Chat & pretrained large vision language model
PyTorch library of curated Transformer models and their components