Machine Learning Containers for NVIDIA Jetson and JetPack-L4T
Data science interview questions and answers
Build multimodal AI applications with cloud-native stack
An efficient forwarding service designed for LLMs
Test-Time Reinforcement Learning
A simple yet powerful agent framework that delivers with models
Bridging LLM and Recommender System
Semi-Structured Agentic Framework. Workflows build themselves
Parallax is a distributed model serving framework
Minimal reproduction of OneRec
Redundancy-aware KV Cache Compression for Reasoning Models
Automated framework for asset discovery and vulnerability scanning
The official implementation of RAPTOR
AI-driven multi-agent research assistant automating hypothesis
A high-quality PDF to Markdown tool based on large language model
Pre & Post-training & Dataset & Evaluation & Depoly & RAG
Specify a github or local repo, github pull request
CNCF Sandbox Project
From nobody to big model (LLM) hero
MoBA: Mixture of Block Attention for Long-Context LLMs
Mastering Applied AI, One Concept at a Time
the terminal client for Ollama
NeurIPS2025 Spotlight] Quantized Attention
An open-source, modern-design AI training tracking and visualization
A simple, easy-to-hack GraphRAG implementation