Train any agents simply by 'talking'
Curated list of datasets and tools for post-training
Train multi-step agents for real-world tasks using GRPO
Project aimed at extracting, exporting, and analyzing chat records
Claude Code is an agentic coding tool that lives in your terminal
Designed for training LLM/VLM agents via RL
Libre Survival Manual for Android with offline in mind
TextWorld is a sandbox learning environment for the training
Faster and easier training and deployments
Learning to Reason with Search for LLMs via Reinforcement Learning
A simple, performant and scalable Jax LLM
Training framework for Stable Baselines3 reinforcement learning agents
Deep learning optimization library: makes distributed training easy
Dataset Management Framework, a Python library and a CLI tool to build
Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy
LLM101n: Let's build a Storyteller
A fast TTS architecture with conditional flow matching
A flexible, high-performance 3D simulator for Embodied AI research
"Big Model" trains a visual multimodal VLM with 26M parameters
RF-DETR is a real-time object detection and segmentation
Volcano Engine Reinforcement Learning for LLMs
Retrieval and Retrieval-augmented LLMs
Reinforcement Learning for Humanoid Robot with Zero-Shot Sim2Real
A Next-Generation Training Engine Built for Ultra-Large MoE Models
Ongoing research training transformer models at scale