Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
Constrained Value Alignment via Safe Reinforcement Learning
Qwen3 is the large language model series developed by the Qwen team
A dataset consisting of 15,140 ChatGPT prompts from Reddit
Recipes for training reward models for RLHF
Course to get into Large Language Models (LLMs)
Semi-Structured Agentic Framework. Workflows build themselves
A Survey of Large Language Models
Open-source model for program synthesis
A series of math-specific large language models in our Qwen2 series
dLLM: Simple Diffusion Language Modeling
Official release of InternLM series
Quick guide to (especially) trending instruction fine-tuning datasets