Benchmark LLMs by fighting in Street Fighter 3
Cache-Augmented Generation: A Simple, Efficient Alternative to RAG
Constrained Value Alignment via Safe Reinforcement Learning
Unleashing 10,000+ Word Generation from Long Context LLMs
An agentless approach to automatically solve software development
Neural Network architecture based on ideas of the original LSTM
A simple, performant and scalable Jax LLM
Overcoming Group Chat Scenarios with LLM-based Technical Assistance
Enhances Tesseract OCR output using LLMs (local or API)
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System
Implementation for MatMul-free LM
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
High-performance inference framework for large language models
Code and models for ICML 2024 paper, NExT-GPT
Structured data extraction and instruction calling with ML, LLM
Robust recipes to align language models with human and AI preferences
Accessible large language models via k-bit quantization for PyTorch
Open-source large language model family from Tencent Hunyuan
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
Provides line-oriented text file editing capabilities
Airtable integration for AI-powered applications
Efficient Retrieval Augmentation and Generation Framework
A full spaCy pipeline and models for scientific/biomedical documents
Libraries for applying sparsification recipes to neural networks
Data processing for and with foundation models