Bringing BERT into modernity via both architecture changes and scaling
StarVector is a foundation model for SVG generation
Knowledge Graph Generation from Any Text
Examples of using E2B
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
LongBench v2 and LongBench (ACL '25 & '24)
Hypernetworks that adapt LLMs for specific benchmark tasks
Learning to Reason with Search for LLMs via Reinforcement Learning
Benchmark LLMs by fighting in Street Fighter 3
Cache-Augmented Generation: A Simple, Efficient Alternative to RAG
Constrained Value Alignment via Safe Reinforcement Learning
Unleashing 10,000+ Word Generation from Long Context LLMs
An agentless approach to automatically solving software development problems
Neural Network architecture based on ideas of the original LSTM
A simple, performant and scalable JAX LLM
Overcoming Group Chat Scenarios with LLM-based Technical Assistance
Enhances Tesseract OCR output using LLMs (local or API)
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System
Implementation for MatMul-free LM
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
High-performance inference framework for large language models
Code and models for the ICML 2024 paper NExT-GPT
Structured data extraction and instruction calling with ML, LLM
Robust recipes to align language models with human and AI preferences
Accessible large language models via k-bit quantization for PyTorch