Structured outputs for llms
Python bindings for llama.cpp
Build AI WhatsApp Bots with Pure Python
A Simple and Universal Swarm Intelligence Engine
A high-throughput and memory-efficient inference and serving engine
The Multi-Agent Framework
Run Local LLMs on Any Device. Open-source
Toolkit for conversational AI
lightweight package to simplify LLM API calls
Ongoing research training transformer models at scale
A modular graph-based Retrieval-Augmented Generation (RAG) system
Revolutionizing Database Interactions with Private LLM Technology
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Interact with your documents using the power of GPT
Simple, Pythonic building blocks to evaluate LLM applications
Powerful AI language model (MoE) optimized for efficiency/performance
Phi-3.5 for Mac: Locally-run Vision and Language Models
An LLM-powered knowledge curation system that researches topics
Operating LLMs in production
Multilingual sentence & image embeddings with BERT
Advanced language and coding AI model
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
An elegent pytorch implement of transformers
State-of-the-art Parameter-Efficient Fine-Tuning
Access large language models from the command-line