GPT4V-level open-source multi-modal model based on Llama3-8B
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Evals is a framework for evaluating LLMs and LLM systems
JAX-based neural network library
LLM training code for MosaicML foundation models
OCR expert VLM powered by Hunyuan's native multimodal architecture
Qwen-Image is a powerful image generation foundation model
Swirl queries any number of data sources with APIs
Open-source multi-speaker long-form text-to-speech model
Practical productivity tools for Claude Code, Codex-CLI
Gracefully face hCaptcha challenge with multimodal llms
Open-source framework for intelligent speech interaction
Renderer for the harmony response format to be used with gpt-oss
High-Quality Voice Cloning TTS for 600+ Languages
Enterprise AI agent platform for workflows, models, and RAG apps
Vertically Unified Agents for Graph Retrieval-Augmented Reasoning
Advanced techniques for RAG systems
Utilities intended for use with Llama models
Set of tools to assess and improve LLM security
The Memory layer for AI Agents
Models for the spaCy Natural Language Processing (NLP) library
SQL-native memory layer enabling persistent context for AI agents
The largest open-source medical AI skills library for OpenClaw
Easiest and laziest way for building multi-agent LLMs applications
Quark Agent - Your AI-powered Android APK Analyst