Unified Multimodal Understanding and Generation Models
Volcano Engine Reinforcement Learning for LLMs
An open-source, end-to-end VLM-based GUI agent
Revolutionizes the way users interact with Autogen
Run LLMs locally on Cloud Workstations
Genome modeling and design across all domains of life
AI tool that generates tests to improve code coverage quickly
Local RAG engine for private multimodal knowledge search on devices
AI multi-agent framework for automating data-driven R&D workflows
Collection of Kaggle Solutions and Ideas
Implement a concise and clear Deep Search Agent from scratch
General-purpose image editing model that delivers high-fidelity
No-code LLM Platform to launch APIs and ETL Pipelines
Ling-V2 is an MoE LLM open-sourced by InclusionAI
Self-learning data agent that grounds its answers in layers of content
This repository contains the official implementation of FastVLM
ICLR2024 Spotlight: curation/training code, metadata, distribution
Diffusion Transformer with Fine-Grained Chinese Understanding
Build multi-modal Agents with memory, knowledge, tools and reasoning
Reading book source
MCP integration platforms for AI agents to use tools at any scale
Virtual AI anchor that combines state-of-the-art technology
AI tool for real-time monitoring and analysis of Goofish listings
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
One-click deployment (including offline integration package)