Visual intelligence for your home.
Open source framework for deep learning satellite and aerial imagery
Enable AI to control your desktop, mobile and HMI devices
Implementation of Vision Transformer, a simple way to achieve SOTA
A computer vision closed-loop learning platform
Build Vision Agents quickly with any model or video provider
Open Source Computer Vision Library
Interactive video and image annotation tool for computer vision
Phi-3.5 for Mac: Locally-run Vision and Language Models
Open Source Differentiable Computer Vision Library
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
C++ and Python Examples
The repository provides code for running inference with SAM 2
3D reconstruction software
Collection of CVPR 2026 Papers and Open Source Projects
Witness the aha moment of VLM with less than $3
Structure-from-Motion and Multi-View Stereo
OpenVINO™ Toolkit repository
Effortless data labeling with AI support from Segment Anything
Medical imaging toolkit for deep learning
A lightweight vision library for performing large object detection
Gracefully face hCaptcha challenge with multimodal llms
Towards Real-World Vision-Language Understanding
Set of comprehensive computer vision & machine intelligence libraries
Google Testing and Mocking Framework