A fast, powerful, and simple hierarchical vision transformer
Phi-3.5 for Mac: Locally-run Vision and Language Models
Nodejs bindings to OpenCV 3 and OpenCV 4
Hub of ready-to-use datasets for ML models
Visual Instruction Tuning: Large Language-and-Vision Assistant
High-Resolution 3D Human Digitization from A Single Image
Deep Learning (Flower Book) mathematical derivation
ArrayFire, a general purpose GPU library
BoofCV is an open source Java library for real-time computer vision.
Matlab implementation of the ECO tracker
NetVLAD: CNN architecture for weakly supervised place recognition
A neural network that transforms a design mock-up into static websites
Provides code for running inference with the SegmentAnything Model
Datasets, transforms and models specific to Computer Vision
**MOVED TO GITHUB** ==> https://github.com/MRPT/mrpt
Machine learning algorithms for advanced analytics
Fast image augmentation library and an easy-to-use wrapper
Best Practices, code samples, and documentation for Computer Vision
Code release for ConvNeXt model
Various hashing methods for image retrieval and serves as the baseline
Guide to deploying deep-learning inference networks
Set of comprehensive computer vision & machine intelligence libraries
Code Repository for Machine Learning with PyTorch and Scikit-Learn
[CVPR 2025 Best Paper Award] VGGT
OpenCV Bindings for node.js