Interactive video and image annotation tool for computer vision
Open-Source RPA Software (formerly Kantu)
Enable AI to control your desktop, mobile and HMI devices
Implementation of Vision Transformer, a simple way to achieve SOTA
Open source framework for deep learning satellite and aerial imagery
A framework to enable multimodal models to operate a computer
3D reconstruction software
YOLOv5 is the world's most loved vision AI
Structure-from-Motion and Multi-View Stereo
A GUI Agent app based on UI-TARS to control your computer using AI
Open Source Computer Vision Library
Google Testing and Mocking Framework
OpenVINO™ Toolkit repository
Java interface to OpenCV, FFmpeg, and more
Datasets, transforms and models specific to Computer Vision
Open Source Differentiable Computer Vision Library
Phi-3.5 for Mac: Locally-run Vision and Language Models
Fast image augmentation library and an easy-to-use wrapper
The open-source tool for building high-quality datasets
Go package for computer vision using OpenCV 4 and beyond
Deep learning library
Training data (data labeling, annotation, workflow) for all data types
AI based photo editing website for changing image background
Visual Instruction Tuning: Large Language-and-Vision Assistant
Automatically find issues in image datasets