Turn WiFi signals into real-time human pose estimation and detection
Code for running inference with the SAM 3D Body Model 3DB
Turn WiFi signals into real-time human sensing and spatial awareness.
Implementation of DeepLabCut
End-to-end pipeline converting generative videos
[CVPR 2025 Best Paper Award] VGGT
A lightweight 3D Morphable Face Model library in modern C++
Cross-platform, customizable ML solutions for live and streaming media
Models for object and human mesh reconstruction
Pluggable SOTA multi-object tracking modules for segmentation
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
Cross-platform, customizable ML solutions
ElectronBot is a mini desktop robot
Diffusion Transformer with Fine-Grained Chinese Understanding
Build Vision Agents quickly with any model or video provider
Automatically find issues in image datasets
A gallery that showcases on-device ML/GenAI use cases
ROS 2 package of 3D lidar slam using ndt/gicp registration
Effortless data labeling with AI support from Segment Anything
Free Motion Capture for Everyone
AI-data warehouse to enrich, transform and analyze unstructured data
C++ and Python Examples
Public opinion analysis system
RGBD video generation model conditioned on camera input
Advancing Open-source World Models