3d vision nvidia free download

Showing 131 open source projects for "3d vision nvidia"

1

NVIDIA Isaac GR00T

NVIDIA Isaac GR00T N1.5 is the world's first open foundation model

NVIDIA Isaac‑GR00T N1.5 is an open-source foundation model engineered for generalized humanoid robot reasoning and manipulation skills. It accepts multimodal inputs—such as language and images—and uses a diffusion transformer architecture built upon vision-language encoders, enabling adaptive robot behaviors across diverse environments.

Downloads: 1 This Week

Last Update: 2025-11-05
See Project
2

NVIDIA Model Optimizer

A unified library of SOTA model optimization techniques

Model Optimizer is a unified library that provides state-of-the-art techniques for compressing and optimizing deep learning models to improve inference efficiency and deployment performance. It brings together multiple optimization strategies such as quantization, pruning, distillation, and speculative decoding into a single cohesive framework. The library is designed to reduce model size and computational requirements while maintaining accuracy, making it particularly valuable for deploying...

Downloads: 2 This Week

Last Update: 2026-03-18
See Project
3

NVIDIA NeMo Framework

Scalable generative AI framework built for researchers and developers

NVIDIA NeMo is a scalable, cloud-native generative AI framework aimed at researchers and PyTorch developers working on large language models, multimodal models, and speech AI (ASR and TTS), with growing support for computer vision. It provides collections of domain-specific modules and reference implementations that make it easier to pre-train, fine-tune, and deploy very large models on multi-GPU and multi-node infrastructure.

Downloads: 0 This Week

Last Update: 3 days ago
See Project
4

MESHROOM

3D reconstruction software

Photogrammetry is the science of making measurements from photographs. It infers the geometry of a scene from a set of unordered photographs or videos. A photograph is the projection of a 3D scene onto a 2D plane, which loses depth information; the goal of photogrammetry is to reverse this process. The dense model of the scene is produced by chaining two computer-vision pipelines, “Structure-from-Motion” (SfM) and “Multi-View Stereo” (MVS). Other features include fusion of multi-bracketed LDR images into HDR and alignment of panorama images. ...

1 Review

Downloads: 146 This Week

Last Update: 2025-08-19
See Project
5

NVIDIA Generative AI Examples

Generative AI reference workflows

NVIDIA GenerativeAIExamples is an open-source repository that provides practical reference implementations and example workflows for building generative AI applications using NVIDIA’s software ecosystem. The project is designed to help developers accelerate the development of AI applications by providing ready-to-run pipelines, notebooks, and tools that demonstrate how to integrate large language models into real-world systems.

Downloads: 1 This Week

Last Update: 2026-03-05
See Project
6

NVCleanstall

Custom NVIDIA driver installer: skip telemetry, GFE, bloat, etc.

NVCleanstall lets you customize the NVIDIA GeForce Driver package by removing components that you don't need (or want). This not only keeps things tidy, but also lowers disk usage and memory footprint. Unlike NVIDIA's own custom installation, it lets you deselect packages such as "Stereo 3D Glasses", "USB-C", "Notebook Optimizations", or "Telemetry" from the NVIDIA drivers.

1 Review

Downloads: 157 This Week

Last Update: 2025-11-23
See Project
7

COLMAP

Structure-from-Motion and Multi-View Stereo

COLMAP is a general-purpose Structure-from-Motion (SfM) and Multi-View Stereo (MVS) pipeline with a graphical and command-line interface. It offers a wide range of features for the reconstruction of ordered and unordered image collections. The software is licensed under the new BSD license.

Downloads: 63 This Week

Last Update: 2026-03-18
See Project
8

CUDA Containers for Edge AI & Robotics

Machine Learning Containers for NVIDIA Jetson and JetPack-L4T

...The project is particularly useful for developers building edge AI and robotics systems that rely on GPU-accelerated inference and real-time computer vision. By using containerized environments, developers can ensure that their applications run consistently across different Jetson platforms and JetPack versions. The repository also includes build tools and package management utilities that help automate the process of assembling machine learning environments.

Downloads: 0 This Week

Last Update: 2 days ago
See Project
9

AliceVision

3D Computer Vision Framework

AliceVision is an open-source photogrammetric computer vision framework designed to reconstruct detailed 3D scenes and camera motion from collections of images or videos. It provides a complete pipeline for structure-from-motion (SfM), multi-view stereo (MVS), and mesh generation, allowing users to convert 2D imagery into accurate 3D models. The framework is built with a strong emphasis on research-grade algorithms while maintaining the robustness required for production environments, making it suitable for industries such as visual effects, cultural heritage preservation, and robotics. ...

Downloads: 4 This Week

Last Update: 2026-03-18
See Project
10

Newton

An open-source, GPU-accelerated physics simulation engine

...Newton supports OpenUSD for modern 3D scene representation and interoperability, making it suitable for complex simulation ecosystems. It is developed as a Linux Foundation project with contributions from major organizations like NVIDIA, Google DeepMind, and Disney Research, highlighting its relevance in cutting-edge robotics and AI development.

Downloads: 4 This Week

Last Update: 3 days ago
See Project
11

MakeHuman

The main repository for the MakeHuman application

This is the main source code for the MakeHuman application. See "Getting started" below for instructions on how to get MakeHuman up and running. Mac users should be able to use the same instructions as Windows users, although this has not been thoroughly tested. At the time of writing, the source code is almost ready for a stable release. The testing vision for this code is to build a community release that includes the main application and often-used, user-contributed plug-ins. We...

Downloads: 41 This Week

Last Update: 2024-04-19
See Project
12

GoCV

Go package for computer vision using OpenCV 4 and beyond

GoCV gives programmers who use the Go programming language access to the OpenCV 4 computer vision library. The GoCV package supports the latest releases of Go and OpenCV v4.5.4 on Linux, macOS, and Windows. Our mission is to make the Go language a “first-class” client compatible with the latest developments in the OpenCV ecosystem. Computer Vision (CV) is the ability of computers to process visual information, and perform tasks normally associated with those performed by humans. CV software...

Downloads: 0 This Week

Last Update: 2026-01-05
See Project
13

TorchIO

Medical imaging toolkit for deep learning

...TorchIO is a Python package containing a set of tools to efficiently read, preprocess, sample, augment, and write 3D medical images in deep learning applications written in PyTorch, including intensity and spatial transforms for data augmentation and preprocessing. Transforms include typical computer vision operations such as random affine transformations and also domain-specific ones such as simulation of intensity artifacts due to MRI magnetic field inhomogeneity.

Downloads: 0 This Week

Last Update: 3 days ago
See Project
14

CO3D (Common Objects in 3D)

Tooling for the Common Objects In 3D dataset

CO3Dv2 (Common Objects in 3D, version 2) is a large-scale 3D computer vision dataset and toolkit from Facebook Research designed for training and evaluating category-level 3D reconstruction methods using real-world data. It builds upon the original CO3Dv1 dataset, expanding both scale and quality—featuring 2× more sequences and 4× more frames, with improved image fidelity, more accurate segmentation masks, and enhanced annotations for object-centric 3D reconstruction. ...

Downloads: 0 This Week

Last Update: 6 days ago
See Project
15

Kornia

Open Source Differentiable Computer Vision Library

Kornia is a differentiable computer vision library for PyTorch. It consists of a set of routines and differentiable modules to solve generic computer vision problems. At its core, the package uses PyTorch as its main backend, both for efficiency and to take advantage of reverse-mode auto-differentiation to define and compute the gradients of complex functions. Inspired by existing packages, this library is composed of a subset of packages containing operators that can be inserted within...

Downloads: 0 This Week

Last Update: 2025-11-08
See Project
16

CV-CUDA

CV-CUDA™ is an open-source, GPU-accelerated library

CV-CUDA is an open-source project that enables building efficient cloud-scale Artificial Intelligence (AI) imaging and computer vision (CV) applications. It uses graphics processing unit (GPU) acceleration to help developers build highly efficient pre- and post-processing pipelines. CV-CUDA originated as a collaborative effort between NVIDIA and ByteDance.

Downloads: 1 This Week

Last Update: 2025-11-15
See Project
17

Mesh R-CNN

Code for Mesh R-CNN, ICCV 2019

Mesh R-CNN is a 3D reconstruction and object understanding framework developed by Facebook Research that extends Mask R-CNN into the 3D domain. Built on top of Detectron2 and PyTorch3D, Mesh R-CNN enables end-to-end 3D mesh prediction directly from single RGB images. The model learns to detect, segment, and reconstruct detailed 3D mesh representations of objects in natural images, bridging the gap between 2D perception and 3D understanding. Unlike voxel-based or point-based approaches, Mesh...

Downloads: 0 This Week

Last Update: 4 days ago
See Project
18

OSWorld

Benchmarking Multimodal Agents for Open-Ended Tasks

OSWorld is an open-source benchmark and environment for evaluating multimodal agents on open-ended tasks in real computer environments. Agents interact with desktop operating systems and applications through screenshots and keyboard and mouse actions, and task outcomes are checked with execution-based evaluation.

Downloads: 0 This Week

Last Update: 2025-03-13
See Project
19

Instant Neural Graphics Primitives

Instant neural graphics primitives: lightning fast NeRF and more

...The framework is capable of reconstructing detailed 3D scenes from images and generating realistic views of those scenes in real time. Compared with earlier neural radiance field approaches, instant-ngp significantly reduces training time and computational requirements, enabling models to be trained within seconds or minutes on modern GPUs.

Downloads: 0 This Week

Last Update: 2026-03-10
See Project
20

The FreeMoCap Project

Free Motion Capture for Everyone

FreeMoCap is an open-source markerless motion capture system that enables users to record human movement using ordinary cameras and convert the footage into usable 3D motion data. The project’s goal is to democratize motion capture by removing the need for expensive suits or proprietary studio hardware, instead relying on computer vision and pose estimation pipelines. It processes synchronized video feeds to reconstruct skeletal motion, which can then be exported for animation, biomechanics research, or creative projects. ...

Downloads: 2 This Week

Last Update: 2026-02-19
See Project
21

Engine_3D

A real-time 3D engine for DOS-talgics (see GLOBE_3D for new stuff!)

Engine_3D is a real-time 3D engine project recovered from my old data. The entire 3D rendering is pure software: no OpenGL or the like, no 3D graphics card acceleration! And Engine_3D is programmed completely in Ada. Some parts of the 3D engine, like the animated bump mapping, are still waiting to be ported to my newer 3D project, GLOBE_3D! ... DOS and Linux versions available. Remarks for the DOS version: It is also a funny example of a game system almost fully in Ada,...

Downloads: 1 This Week

Last Update: 2026-01-23
See Project
22

UCO3D

Uncommon Objects in 3D dataset

uCO3D is a large-scale 3D vision dataset and toolkit centered on turn-table videos of everyday objects drawn from the LVIS taxonomy. It provides about 170,000 full videos of object instances rather than still frames, along with per-video annotations including object masks, calibrated camera poses, and multiple flavors of point clouds. Each sequence also ships with a precomputed 3D Gaussian Splat reconstruction, enabling fast, differentiable rendering workflows and modern implicit/point-based modeling experiments. ...

Downloads: 0 This Week

Last Update: 4 days ago
See Project
23

CogAgent

An open-source, end-to-end VLM-based GUI agent

...The model is designed for agent-style execution rather than freeform chat: it maintains a continuous execution history across steps and requires a fresh session for each new task. Inference supports BF16 on NVIDIA GPUs; INT8 and INT4 modes are optional, with a noted performance loss at INT4. Example CLIs and a web demo illustrate bounding-box outputs and operation categories.

Downloads: 2 This Week

Last Update: 5 days ago
See Project
24

CoordinateTransformations.jl

A fresh approach to coordinate transformations

CoordinateTransformations is a Julia package to manage simple or complex networks of coordinate system transformations. Transformations can be easily applied, inverted, composed, and differentiated (both with respect to the input coordinates and with respect to transformation parameters such as rotation angle). Transformations are designed to be light-weight and efficient enough for, e.g., real-time graphical applications, while support for both explicit and automatic differentiation makes...

Downloads: 0 This Week

Last Update: 2025-02-04
See Project
25

HomeRobot

Mobile manipulation research tools for roboticists

...It provides interfaces for Detic, Grounded-SAM, and Contact-GraspNet, allowing open-vocabulary detection and 3D grasping.

Downloads: 1 This Week

Last Update: 2025-10-08
See Project