Search Results for "convert 2d image to 3d model"

Showing 21 open source projects for "convert 2d image to 3d model"

View related business solutions
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 1
    SAM 3D Objects

    SAM 3D Objects

    Models for object and human mesh reconstruction

    SAM 3D Objects is a foundation model that reconstructs full 3D geometry, texture, and spatial layout of objects and scenes from a single image. Given one RGB image and object masks (for example, from the Segment Anything family), it can generate a textured 3D mesh for each object, including pose and approximate scene layout. The model is specifically designed to be robust in real-world images with clutter, occlusions, small objects, and unusual viewpoints, where many earlier 3D-from-image...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 2
    TRELLIS 2

    TRELLIS 2

    Native and Compact Structured Latents for 3D Generation

    TRELLIS.2 is a cutting-edge open-source model and codebase for high-fidelity 3D asset generation from 2D images, developed to push forward the state of the art in image-to-3D generation. At its core is a novel sparse voxel structure called O-Voxel that jointly encodes both geometry and surface appearance, enabling reconstruction and generation of complex 3D shapes with arbitrary topology, open surfaces, and physically based rendering (PBR) textures. ...
    Downloads: 30 This Week
    Last Update:
    See Project
  • 3
    Mesh R-CNN

    Mesh R-CNN

    code for Mesh R-CNN, ICCV 2019

    Mesh R-CNN is a 3D reconstruction and object understanding framework developed by Facebook Research that extends Mask R-CNN into the 3D domain. Built on top of Detectron2 and PyTorch3D, Mesh R-CNN enables end-to-end 3D mesh prediction directly from single RGB images. The model learns to detect, segment, and reconstruct detailed 3D mesh representations of objects in natural images, bridging the gap between 2D perception and 3D understanding. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Stable Virtual Camera

    Stable Virtual Camera

    Stable Virtual Camera: Generative View Synthesis with Diffusion Models

    Stable Virtual Camera is a multi-view diffusion model developed by Stability AI that transforms 2D images into immersive 3D videos with realistic depth and perspective. Unlike traditional methods that require complex reconstruction or scene-specific optimization, this model allows users to generate novel views from any number of input images and define custom camera trajectories, enabling dynamic exploration of scenes. It supports various aspect ratios and can produce 3D-consistent videos up...
    Downloads: 2 This Week
    Last Update:
    See Project
  • Add Two Lines of Code. Get Full APM. Icon
    Add Two Lines of Code. Get Full APM.

    AppSignal installs in minutes and auto-configures dashboards, alerts, and error tracking.

    Works out of the box for Rails, Django, Express, Phoenix, and more. Monitoring exceptions and performance in no time.
    Start Free
  • 5
    glslViewer

    glslViewer

    Console-based GLSL Sandbox for 2D/3D shaders shaders

    GlslViewer is a flexible console-based OpenGL Sandbox to display 2D/3D GLSL shaders without the need of a UI. You can definitely make your own UI or wrapper using the Python Module (include) or any other tool that communicates back/forth with the GPS viewer through the standard POSIX console In/Out or OSC. Default vert/frag shaders for 2D shader and 3D material shaders with PBR lighting model. Hot reload of files on changes.
    Downloads: 18 This Week
    Last Update:
    See Project
  • 6
    Depth Anything 3

    Depth Anything 3

    Recovering the Visual Space from Any Views

    Depth Anything 3 is a research-driven project that brings accurate and dense depth estimation to any input image or video, enabling foundational understanding of 3D structure from 2D visual content. Designed to work across diverse scenes, lighting conditions, and image types, it uses advanced neural networks trained on large, heterogeneous datasets, producing depth maps that reveal scene depth relationships and object surfaces with strong fidelity.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 7
    Make-A-Video - Pytorch (wip)

    Make-A-Video - Pytorch (wip)

    Implementation of Make-A-Video, new SOTA text to video generator

    Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch. They combine pseudo-3d convolutions (axial convolutions) and temporal attention and show much better temporal fusion. The pseudo-3d convolutions isn't a new concept. It has been explored before in other contexts, say for protein contact prediction as "dimensional hybrid residual networks". The gist of the paper comes down to, take a SOTA text-to-image model (here they use DALL-E2, but the same learning points would easily apply to Imagen), make a few minor modifications for attention across time and other ways to skimp on the compute cost, do frame interpolation correctly, get a great video model out. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 8
    Stable-Dreamfusion

    Stable-Dreamfusion

    Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion

    A pytorch implementation of the text-to-3D model Dreamfusion, powered by the Stable Diffusion text-to-2D model. This project is a work-in-progress and contains lots of differences from the paper. The current generation quality cannot match the results from the original paper, and many prompts still fail badly! Since the Imagen model is not publicly available, we use Stable Diffusion to replace it (implementation from diffusers).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    PIFuHD

    PIFuHD

    High-Resolution 3D Human Digitization from A Single Image

    PIFuHD (Pixel-Aligned Implicit Function for 3D human reconstruction at high resolution) is a method and codebase to reconstruct high-fidelity 3D human meshes from a single image. It extends prior PIFu work by increasing resolution and detail, enabling fine geometry in cloth folds, hair, and subtle surface features. The method operates by learning an implicit occupancy / surface function conditioned on the image and camera projection; at inference time it queries dense points to reconstruct a...
    Downloads: 2 This Week
    Last Update:
    See Project
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 10
    Point-E

    Point-E

    Point cloud diffusion for 3D model synthesis

    point-e is the official repository for Point-E, a generative model developed by OpenAI that produces 3D point clouds from textual (or image) prompts. Its principal advantage is speed: it can generate 3D assets in just 1–2 minutes on a single GPU, which is significantly faster than many competing text-to-3D models. The model works via a two-stage diffusion approach: first, it uses a text → image diffusion network to produce a synthetic 2D view consistent with the prompt; then a second diffusion model converts that image into a 3D point cloud. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    FrankMocap

    FrankMocap

    A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator

    FrankMocap is a monocular 3D human capture system that estimates body, hand, and optionally face pose from a single RGB image or video. It regresses parametric human models (e.g., SMPL/SMPL-X) directly, producing temporally stable meshes and joint angles suitable for animation or analytics. The pipeline couples a robust 2D keypoint detector with 3D mesh regression networks and priors that keep results anatomically plausible.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    VRN

    VRN

    Code for "Large Pose 3D Face Reconstruction

    The VRN (Volumetric Regression Network) repository implements the “Large Pose 3D Face Reconstruction from a Single Image via Direct Volumetric CNN Regression” method. Instead of explicitly fitting a 3D model via landmark estimation and deformation, VRN treats the reconstruction task as volumetric segmentation: it learns a CNN to regress a 3D volume aligned to the input image, and then extracts a mesh via isosurface from that volume. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    DensePose

    DensePose

    A real-time approach for mapping all human pixels of 2D RGB images

    ...DensePose is widely used in augmented reality, motion capture, virtual try-on, and visual effects applications because it enables real-time 3D human mapping from 2D inputs. The model architecture builds on Mask R-CNN, using additional regression heads to predict UV coordinates that map image pixels to 3D surfaces.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    PotterDraw

    PotterDraw

    Design your own pottery for 3D printing in full color

    PotterDraw is a free 3D software application for designing and visualizing pottery. It’s useful for creating cylindrical shapes such as pots, flasks, vases, bowls, plates, cups, glasses, goblets, or anything similar. The outer wall of the pot is drawn using a spline, which consists of one or more curved or linear segments. The inner wall is generated automatically. PotterDraw can export files suitable for 3D printing. It can also map synthesized or user-specified textures onto the pot....
    Leader badge
    Downloads: 13 This Week
    Last Update:
    See Project
  • 15

    Goki Check

    Goki check aims to detect context from an image in deterministic way.

    From 3D model, goki check matches 2D image.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    STL 3dit

    STL 3dit

    An STL model viewer, editor, and creator for manipulating STL files.

    This software allows a person to open, edit, create, modify, view, and save 3D STL models. It can also be used to extrude 2D images into 3D objects. You can also insert 3D closed primitives. The new STLX file format supports groups, materials, as wells as colors and generates much smaller files on average (compared to binary STL files). NOTE: This latest build targets only 64-bit Windows Operating Systems due to the necessity of being able to manage (and navigate) very large 3D...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 17

    J3DMaker

    a java based software to convert 2D to 3D images.

    a java based software to convert 2D to 3D images. These 3D images can be viewed by Red-Cyan glasses. SUPPORT ME BY GETTING A COPY OF J3DMAKER FOR ANDROID FROM HERE: https://play.google.com/store/apps/details?id=com.deepakpk.j3dmaker
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    vol2im

    Convert volumetric data to images.

    This program embeds volumetric data in various image files, using their RGBA or RGB channels. Each slice of the original model is represented as one channel of the output file. The output images can be used as 3D textures to perform volumetric visualization in GLSL.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    LARMcess
    LARMcess is that "Linux Augmented Reality 3d Marker" . We will developing 3D Marker based on 2D Barcode(QR Code) on android OS. we can get the content's information from 2D barcode using android phone and display the content's 3D object and annou
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    The openLBMflow is an fast fluid flow solver based on Lattice Boltzmann Method. Main future are 2D and 3D code, single and multiphase models, Output data in VTK format can be directly open in Paraview. Download Windows or Linux binary version now.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    2D 3D image registration including volume-image registration and model-image registration.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB