This project optimizes the inference performance of various monocular depth estimation (and optical flow) models using NVIDIA's TensorRT. It provides a pipeline that converts pre-trained PyTorch models to ONNX format and then to TensorRT engines, enabling a comparative analysis of inference speeds.
- Key Features:
  - A TensorRT conversion pipeline for a range of monocular depth estimation models.
  - Performance comparison (FPS, inference time) between the original PyTorch models and their TensorRT-optimized counterparts.
  - Generation of 3D depth information and point clouds from 2D images.
- Hardware: NVIDIA RTX3060 (notebook)
- OS: Windows Subsystem for Linux (WSL)
- Linux Distribution: Ubuntu 22.04.5 LTS
- CUDA Version: 12.9
# Create and activate a Conda virtual environment
conda create -n trte python=3.12 --yes
conda activate trte
# Install the required libraries
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu128
pip install cuda-python
pip install tensorrt
pip install onnx
pip install opencv-python
pip install matplotlib
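After installation, a quick sanity check like the following (run inside the activated `trte` environment) confirms that PyTorch sees the GPU and that TensorRT imports correctly:

```python
# Environment sanity check: report PyTorch/TensorRT versions and GPU visibility.
import torch

print("PyTorch:", torch.__version__)
print("CUDA available:", torch.cuda.is_available())

try:
    import tensorrt as trt
    print("TensorRT:", trt.__version__)
except ImportError:
    print("TensorRT is not installed in this environment")
```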
Each model directory contains a README.md file with detailed instructions.
| Model Name | Link to TensorRT Conversion | Main Outputs |
|---|---|---|
| Depth Anything V2 | TensorRT Conversion | Depth |
| Distill Any Depth | TensorRT Conversion | Depth |
| Depth Anything AC | TensorRT Conversion | Depth |
| Depth Pro | TensorRT Conversion | Depth, Focal Length |
| Uni Depth V2 | TensorRT Conversion | Depth, Point Cloud, Intrinsics |
| Metric3D V2 | TensorRT Conversion | Depth, Surface Normal |
| UniK3D | TensorRT Conversion | Depth, Point Cloud |
| MoGe-2 | TensorRT Conversion | Depth, Point Cloud, Surface Normal, FOV |
| VGGT | TensorRT Conversion | Depth, Point Cloud, Camera Pose |
| StreamVGGT | TensorRT Conversion | Depth, Point Cloud, Camera Pose |
| RAFT | TensorRT Conversion | Optical Flow |
| MeFlow | TensorRT Conversion | Optical Flow |
| MEMFOF | TensorRT Conversion | Optical Flow |
| NeuFlow | TensorRT Conversion | Optical Flow |
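Several models above (Uni Depth V2, UniK3D, MoGe-2) output point clouds alongside depth. The back-projection from a metric depth map to 3D points follows the standard pinhole camera model; the sketch below uses illustrative intrinsics (`fx`, `fy`, `cx`, `cy`), not values from any model:

```python
# Sketch: back-project an (H, W) metric depth map to an (H*W, 3) point cloud
# using the pinhole camera model. Intrinsics here are illustrative only.
import numpy as np

def depth_to_point_cloud(depth, fx, fy, cx, cy):
    """Convert a metric depth map into a point cloud in camera coordinates."""
    h, w = depth.shape
    u, v = np.meshgrid(np.arange(w), np.arange(h))  # pixel coordinates
    z = depth
    x = (u - cx) * z / fx  # pinhole back-projection
    y = (v - cy) * z / fy
    return np.stack([x, y, z], axis=-1).reshape(-1, 3)

depth = np.full((4, 4), 2.0)  # toy 4x4 depth map, every point at 2 m
pts = depth_to_point_cloud(depth, fx=500.0, fy=500.0, cx=2.0, cy=2.0)
print(pts.shape)  # (16, 3)
```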
- Unified Inference Script: Create a single inference script that accepts the model name as an argument to improve user experience.
- Summarize Performance Analysis: Add a table to the main README.md that summarizes the performance of all models (including input resolution, precision, and hardware details) for easy comparison.
- Docker Support: Add a Dockerfile to facilitate environment setup and ensure reproducibility.
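The unified inference script idea could take shape as a simple dispatch on a `--model` argument. Everything here is a hypothetical sketch: the model names, the lambda handlers, and the CLI flags are placeholders for the real per-model inference code.

```python
# Sketch of a unified inference entry point: dispatch on --model.
# The handlers below are stand-ins for real TensorRT inference functions.
import argparse

MODELS = {
    "depth_anything_v2": lambda img: f"depth map for {img}",
    "depth_pro": lambda img: f"depth + focal length for {img}",
    "raft": lambda img: f"optical flow for {img}",
}

def main(argv=None):
    parser = argparse.ArgumentParser(description="Unified TensorRT inference")
    parser.add_argument("--model", choices=sorted(MODELS), required=True)
    parser.add_argument("--image", default="input.jpg")
    args = parser.parse_args(argv)
    result = MODELS[args.model](args.image)
    print(result)
    return result

result = main(["--model", "depth_anything_v2", "--image", "room.jpg"])
```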