Depth Estimation

799 papers with code • 14 benchmarks • 70 datasets

Depth Estimation is the task of measuring the distance of each pixel from the camera. Depth is extracted from either monocular (single-view) or stereo (multiple views of a scene) images. Traditional methods use multi-view geometry to find the relationship between the images. Newer methods estimate depth directly, either by minimizing a regression loss or by learning to generate a novel view from a sequence. The most popular benchmarks are KITTI and NYUv2. Models are typically evaluated using a root-mean-square error (RMSE) metric.

Source: DIODE: A Dense Indoor and Outdoor DEpth Dataset
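As a rough illustration of the RMS metric mentioned in the description, the sketch below computes the root-mean-square error between a predicted and a ground-truth depth map. The function name `rmse`, the `min_depth` threshold, and the convention that a ground-truth value of 0.0 marks a missing measurement are assumptions made for this example; individual benchmarks define their own masking and depth-range rules. Plain Python lists are used to keep the example dependency-free.

```python
import math


def rmse(pred, target, min_depth=1e-3):
    """Root-mean-square error between two depth maps given as nested lists.

    Pixels whose ground-truth depth is below `min_depth` are treated as
    missing and skipped (an assumed convention, not a benchmark rule).
    """
    squared_errors = []
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            if t >= min_depth:  # keep only pixels with valid ground truth
                squared_errors.append((p - t) ** 2)
    if not squared_errors:
        raise ValueError("no valid ground-truth pixels")
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Toy 2x2 depth maps in metres; 0.0 marks a pixel with no measurement.
predicted = [[2.0, 4.0], [1.0, 3.0]]
ground_truth = [[2.0, 2.0], [0.0, 3.0]]
print(rmse(predicted, ground_truth))
```

Only the three pixels with valid ground truth contribute here, so the single 2-metre error is averaged over three pixels before the square root is taken.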

Benchmarks

These leaderboards are used to track progress in Depth Estimation

Dataset	Best Model
Stanford2D3D Panoramic	HiMODE
NYU-Depth V2	EVP
eBDtheque	Bhattacharjee et al.
DCM	Bhattacharjee et al.
ScanNet	Atlas (plain)
ScanNetV2	DELTAS
DIODE	AIP-Brown
Mars DTM Estimation	GLPDepth
KITTI 2015	H-Net (Ours) Full Eigen
4D Light Field Dataset	LFattNet
Taskonomy	X-TC (Cross-Task Consistency)
Cityscapes test	SDC-Depth
Matterport3D	UniFuse
KITTI Eigen split	LightDepth

Libraries

Use these libraries to find Depth Estimation models and implementations

huggingface/transformers • 3 papers • 124,984

open-mmlab/mmdetection3d • 2 papers • 4,810

google-research/big_vision • 2 papers • 1,554

Datasets

Subtasks

3D Depth Estimation

Depth Map Super-Resolution

Stereo-LiDAR Fusion

Indoor Monocular Depth Estimation

Depth Aleatoric Uncertainty Estimation

Depth Image Upsampling

Latest papers

Metric3D v2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation

yvanyin/metric3d • Under review for Transaction 2024

Our method benefits various applications, including in-the-wild metrology, monocular-SLAM, and 3D reconstruction, which highlights the versatility of Metric3D v2 models as geometric foundation models.

604

19 Apr 2024


Virtually Enriched NYU Depth V2 Dataset for Monocular Depth Estimation: Do We Need Artificial Augmentation?

abrain-one/anyu • 15 Apr 2024

We present ANYU, a new virtually augmented version of the NYU depth v2 dataset, designed for monocular depth estimation.


On the Robustness of Language Guidance for Low-Level Vision Tasks: Findings from Depth Estimation

agneet42/robustness_depth_lang • 12 Apr 2024

Recent advances in monocular depth estimation have been made by incorporating natural language as additional guidance.


Boosting Self-Supervision for Single-View Scene Completion via Knowledge Distillation

faceonlive/ai-research • 11 Apr 2024

In this work, we propose to fuse the scene reconstruction from multiple images and distill this knowledge into a more accurate single-view scene reconstruction.

152


RoadBEV: Road Surface Reconstruction in Bird's Eye View

ztsrxh/roadbev • 9 Apr 2024

This paper uniformly proposes two simple yet effective models for road elevation reconstruction in BEV named RoadBEV-mono and RoadBEV-stereo, which estimate road elevation with monocular and stereo images, respectively.


Know Your Neighbors: Improving Single-View Reconstruction via Spatial Vision-Language Reasoning

ruili3/Know-Your-Neighbors • 4 Apr 2024

We propose KYN, a novel method for single-view scene reconstruction that reasons about semantic and spatial context to predict each point's density.


WorDepth: Variational Language Prior for Monocular Depth Estimation

adonis-galaxy/wordepth • 4 Apr 2024

To test this, we focus on monocular depth estimation, the problem of predicting a dense depth map from a single image, but with an additional text caption describing the scene.


MonoCD: Monocular 3D Object Detection with Complementary Depths

elvintanhust/monocd • 4 Apr 2024

Monocular 3D object detection has attracted widespread attention due to its potential to accurately obtain object 3D localization from a single image at a low cost.


VSRD: Instance-Aware Volumetric Silhouette Rendering for Weakly Supervised 3D Object Detection

skmhrk1209/VSRD • 2 Apr 2024

In the auto-labeling stage, we represent the surface of each instance as a signed distance field (SDF) and render its silhouette as an instance mask through our proposed instance-aware volumetric silhouette rendering.


NeSLAM: Neural Implicit Mapping and Self-Supervised Feature Tracking With Depth Completion and Denoising

dtc111111/neslam • 29 Mar 2024

Second, the occupancy scene representation is replaced with Signed Distance Field (SDF) hierarchical scene representation for high-quality reconstruction and view synthesis.

