Depth Estimation

803 papers with code • 14 benchmarks • 70 datasets

Depth Estimation is the task of estimating, for each pixel, the distance from the camera to the scene point it images. Depth is extracted from either monocular (single-view) or stereo (multi-view) images. Traditional methods use multi-view geometry to relate the images. Newer methods estimate depth directly by minimizing a regression loss, or by learning to generate a novel view from a sequence. The most popular benchmarks are KITTI and NYUv2. Models are typically evaluated with the root mean squared error (RMSE) metric.

Source: DIODE: A Dense Indoor and Outdoor DEpth Dataset
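
As a rough illustration of the root mean squared error used for evaluation, here is a minimal sketch in plain Python. The function name `rmse`, the `min_depth` and `max_depth` validity thresholds, and the sample values are assumptions made for this example; each benchmark defines its own valid-pixel mask and depth cap.

```python
import math
from typing import Iterable, Optional


def rmse(pred: Iterable[float], target: Iterable[float],
         min_depth: float = 1e-3,
         max_depth: Optional[float] = None) -> float:
    """Root mean squared error over pixels with usable ground truth."""
    squared_errors = []
    for p, t in zip(pred, target):
        # Assumption: missing ground truth is stored as 0, so depths below
        # min_depth (or above an optional cap) are skipped.
        if t < min_depth or (max_depth is not None and t > max_depth):
            continue
        squared_errors.append((p - t) ** 2)
    if not squared_errors:
        raise ValueError("no valid ground-truth pixels")
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Flattened 2x2 depth maps in metres; the last ground-truth pixel is missing.
pred = [1.0, 2.0, 3.0, 4.0]
gt = [1.0, 2.5, 2.0, 0.0]
print(rmse(pred, gt))
```

Only the three pixels with ground truth contribute here, so the prediction at the missing pixel does not affect the score.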

Benchmarks


These leaderboards track progress in Depth Estimation.

Dataset	Best Model
Stanford2D3D Panoramic	HiMODE
NYU-Depth V2	EVP
eBDtheque	Bhattacharjee et al.
DCM	Bhattacharjee et al.
ScanNet	Atlas (plain)
ScanNetV2	DELTAS
DIODE	AIP-Brown
Mars DTM Estimation	GLPDepth
KITTI 2015	H-Net (Ours) Full Eigen
4D Light Field Dataset	LFattNet
Taskonomy	X-TC (Cross-Task Consistency)
Cityscapes test	SDC-Depth
Matterport3D	UniFuse
KITTI Eigen split	LightDepth


Libraries

Use these libraries to find Depth Estimation models and implementations

huggingface/transformers • 3 papers • 125,545 stars

open-mmlab/mmdetection3d • 2 papers • 4,842 stars

google-research/big_vision • 2 papers • 1,570 stars

Datasets

Subtasks

3D Depth Estimation

Depth Map Super-Resolution

Stereo-LiDAR Fusion

Indoor Monocular Depth Estimation

Depth Aleatoric Uncertainty Estimation

Depth Image Upsampling

Latest papers


NeSLAM: Neural Implicit Mapping and Self-Supervised Feature Tracking With Depth Completion and Denoising

dtc111111/neslam • 29 Mar 2024

Second, the occupancy scene representation is replaced with Signed Distance Field (SDF) hierarchical scene representation for high-quality reconstruction and view synthesis.


UniDepth: Universal Monocular Metric Depth Estimation

lpiccinelli-eth/unidepth • 27 Mar 2024 • 345 stars

However, the remarkable accuracy of recent MMDE methods is confined to their training domains.

ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation

aradhye2002/ecodepth • 27 Mar 2024 • 101 stars

We argue that the embedding vector from a ViT model, pre-trained on a large dataset, captures greater relevant information for SIDE than the usual route of generating pseudo image captions, followed by CLIP based text embeddings.

ModaLink: Unifying Modalities for Efficient Image-to-PointCloud Place Recognition

haomo-ai/modalink • 27 Mar 2024

Experimental results on the KITTI dataset show that our proposed methods achieve state-of-the-art performance while running in real time.

DN-Splatter: Depth and Normal Priors for Gaussian Splatting and Meshing

maturk/dn-splatter • 26 Mar 2024 • 210 stars

3D Gaussian splatting, a novel differentiable rendering technique, has achieved state-of-the-art novel view synthesis results with high rendering speeds and relatively low training times.

Physical 3D Adversarial Attacks against Monocular Depth Estimation in Autonomous Driving

gandolfczjh/3d2fool • 26 Mar 2024

Deep learning-based monocular depth estimation (MDE), extensively applied in autonomous driving, is known to be vulnerable to adversarial attacks.

Metric3D v2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation

yvanyin/metric3d • Under review for Transaction 2024 • 22 Mar 2024 • 731 stars

For metric depth estimation, we show that the key to a zero-shot single-view model lies in resolving the metric ambiguity from various camera models and large-scale data training.

When Do We Not Need Larger Vision Models?

bfshi/scaling_on_scales • 19 Mar 2024 • 195 stars

Our results show that a multi-scale smaller model has comparable learning capacity to a larger model, and pre-training smaller models with S$^2$ can match or even exceed the advantage of larger models.

FeatUp: A Model-Agnostic Framework for Features at Any Resolution

mhamilton723/FeatUp • 15 Mar 2024 • 1,054 stars

Deep features are a cornerstone of computer vision research, capturing image semantics and enabling the community to solve downstream tasks even in the zero- or few-shot regime.

Robust Shape Fitting for 3D Scene Abstraction

fkluger/cuboids_revisited • 15 Mar 2024

A RANSAC estimator guided by a neural network fits these primitives to a depth map.
