Depth Estimation

799 papers with code • 14 benchmarks • 70 datasets

Depth Estimation is the task of measuring the distance of each pixel relative to the camera. Depth is extracted from either monocular (single) or stereo (multiple views of a scene) images. Traditional methods use multi-view geometry to find the relationship between the images. Newer methods can directly estimate depth by minimizing the regression loss, or by learning to generate a novel view from a sequence. The most popular benchmarks are KITTI and NYUv2. Models are typically evaluated according to a RMS metric.

Source: DIODE: A Dense Indoor and Outdoor DEpth Dataset

Libraries

Use these libraries to find Depth Estimation models and implementations

Metric3D v2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation

yvanyin/metric3d Under review for Transaction 2024

Our method benefits various applications including in-the-wild metrology monocular-SLAM, and 3D reconstruction, which highlight the versatility of Metric3D v2 models as geometric foundation models.

604
19 Apr 2024

Virtually Enriched NYU Depth V2 Dataset for Monocular Depth Estimation: Do We Need Artificial Augmentation?

abrain-one/anyu 15 Apr 2024

We present ANYU, a new virtually augmented version of the NYU depth v2 dataset, designed for monocular depth estimation.

0
15 Apr 2024

On the Robustness of Language Guidance for Low-Level Vision Tasks: Findings from Depth Estimation

agneet42/robustness_depth_lang 12 Apr 2024

Recent advances in monocular depth estimation have been made by incorporating natural language as additional guidance.

5
12 Apr 2024

Boosting Self-Supervision for Single-View Scene Completion via Knowledge Distillation

faceonlive/ai-research 11 Apr 2024

In this work, we propose to fuse the scene reconstruction from multiple images and distill this knowledge into a more accurate single-view scene reconstruction.

152
11 Apr 2024

RoadBEV: Road Surface Reconstruction in Bird's Eye View

ztsrxh/roadbev 9 Apr 2024

This paper uniformly proposes two simple yet effective models for road elevation reconstruction in BEV named RoadBEV-mono and RoadBEV-stereo, which estimate road elevation with monocular and stereo images, respectively.

62
09 Apr 2024

Know Your Neighbors: Improving Single-View Reconstruction via Spatial Vision-Language Reasoning

ruili3/Know-Your-Neighbors 4 Apr 2024

We propose KYN, a novel method for single-view scene reconstruction that reasons about semantic and spatial context to predict each point's density.

53
04 Apr 2024

WorDepth: Variational Language Prior for Monocular Depth Estimation

adonis-galaxy/wordepth 4 Apr 2024

To test this, we focus on monocular depth estimation, the problem of predicting a dense depth map from a single image, but with an additional text caption describing the scene.

13
04 Apr 2024

MonoCD: Monocular 3D Object Detection with Complementary Depths

elvintanhust/monocd 4 Apr 2024

Monocular 3D object detection has attracted widespread attention due to its potential to accurately obtain object 3D localization from a single image at a low cost.

11
04 Apr 2024

VSRD: Instance-Aware Volumetric Silhouette Rendering for Weakly Supervised 3D Object Detection

skmhrk1209/VSRD 2 Apr 2024

In the auto-labeling stage, we represent the surface of each instance as a signed distance field (SDF) and render its silhouette as an instance mask through our proposed instance-aware volumetric silhouette rendering.

18
02 Apr 2024

NeSLAM: Neural Implicit Mapping and Self-Supervised Feature Tracking With Depth Completion and Denoising

dtc111111/neslam 29 Mar 2024

Second, the occupancy scene representation is replaced with Signed Distance Field (SDF) hierarchical scene representation for high-quality reconstruction and view synthesis.

1
29 Mar 2024