TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Monocular Depth Estimation	KITTI Eigen split	FutureDepth	absolute relative error	0.041	# 2
Monocular Depth Estimation	KITTI Eigen split	FutureDepth	RMSE	1.856	# 6
Monocular Depth Estimation	KITTI Eigen split	FutureDepth	Sq Rel	0.117	# 24
Monocular Depth Estimation	KITTI Eigen split	FutureDepth	RMSE log	0.066	# 5
Monocular Depth Estimation	KITTI Eigen split	FutureDepth	Delta < 1.25	0.984	# 4
Monocular Depth Estimation	KITTI Eigen split	FutureDepth	Delta < 1.25^2	0.998	# 1
Monocular Depth Estimation	KITTI Eigen split	FutureDepth	Delta < 1.25^3	1.000	# 1
Monocular Depth Estimation	KITTI Eigen split	FutureDepth	Square relative error (SqRel)	0.117	# 1
Monocular Depth Estimation	NYU-Depth V2	FutureDepth	RMSE	0.233	# 9
Monocular Depth Estimation	NYU-Depth V2	FutureDepth	absolute relative error	0.063	# 9
Monocular Depth Estimation	NYU-Depth V2	FutureDepth	Delta < 1.25	0.981	# 4
Monocular Depth Estimation	NYU-Depth V2	FutureDepth	Delta < 1.25^2	0.996	# 8
Monocular Depth Estimation	NYU-Depth V2	FutureDepth	Delta < 1.25^3	0.999	# 4
Monocular Depth Estimation	NYU-Depth V2	FutureDepth	log 10	0.027	# 6

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/futuredepth-learning-to-predict-the-future/monocular-depth-estimation-on-kitti-eigen)](https://paperswithcode.com/sota/monocular-depth-estimation-on-kitti-eigen?p=futuredepth-learning-to-predict-the-future)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/futuredepth-learning-to-predict-the-future/monocular-depth-estimation-on-nyu-depth-v2)](https://paperswithcode.com/sota/monocular-depth-estimation-on-nyu-depth-v2?p=futuredepth-learning-to-predict-the-future)`

FutureDepth: Learning to Predict the Future Improves Video Depth Estimation

19 Mar 2024 · Rajeev Yasarla, Manish Kumar Singh, Hong Cai, Yunxiao Shi, Jisoo Jeong, Yinhao Zhu, Shizhong Han, Risheek Garrepalli, Fatih Porikli ·

In this paper, we propose a novel video depth estimation approach, FutureDepth, which enables the model to implicitly leverage multi-frame and motion cues to improve depth estimation by making it learn to predict the future at training. More specifically, we propose a future prediction network, F-Net, which takes the features of multiple consecutive frames and is trained to predict multi-frame features one time step ahead iteratively. In this way, F-Net learns the underlying motion and correspondence information, and we incorporate its features into the depth decoding process. Additionally, to enrich the learning of multiframe correspondence cues, we further leverage a reconstruction network, R-Net, which is trained via adaptively masked auto-encoding of multiframe feature volumes. At inference time, both F-Net and R-Net are used to produce queries to work with the depth decoder, as well as a final refinement network. Through extensive experiments on several benchmarks, i.e., NYUDv2, KITTI, DDAD, and Sintel, which cover indoor, driving, and open-domain scenarios, we show that FutureDepth significantly improves upon baseline models, outperforms existing video depth estimation methods, and sets new state-of-the-art (SOTA) accuracy. Furthermore, FutureDepth is more efficient than existing SOTA video depth estimation models and has similar latencies when comparing to monocular models

PDF Abstract

Code

Add Remove Mark official

No code implementations yet. Submit your code now

Tasks

Add Remove

Decoder

Depth Estimation

Future prediction

Monocular Depth Estimation

Datasets

KITTI

NYUv2

DDAD

Results from the Paper

Add Remove

Ranked #2 on Monocular Depth Estimation on KITTI Eigen split

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Monocular Depth Estimation	KITTI Eigen split	FutureDepth	absolute relative error	0.041	# 2	Compare
			RMSE	1.856	# 6	Compare
			Sq Rel	0.117	# 24	Compare
			RMSE log	0.066	# 5	Compare
			Delta < 1.25	0.984	# 4	Compare
			Delta < 1.25^2	0.998	# 1	Compare
			Delta < 1.25^3	1.000	# 1	Compare
			Square relative error (SqRel)	0.117	# 1	Compare
Monocular Depth Estimation	NYU-Depth V2	FutureDepth	RMSE	0.233	# 9	Compare
			absolute relative error	0.063	# 9	Compare
			Delta < 1.25	0.981	# 4	Compare
			Delta < 1.25^2	0.996	# 8	Compare
			Delta < 1.25^3	0.999	# 4	Compare
			log 10	0.027	# 6	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

FutureDepth: Learning to Predict the Future Improves Video Depth Estimation

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit Add Remove

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Add Remove

Methods

Add Remove