TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Unsupervised Monocular Depth Estimation	Cityscapes	ManyDepth	RMSE	6.223	# 5
Unsupervised Monocular Depth Estimation	Cityscapes	ManyDepth	RMSE log	0.170	# 5
Unsupervised Monocular Depth Estimation	Cityscapes	ManyDepth	Square relative error (SqRel)	1.193	# 6
Unsupervised Monocular Depth Estimation	Cityscapes	ManyDepth	Absolute relative error (AbsRel)	0.114	# 7
Unsupervised Monocular Depth Estimation	Cityscapes	ManyDepth	Test frames	2	# 7

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/the-temporal-opportunist-self-supervised/unsupervised-monocular-depth-estimation-on)](https://paperswithcode.com/sota/unsupervised-monocular-depth-estimation-on?p=the-temporal-opportunist-self-supervised)`

The Temporal Opportunist: Self-Supervised Multi-Frame Monocular Depth

CVPR 2021 · Jamie Watson, Oisin Mac Aodha, Victor Prisacariu, Gabriel Brostow, Michael Firman ·

Self-supervised monocular depth estimation networks are trained to predict scene depth using nearby frames as a supervision signal during training. However, for many applications, sequence information in the form of video frames is also available at test time. The vast majority of monocular networks do not make use of this extra signal, thus ignoring valuable information that could be used to improve the predicted depth. Those that do, either use computationally expensive test-time refinement techniques or off-the-shelf recurrent networks, which only indirectly make use of the geometric information that is inherently available. We propose ManyDepth, an adaptive approach to dense depth estimation that can make use of sequence information at test time, when it is available. Taking inspiration from multi-view stereo, we propose a deep end-to-end cost volume based approach that is trained using self-supervision only. We present a novel consistency loss that encourages the network to ignore the cost volume when it is deemed unreliable, e.g. in the case of moving objects, and an augmentation scheme to cope with static cameras. Our detailed experiments on both KITTI and Cityscapes show that we outperform all published self-supervised baselines, including those that use single or multiple frames at test time.

PDF Abstract CVPR 2021 PDF CVPR 2021 Abstract

Code

Add Remove Mark official

nianticlabs/manydepth official

592

Tasks

Add Remove

Depth Estimation

Monocular Depth Estimation

Unsupervised Monocular Depth Estimation

Datasets

Cityscapes

KITTI

Results from the Paper

Edit

Ranked #7 on Unsupervised Monocular Depth Estimation on Cityscapes

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Unsupervised Monocular Depth Estimation	Cityscapes	ManyDepth	RMSE	6.223	# 5	Compare
			RMSE log	0.170	# 5	Compare
			Square relative error (SqRel)	1.193	# 6	Compare
			Absolute relative error (AbsRel)	0.114	# 7	Compare
			Test frames	2	# 7	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

The Temporal Opportunist: Self-Supervised Multi-Frame Monocular Depth

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove