TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Depth Prediction	KITTI 2015	H-Net (Ours) Full Eigen	RMSE log	0.166	# 1
Depth Estimation	KITTI 2015	H-Net (Ours) Full Eigen	Absolute relative error (AbsRel)	0.076	# 1
Depth Estimation	KITTI 2015	H-Net (Ours) Full Eigen	Sq Rel	0.607	# 1
Depth Estimation	KITTI 2015	H-Net (Ours) Full Eigen	RMSE	0.04025	# 1
Depth Prediction	KITTI 2015	H-Net (Ours)	RMSE	0.04197	# 1
Depth Prediction	KITTI 2015	H-Net (Ours)	RMSE log	0.174	# 2
Depth Estimation	KITTI 2015	H-Net (Ours)	Absolute relative error (AbsRel)	0.094	# 2
Depth Estimation	KITTI 2015	H-Net (Ours)	Sq Rel	0.6	# 2

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/h-net-unsupervised-attention-based-stereo/depth-prediction-on-kitti-2015)](https://paperswithcode.com/sota/depth-prediction-on-kitti-2015?p=h-net-unsupervised-attention-based-stereo)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/h-net-unsupervised-attention-based-stereo/depth-estimation-on-kitti-2015)](https://paperswithcode.com/sota/depth-estimation-on-kitti-2015?p=h-net-unsupervised-attention-based-stereo)`

H-Net: Unsupervised Attention-based Stereo Depth Estimation Leveraging Epipolar Geometry

22 Apr 2021 · Baoru Huang, Jian-Qing Zheng, Stamatia Giannarou, Daniel S. Elson ·

Depth estimation from a stereo image pair has become one of the most explored applications in computer vision, with most of the previous methods relying on fully supervised learning settings. However, due to the difficulty in acquiring accurate and scalable ground truth data, the training of fully supervised methods is challenging. As an alternative, self-supervised methods are becoming more popular to mitigate this challenge. In this paper, we introduce the H-Net, a deep-learning framework for unsupervised stereo depth estimation that leverages epipolar geometry to refine stereo matching. For the first time, a Siamese autoencoder architecture is used for depth estimation which allows mutual information between the rectified stereo images to be extracted. To enforce the epipolar constraint, the mutual epipolar attention mechanism has been designed which gives more emphasis to correspondences of features which lie on the same epipolar line while learning mutual information between the input stereo pair. Stereo correspondences are further enhanced by incorporating semantic information to the proposed attention mechanism. More specifically, the optimal transport algorithm is used to suppress attention and eliminate outliers in areas not visible in both cameras. Extensive experiments on KITTI2015 and Cityscapes show that our method outperforms the state-ofthe-art unsupervised stereo depth estimation methods while closing the gap with the fully supervised approaches.

PDF Abstract

Code

Add Remove Mark official

No code implementations yet. Submit your code now

Tasks

Add Remove

Depth Estimation

Depth Prediction

Stereo Depth Estimation

Stereo Matching

Datasets

Cityscapes

KITTI

Results from the Paper

Edit

Ranked #1 on Depth Prediction on KITTI 2015

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Depth Prediction	KITTI 2015	H-Net (Ours) Full Eigen	RMSE log	0.166	# 1	Compare
Depth Estimation	KITTI 2015	H-Net (Ours) Full Eigen	Absolute relative error (AbsRel)	0.076	# 1	Compare
			Sq Rel	0.607	# 1	Compare
			RMSE	0.04025	# 1	Compare
Depth Prediction	KITTI 2015	H-Net (Ours)	RMSE	0.04197	# 1	Compare
Depth Prediction	KITTI 2015	H-Net (Ours)	RMSE log	0.174	# 2	Compare
Depth Estimation	KITTI 2015	H-Net (Ours)	Absolute relative error (AbsRel)	0.094	# 2	Compare
Depth Estimation	KITTI 2015	H-Net (Ours)	Sq Rel	0.6	# 2	Compare

Methods

Add Remove

AutoEncoder

Edit Social Preview

H-Net: Unsupervised Attention-based Stereo Depth Estimation Leveraging Epipolar Geometry

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove