PanoFormer: Panorama Transformer for Indoor 360 Depth Estimation

17 Mar 2022  ·  Zhijie Shen, Chunyu Lin, Kang Liao, Lang Nie, Zishuo Zheng, Yao Zhao ·

Existing panoramic depth estimation methods based on convolutional neural networks (CNNs) focus on removing panoramic distortions but fail to perceive panoramic structures efficiently due to the fixed receptive field of CNNs. This paper proposes a panorama transformer (named PanoFormer) to estimate depth in panoramic images, with tangent patches from the spherical domain, learnable token flows, and panorama-specific metrics. In particular, we divide patches on the spherical tangent domain into tokens to reduce the negative effect of panoramic distortions. Since geometric structures are essential for depth estimation, the self-attention module is redesigned with an additional learnable token flow. In addition, considering the characteristics of the spherical domain, we present two panorama-specific metrics to comprehensively evaluate the performance of panoramic depth estimation models. Extensive experiments demonstrate that our approach significantly outperforms state-of-the-art (SOTA) methods. Furthermore, the proposed method can be effectively extended to solve semantic panorama segmentation, a similar pixel2pixel task. Code will be available.
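The tangent-patch idea can be illustrated with the inverse gnomonic projection: a small grid on the tangent plane at a chosen sphere point is mapped back to spherical coordinates, giving nearly distortion-free sampling locations on the equirectangular image. Below is a minimal NumPy sketch of that sampling step (not the authors' implementation; the function names, patch size, and field of view are illustrative assumptions):

```python
import numpy as np

def tangent_patch_grid(lat0, lon0, patch=3, fov=np.pi / 18):
    """Sample spherical coords for one tangent-plane patch (a sketch).

    Inverse gnomonic projection: plane coordinates (x, y) on the
    tangent plane touching the sphere at (lat0, lon0) are mapped
    back to spherical (lat, lon). Returns two (patch, patch) arrays.
    """
    half = np.tan(fov / 2)                     # half-extent of the patch on the plane
    xs = np.linspace(-half, half, patch)
    x, y = np.meshgrid(xs, xs)                 # x varies along columns, y along rows
    rho = np.hypot(x, y)                       # distance from the tangent point
    c = np.arctan(rho)                         # angular distance on the sphere
    sin_c, cos_c = np.sin(c), np.cos(c)
    rho_safe = np.where(rho == 0, 1.0, rho)    # guard 0/0 at the patch centre
    lat = np.arcsin(cos_c * np.sin(lat0)
                    + y * sin_c * np.cos(lat0) / rho_safe)
    lon = lon0 + np.arctan2(
        x * sin_c,
        rho * np.cos(lat0) * cos_c - y * np.sin(lat0) * sin_c)
    return lat, lon

def to_pixels(lat, lon, h, w):
    """Convert spherical coords to equirectangular pixel coords."""
    u = (lon / (2 * np.pi) + 0.5) * (w - 1)    # longitude -> column
    v = (0.5 - lat / np.pi) * (h - 1)          # latitude  -> row
    return u, v
```

Each patch's sampled pixels would then be gathered (e.g. by bilinear interpolation) and flattened into one transformer token, so tokens near the poles are no more stretched than those at the equator.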

Task                   Dataset                  Model       Metric                    Metric Value   Global Rank
Depth Estimation       Stanford2D3D Panoramic   PanoFormer  RMSE                      0.3083         #4
Depth Estimation       Stanford2D3D Panoramic   PanoFormer  Absolute relative error   0.0405         #1
Semantic Segmentation  Stanford2D3D Panoramic   PanoFormer  mIoU                      48.9%          #16
Semantic Segmentation  Stanford2D3D Panoramic   PanoFormer  mAcc                      64.5%          #8
