ShapeConv: Shape-aware Convolutional Layer for Indoor RGB-D Semantic Segmentation

RGB-D semantic segmentation has attracted increasing attention over the past few years. Existing methods mostly employ homogeneous convolution operators to consume the RGB and depth features, ignoring their intrinsic differences. In fact, the RGB values capture the photometric appearance properties in the projected image space, while the depth feature encodes both the shape of a local geometry and its base (i.e., its whereabouts in a larger context). Compared with the base, the shape is likely more intrinsic and more strongly connected to the semantics, and is thus more critical for segmentation accuracy. Inspired by this observation, we introduce a Shape-aware Convolutional layer (ShapeConv) for processing the depth feature: the depth feature is first decomposed into a shape component and a base component, two learnable weights are then applied to the two components independently, and finally a convolution is performed on the re-weighted combination of the two. ShapeConv is model-agnostic and can be easily integrated into most CNNs to replace vanilla convolutional layers for semantic segmentation. Extensive experiments on three challenging indoor RGB-D semantic segmentation benchmarks, i.e., NYU-Dv2(-13,-40), SUN RGB-D, and SID, demonstrate the effectiveness of ShapeConv when employed over five popular architectures. Moreover, CNNs with ShapeConv gain this performance without any increase in computation or memory at inference time. The reason is that the learned weights balancing the importance of the shape and base components become constants in the inference phase and can therefore be fused into the following convolution, yielding a network identical to one with vanilla convolutional layers.
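The decomposition described above can be sketched in a few lines. The following is a minimal, single-channel NumPy illustration (not the authors' implementation): within each convolution window, the base component is taken as the patch mean and the shape component as the residual, each scaled by its own learnable scalar before the dot product with the kernel. The names `w_base` and `w_shape` are illustrative.

```python
import numpy as np

def shapeconv2d(x, kernel, w_base, w_shape):
    """Sketch of a shape-aware convolution on a single-channel map.

    Each k x k patch is split into a base component (its mean) and a
    shape component (the residual); each is scaled by a learnable
    scalar before the usual dot product with the kernel.
    """
    k = kernel.shape[0]
    H, W = x.shape
    out = np.zeros((H - k + 1, W - k + 1))
    for i in range(H - k + 1):
        for j in range(W - k + 1):
            patch = x[i:i + k, j:j + k]
            base = patch.mean()                    # base component
            shape = patch - base                   # shape component
            rew = w_base * base + w_shape * shape  # re-weighted patch
            out[i, j] = np.sum(rew * kernel)
    return out
```

Because the operation is linear in the patch, once `w_base` and `w_shape` are frozen after training they can be folded into the kernel itself: `K_fused = w_shape * K + (w_base - w_shape) * K.sum() / k**2`, so a vanilla convolution with `K_fused` reproduces ShapeConv exactly, which is the inference-time fusion the abstract refers to.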

ICCV 2021
| Task                       | Dataset                  | Model                   | Metric         | Value | Global Rank |
|----------------------------|--------------------------|-------------------------|----------------|-------|-------------|
| Semantic Segmentation      | GAMUS                    | ShapeConv               | mIoU           | 55.86 | #5          |
| Semantic Segmentation      | LLRGBD-synthetic         | ShapeConv (ResNeXt-101) | mIoU           | 63.26 | #6          |
| Semantic Segmentation      | NYU Depth v2             | ShapeConv (ResNet-101)  | Mean IoU       | 49.0% | #61         |
| Semantic Segmentation      | NYU Depth v2             | ShapeConv (ResNeXt-101) | Mean IoU       | 51.3% | #39         |
| Semantic Segmentation      | NYU Depth v2             | ShapeConv (ResNet-50)   | Mean IoU       | 48.8% | #64         |
| Thermal Image Segmentation | RGB-T-Glass-Segmentation | ShapeConv               | MAE            | 0.054 | #12         |
| Semantic Segmentation      | Stanford2D3D - RGBD      | ShapeConv-101           | mIoU           | 60.6  | #3          |
| Semantic Segmentation      | Stanford2D3D - RGBD      | ShapeConv-101           | mAcc           | 70.0  | #1          |
| Semantic Segmentation      | Stanford2D3D - RGBD      | ShapeConv-101           | Pixel Accuracy | 82.7  | #1          |
| Semantic Segmentation      | SUN-RGBD                 | PSD-ResNet50            | Mean IoU       | 48.6% | #20         |