TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Semantic Segmentation	LLRGBD-synthetic	CEN (ResNet-101)	mIoU	62.15	#7
Semantic Segmentation	NYU Depth v2	CEN-PSPNet (ResNet-152)	Mean IoU	52.5%	#30

Channel Exchanging Networks for Multimodal and Multitask Dense Image Prediction

4 Dec 2021 · Yikai Wang, Fuchun Sun, Wenbing Huang, Fengxiang He, DaCheng Tao

Multimodal fusion and multitask learning are two vital topics in machine learning. Despite fruitful progress, existing methods for both problems remain brittle to the same challenge: it is difficult to integrate the common information across modalities (resp. tasks) while preserving the specific patterns of each modality (resp. task). Moreover, although the two problems are closely related, multimodal fusion and multitask learning have rarely been explored within the same methodological framework. In this paper, we propose the Channel-Exchanging-Network (CEN), which is self-adaptive, parameter-free, and, more importantly, applicable to both multimodal and multitask dense image prediction. At its core, CEN adaptively exchanges channels between subnetworks of different modalities. Specifically, the channel exchanging process is self-guided by individual channel importance, which is measured by the magnitude of the Batch-Normalization (BN) scaling factor during training. For dense image prediction, the validity of CEN is tested in four scenarios: multimodal fusion, cycle multimodal fusion, multitask learning, and multimodal multitask learning. Extensive experiments on semantic segmentation via RGB-D data and image translation through multi-domain input verify the effectiveness of CEN compared to state-of-the-art methods. Detailed ablation studies have also been carried out and demonstrate the advantage of each proposed component. Our code is available at https://github.com/yikaiw/CEN.
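
The channel exchange described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation (see the linked repository for that). It assumes two modalities, represents each feature map as a list of flattened channels, and treats a channel as unimportant when the magnitude of its BN scaling factor falls below a threshold. The function name `exchange_channels`, the default `threshold` value, and the rule of directly substituting the other modality's channel are all illustrative assumptions.

```python
from typing import List, Sequence, Tuple

Channel = List[float]  # one flattened feature-map channel (illustrative layout)


def exchange_channels(
    feats_a: Sequence[Channel],
    feats_b: Sequence[Channel],
    gamma_a: Sequence[float],
    gamma_b: Sequence[float],
    threshold: float = 0.02,  # hypothetical cut-off, not taken from the source
) -> Tuple[List[Channel], List[Channel]]:
    """Swap in the other modality's channel wherever |BN scale| < threshold.

    feats_a / feats_b: per-channel features of the two subnetworks.
    gamma_a / gamma_b: BN scaling factors, one per channel.
    """
    if not (len(feats_a) == len(feats_b) == len(gamma_a) == len(gamma_b)):
        raise ValueError("features and scaling factors must have equal channel counts")

    out_a: List[Channel] = []
    out_b: List[Channel] = []
    for ch_a, ch_b, g_a, g_b in zip(feats_a, feats_b, gamma_a, gamma_b):
        # A small |gamma| marks the channel as unimportant for its own modality,
        # so it is replaced by the corresponding channel of the other modality.
        out_a.append(list(ch_b) if abs(g_a) < threshold else list(ch_a))
        out_b.append(list(ch_a) if abs(g_b) < threshold else list(ch_b))
    return out_a, out_b


# Toy example: three channels, two values each.
rgb = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
depth = [[-1.0, -2.0], [-3.0, -4.0], [-5.0, -6.0]]
gamma_rgb = [0.9, 0.001, 0.5]    # RGB channel 1 is near zero
gamma_depth = [0.003, 0.7, 0.4]  # depth channel 0 is near zero

new_rgb, new_depth = exchange_channels(rgb, depth, gamma_rgb, gamma_depth)
print(new_rgb)    # [[1.0, 2.0], [-3.0, -4.0], [5.0, 6.0]]
print(new_depth)  # [[1.0, 2.0], [-3.0, -4.0], [-5.0, -6.0]]
```

The exchange itself adds no learnable weights, which matches the abstract's "parameter-free" description; only the scaling factors that BN already learns decide which channels are swapped.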

Code

yikaiw/CEN official

276

Tasks

Semantic Segmentation

Datasets

Cityscapes

Visual Question Answering

NYUv2

SUN RGB-D

Taskonomy

NLPR

Results from the Paper

Ranked #7 on Semantic Segmentation on LLRGBD-synthetic
