TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Semantic Segmentation	DDD17	CMNeXt	mIoU	72.67	# 1
Semantic Segmentation	DeLiVER	CMNeXt (RGB-D-E-LiDAR)	mIoU	66.30	# 1
Semantic Segmentation	DELIVER	CMNeXt (RGB-D-LiDAR)	mIoU	65.50	# 2
Semantic Segmentation	DELIVER	CMNeXt (RGB-LiDAR)	mIoU	58.04	# 5
Semantic Segmentation	DELIVER	CMNeXt (RGB-D-Event)	mIoU	64.44	# 3
Semantic Segmentation	DELIVER	CMNeXt (RGB-D-E-LiDAR)	mIoU	66.30	# 1
Semantic Segmentation	DELIVER	CMNeXt (RGB-Depth)	mIoU	63.58	# 4
Semantic Segmentation	DELIVER	CMNeXt (RGB-Event)	mIoU	57.48	# 6
Semantic Segmentation	DSEC	CMNeXt	mIoU	72.54	# 1
Semantic Segmentation	KITTI-360	CMNeXt (RGB-D-E-LiDAR)	mIoU	67.84	# 1
Semantic Segmentation	MCubeS	CMNeXt (B2 RGB-A)	mIoU	48.42%	# 10
Semantic Segmentation	MCubeS	CMNeXt (B2 RGB-A-D-N)	mIoU	51.54%	# 3
Semantic Segmentation	MCubeS	CMNeXt (B2 RGB-A-D)	mIoU	49.48%	# 9
Semantic Segmentation	MCubeS (P)	CMNeXt (B2 RGB-A-D)	mIoU	49.48	# 7
Semantic Segmentation	MCubeS (P)	CMNeXt (B2 RGB-A)	mIoU	48.42	# 8
Thermal Image Segmentation	MFN Dataset	CMNeXt (B4)	mIOU	59.9	# 3
Thermal Image Segmentation	Noisy RS RGB-T Dataset	CMNeXt (B4)	mIoU	60.3	# 1
Semantic Segmentation	NYU Depth v2	CMNeXt (B4)	Mean IoU	56.9%	# 7
Semantic Segmentation	UrbanLF	CMNeXt (RGB-LF8)	mIoU (Real)	83.22	# 1
Semantic Segmentation	UrbanLF	CMNeXt (RGB-LF8)	mIoU (Syn)	80.74	# 3
Semantic Segmentation	UrbanLF	CMNeXt (RGB-LF33)	mIoU (Real)	82.62	# 3
Semantic Segmentation	UrbanLF	CMNeXt (RGB-LF33)	mIoU (Syn)	80.98	# 2
Semantic Segmentation	UrbanLF	CMNeXt (RGB-LF80)	mIoU (Real)	83.11	# 2
Semantic Segmentation	UrbanLF	CMNeXt (RGB-LF80)	mIoU (Syn)	81.02	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/delivering-arbitrary-modal-semantic/semantic-segmentation-on-ddd17)](https://paperswithcode.com/sota/semantic-segmentation-on-ddd17?p=delivering-arbitrary-modal-semantic)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/delivering-arbitrary-modal-semantic/semantic-segmentation-on-deliver)](https://paperswithcode.com/sota/semantic-segmentation-on-deliver?p=delivering-arbitrary-modal-semantic)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/delivering-arbitrary-modal-semantic/semantic-segmentation-on-deliver-1)](https://paperswithcode.com/sota/semantic-segmentation-on-deliver-1?p=delivering-arbitrary-modal-semantic)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/delivering-arbitrary-modal-semantic/semantic-segmentation-on-dsec)](https://paperswithcode.com/sota/semantic-segmentation-on-dsec?p=delivering-arbitrary-modal-semantic)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/delivering-arbitrary-modal-semantic/semantic-segmentation-on-kitti-360)](https://paperswithcode.com/sota/semantic-segmentation-on-kitti-360?p=delivering-arbitrary-modal-semantic)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/delivering-arbitrary-modal-semantic/thermal-image-segmentation-on-noisy-rs-rgb-t)](https://paperswithcode.com/sota/thermal-image-segmentation-on-noisy-rs-rgb-t?p=delivering-arbitrary-modal-semantic)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/delivering-arbitrary-modal-semantic/semantic-segmentation-on-urbanlf)](https://paperswithcode.com/sota/semantic-segmentation-on-urbanlf?p=delivering-arbitrary-modal-semantic)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/delivering-arbitrary-modal-semantic/semantic-segmentation-on-mcubes)](https://paperswithcode.com/sota/semantic-segmentation-on-mcubes?p=delivering-arbitrary-modal-semantic)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/delivering-arbitrary-modal-semantic/thermal-image-segmentation-on-mfn-dataset)](https://paperswithcode.com/sota/thermal-image-segmentation-on-mfn-dataset?p=delivering-arbitrary-modal-semantic)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/delivering-arbitrary-modal-semantic/semantic-segmentation-on-mcubes-p)](https://paperswithcode.com/sota/semantic-segmentation-on-mcubes-p?p=delivering-arbitrary-modal-semantic)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/delivering-arbitrary-modal-semantic/semantic-segmentation-on-nyu-depth-v2)](https://paperswithcode.com/sota/semantic-segmentation-on-nyu-depth-v2?p=delivering-arbitrary-modal-semantic)`

Delivering Arbitrary-Modal Semantic Segmentation

CVPR 2023 · Jiaming Zhang, Ruiping Liu, Hao Shi, Kailun Yang, Simon Reiß, Kunyu Peng, Haodong Fu, Kaiwei Wang, Rainer Stiefelhagen ·

Multimodal fusion can make semantic segmentation more robust. However, fusing an arbitrary number of modalities remains underexplored. To delve into this problem, we create the DeLiVER arbitrary-modal segmentation benchmark, covering Depth, LiDAR, multiple Views, Events, and RGB. Aside from this, we provide this dataset in four severe weather conditions as well as five sensor failure cases to exploit modal complementarity and resolve partial outages. To make this possible, we present the arbitrary cross-modal segmentation model CMNeXt. It encompasses a Self-Query Hub (SQ-Hub) designed to extract effective information from any modality for subsequent fusion with the RGB representation and adds only negligible amounts of parameters (~0.01M) per additional modality. On top, to efficiently and flexibly harvest discriminative cues from the auxiliary modalities, we introduce the simple Parallel Pooling Mixer (PPX). With extensive experiments on a total of six benchmarks, our CMNeXt achieves state-of-the-art performance on the DeLiVER, KITTI-360, MFNet, NYU Depth V2, UrbanLF, and MCubeS datasets, allowing to scale from 1 to 81 modalities. On the freshly collected DeLiVER, the quad-modal CMNeXt reaches up to 66.30% in mIoU with a +9.10% gain as compared to the mono-modal baseline. The DeLiVER dataset and our code are at: https://jamycheung.github.io/DELIVER.html.

PDF Abstract CVPR 2023 PDF CVPR 2023 Abstract

Code

Add Remove Mark official

jamycheung/DELIVER official

123

Tasks

Add Remove

Segmentation

Semantic Segmentation

Thermal Image Segmentation

Datasets

Introduced in the Paper:

DELIVER

Used in the Paper:

CARLA

NYUv2

KITTI-360 MFNet

DDD17 MCubeS DSEC MCubeS (P)

Results from the Paper

Edit

Ranked #1 on Semantic Segmentation on DSEC

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Semantic Segmentation	DDD17	CMNeXt	mIoU	72.67	# 1	Compare
Semantic Segmentation	DeLiVER	CMNeXt (RGB-D-E-LiDAR)	mIoU	66.30	# 1	Compare
Semantic Segmentation	DELIVER	CMNeXt (RGB-D-LiDAR)	mIoU	65.50	# 2	Compare
Semantic Segmentation	DELIVER	CMNeXt (RGB-LiDAR)	mIoU	58.04	# 5	Compare
Semantic Segmentation	DELIVER	CMNeXt (RGB-D-Event)	mIoU	64.44	# 3	Compare
Semantic Segmentation	DELIVER	CMNeXt (RGB-D-E-LiDAR)	mIoU	66.30	# 1	Compare
Semantic Segmentation	DELIVER	CMNeXt (RGB-Depth)	mIoU	63.58	# 4	Compare
Semantic Segmentation	DELIVER	CMNeXt (RGB-Event)	mIoU	57.48	# 6	Compare
Semantic Segmentation	DSEC	CMNeXt	mIoU	72.54	# 1	Compare
Semantic Segmentation	KITTI-360	CMNeXt (RGB-D-E-LiDAR)	mIoU	67.84	# 1	Compare
Semantic Segmentation	MCubeS	CMNeXt (B2 RGB-A)	mIoU	48.42%	# 10	Compare
Semantic Segmentation	MCubeS	CMNeXt (B2 RGB-A-D-N)	mIoU	51.54%	# 3	Compare
Semantic Segmentation	MCubeS	CMNeXt (B2 RGB-A-D)	mIoU	49.48%	# 9	Compare
Semantic Segmentation	MCubeS (P)	CMNeXt (B2 RGB-A-D)	mIoU	49.48	# 7	Compare
Semantic Segmentation	MCubeS (P)	CMNeXt (B2 RGB-A)	mIoU	48.42	# 8	Compare
Thermal Image Segmentation	MFN Dataset	CMNeXt (B4)	mIOU	59.9	# 3	Compare
Thermal Image Segmentation	Noisy RS RGB-T Dataset	CMNeXt (B4)	mIoU	60.3	# 1	Compare
Semantic Segmentation	NYU Depth v2	CMNeXt (B4)	Mean IoU	56.9%	# 7	Compare
Semantic Segmentation	UrbanLF	CMNeXt (RGB-LF8)	mIoU (Real)	83.22	# 1	Compare
Semantic Segmentation	UrbanLF	CMNeXt (RGB-LF8)	mIoU (Syn)	80.74	# 3	Compare
Semantic Segmentation	UrbanLF	CMNeXt (RGB-LF33)	mIoU (Real)	82.62	# 3	Compare
Semantic Segmentation	UrbanLF	CMNeXt (RGB-LF33)	mIoU (Syn)	80.98	# 2	Compare
Semantic Segmentation	UrbanLF	CMNeXt (RGB-LF80)	mIoU (Real)	83.11	# 2	Compare
Semantic Segmentation	UrbanLF	CMNeXt (RGB-LF80)	mIoU (Syn)	81.02	# 1	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Delivering Arbitrary-Modal Semantic Segmentation

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove