TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Semantic Segmentation	ADE20K	SegFormer-B5	Validation mIoU	51.8	# 84
Semantic Segmentation	ADE20K	SegFormer-B5	Params (M)	84.7	# 31
Semantic Segmentation	ADE20K	SegFormer-B4	Validation mIoU	51.1	# 93
Semantic Segmentation	ADE20K	SegFormer-B4	Params (M)	64.1	# 39
Semantic Segmentation	ADE20K	SegFormer-B0	Validation mIoU	37.4	# 216
Semantic Segmentation	ADE20K	SegFormer-B0	Params (M)	3.8	# 64
Semantic Segmentation	ADE20K val	SegFormer-B5(MS, 87M #Params, ImageNet-1K pretrain)	mIoU	51.8	# 40
Semantic Segmentation	Cityscapes test	SegFormer (MiT-B5, Mapillary)	Mean IoU (class)	83.1%	# 19
Semantic Segmentation	Cityscapes val	SegFormer (MiT-B5, Mapillary)	mIoU	84.0	# 18
Semantic Segmentation	Cityscapes val	SegFormer-B0	Validation mIoU	76.2	# 2
Semantic Segmentation	COCO-Stuff full	SegFormer-B5 (Single Scale)	Mean IoU (class)	46.7	# 1
Semantic Segmentation	DADA-seg	SegFormer (MiT-B2)	mIoU	21.2	# 18
Semantic Segmentation	DADA-seg	SegFormer (MiT-B3)	mIoU	27.0	# 10
Semantic Segmentation	DADA-seg	SegFormer (MiT-B1)	mIoU	16.6	# 26
Semantic Segmentation	DDD17	SegFormer-B2	mIoU	71.05	# 4
Semantic Segmentation	DELIVER	SegFormer	mIoU	57.20	# 7
Semantic Segmentation	DensePASS	SegFormer (MiT-B1)	mIoU	38.5%	# 16
Semantic Segmentation	DensePASS	SegFormer (MiT-B2)	mIoU	42.4%	# 12
Semantic Segmentation	DSEC	SegFormer-B2	mIoU	71.99	# 3
Semantic Segmentation	EventScape	SegFormer-B4	mIoU	59.86	# 3
Semantic Segmentation	EventScape	SegFormer-B2	mIoU	58.69	# 4
Thermal Image Segmentation	MFN Dataset	SegFormer (B4)	mIOU	54.8	# 27
Thermal Image Segmentation	MFN Dataset	SegFormer (B2)	mIOU	53.2	# 31
Thermal Image Segmentation	RGB-T-Glass-Segmentation	SegFormer	MAE	0.053	# 11
Semantic Segmentation	SELMA	SegFormer	mIoU	77.2	# 2
Semantic Segmentation	SpectralWaste	SegFormer (HYPER)	mIoU	54.3	# 3
Semantic Segmentation	SpectralWaste	SegFormer (HYPER3)	mIoU	53.5	# 4
Semantic Segmentation	SpectralWaste	SegFormer (RGB)	mIoU	48.4	# 7
Semantic Segmentation	SynPASS	SegFomrer	mIoU	37.24%	# 3
Semantic Segmentation	UPLight	SegFormer-B2 (RGB)	mIoU	89.60	# 4
Semantic Segmentation	UrbanLF	SegFormer	mIoU (Real)	82.20	# 4
Semantic Segmentation	UrbanLF	SegFormer	mIoU (Syn)	78.53	# 8
Semantic Segmentation	ZJU-RGB-P	SegFormer-B2 (RGB)	mIoU	89.6	# 5

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/segformer-simple-and-efficient-design-for/semantic-segmentation-on-coco-stuff-full)](https://paperswithcode.com/sota/semantic-segmentation-on-coco-stuff-full?p=segformer-simple-and-efficient-design-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/segformer-simple-and-efficient-design-for/semantic-segmentation-on-cityscapes-val)](https://paperswithcode.com/sota/semantic-segmentation-on-cityscapes-val?p=segformer-simple-and-efficient-design-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/segformer-simple-and-efficient-design-for/semantic-segmentation-on-selma)](https://paperswithcode.com/sota/semantic-segmentation-on-selma?p=segformer-simple-and-efficient-design-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/segformer-simple-and-efficient-design-for/semantic-segmentation-on-dsec)](https://paperswithcode.com/sota/semantic-segmentation-on-dsec?p=segformer-simple-and-efficient-design-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/segformer-simple-and-efficient-design-for/semantic-segmentation-on-eventscape)](https://paperswithcode.com/sota/semantic-segmentation-on-eventscape?p=segformer-simple-and-efficient-design-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/segformer-simple-and-efficient-design-for/semantic-segmentation-on-spectralwaste)](https://paperswithcode.com/sota/semantic-segmentation-on-spectralwaste?p=segformer-simple-and-efficient-design-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/segformer-simple-and-efficient-design-for/semantic-segmentation-on-synpass)](https://paperswithcode.com/sota/semantic-segmentation-on-synpass?p=segformer-simple-and-efficient-design-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/segformer-simple-and-efficient-design-for/semantic-segmentation-on-ddd17)](https://paperswithcode.com/sota/semantic-segmentation-on-ddd17?p=segformer-simple-and-efficient-design-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/segformer-simple-and-efficient-design-for/semantic-segmentation-on-uplight)](https://paperswithcode.com/sota/semantic-segmentation-on-uplight?p=segformer-simple-and-efficient-design-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/segformer-simple-and-efficient-design-for/semantic-segmentation-on-zju-rgb-p)](https://paperswithcode.com/sota/semantic-segmentation-on-zju-rgb-p?p=segformer-simple-and-efficient-design-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/segformer-simple-and-efficient-design-for/semantic-segmentation-on-deliver-1)](https://paperswithcode.com/sota/semantic-segmentation-on-deliver-1?p=segformer-simple-and-efficient-design-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/segformer-simple-and-efficient-design-for/semantic-segmentation-on-urbanlf)](https://paperswithcode.com/sota/semantic-segmentation-on-urbanlf?p=segformer-simple-and-efficient-design-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/segformer-simple-and-efficient-design-for/semantic-segmentation-on-dada-seg)](https://paperswithcode.com/sota/semantic-segmentation-on-dada-seg?p=segformer-simple-and-efficient-design-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/segformer-simple-and-efficient-design-for/thermal-image-segmentation-on-rgb-t-glass)](https://paperswithcode.com/sota/thermal-image-segmentation-on-rgb-t-glass?p=segformer-simple-and-efficient-design-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/segformer-simple-and-efficient-design-for/semantic-segmentation-on-densepass)](https://paperswithcode.com/sota/semantic-segmentation-on-densepass?p=segformer-simple-and-efficient-design-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/segformer-simple-and-efficient-design-for/semantic-segmentation-on-cityscapes)](https://paperswithcode.com/sota/semantic-segmentation-on-cityscapes?p=segformer-simple-and-efficient-design-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/segformer-simple-and-efficient-design-for/thermal-image-segmentation-on-mfn-dataset)](https://paperswithcode.com/sota/thermal-image-segmentation-on-mfn-dataset?p=segformer-simple-and-efficient-design-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/segformer-simple-and-efficient-design-for/semantic-segmentation-on-ade20k-val)](https://paperswithcode.com/sota/semantic-segmentation-on-ade20k-val?p=segformer-simple-and-efficient-design-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/segformer-simple-and-efficient-design-for/semantic-segmentation-on-ade20k)](https://paperswithcode.com/sota/semantic-segmentation-on-ade20k?p=segformer-simple-and-efficient-design-for)`

SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers

NeurIPS 2021 · Enze Xie, Wenhai Wang, Zhiding Yu, Anima Anandkumar, Jose M. Alvarez, Ping Luo ·

We present SegFormer, a simple, efficient yet powerful semantic segmentation framework which unifies Transformers with lightweight multilayer perception (MLP) decoders. SegFormer has two appealing features: 1) SegFormer comprises a novel hierarchically structured Transformer encoder which outputs multiscale features. It does not need positional encoding, thereby avoiding the interpolation of positional codes which leads to decreased performance when the testing resolution differs from training. 2) SegFormer avoids complex decoders. The proposed MLP decoder aggregates information from different layers, and thus combining both local attention and global attention to render powerful representations. We show that this simple and lightweight design is the key to efficient segmentation on Transformers. We scale our approach up to obtain a series of models from SegFormer-B0 to SegFormer-B5, reaching significantly better performance and efficiency than previous counterparts. For example, SegFormer-B4 achieves 50.3% mIoU on ADE20K with 64M parameters, being 5x smaller and 2.2% better than the previous best method. Our best model, SegFormer-B5, achieves 84.0% mIoU on Cityscapes validation set and shows excellent zero-shot robustness on Cityscapes-C. Code will be released at: github.com/NVlabs/SegFormer.

PDF Abstract NeurIPS 2021 PDF NeurIPS 2021 Abstract

Code

Add Remove Mark official

NVlabs/SegFormer official

2,274

huggingface/transformers

125,725

PaddlePaddle/PaddleSeg

8,282

BR-IDL/PaddleViT

1,188

sithu31296/semantic-segmentation

↳ Quickstart in

Colab

764

See all 24 implementations

Tasks

Add Remove

C++ code

Decoder

Semantic Segmentation

Thermal Image Segmentation

Datasets

Cityscapes

ADE20K

COCO-Stuff MFNet

DensePASS

DDD17

DADA-seg DSEC

DELIVER UPLight ZJU-RGB-P

Results from the Paper

Edit

Ranked #1 on Semantic Segmentation on COCO-Stuff full

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Semantic Segmentation	ADE20K	SegFormer-B5	Validation mIoU	51.8	# 84	Compare
Semantic Segmentation	ADE20K	SegFormer-B5	Params (M)	84.7	# 31	Compare
Semantic Segmentation	ADE20K	SegFormer-B4	Validation mIoU	51.1	# 93	Compare
Semantic Segmentation	ADE20K	SegFormer-B4	Params (M)	64.1	# 39	Compare
Semantic Segmentation	ADE20K	SegFormer-B0	Validation mIoU	37.4	# 216	Compare
Semantic Segmentation	ADE20K	SegFormer-B0	Params (M)	3.8	# 64	Compare
Semantic Segmentation	ADE20K val	SegFormer-B5(MS, 87M #Params, ImageNet-1K pretrain)	mIoU	51.8	# 40	Compare
Semantic Segmentation	Cityscapes test	SegFormer (MiT-B5, Mapillary)	Mean IoU (class)	83.1%	# 19	Compare
Semantic Segmentation	Cityscapes val	SegFormer (MiT-B5, Mapillary)	mIoU	84.0	# 18	Compare
Semantic Segmentation	Cityscapes val	SegFormer-B0	Validation mIoU	76.2	# 2	Compare
Semantic Segmentation	COCO-Stuff full	SegFormer-B5 (Single Scale)	Mean IoU (class)	46.7	# 1	Compare
Semantic Segmentation	DADA-seg	SegFormer (MiT-B2)	mIoU	21.2	# 18	Compare
Semantic Segmentation	DADA-seg	SegFormer (MiT-B3)	mIoU	27.0	# 10	Compare
Semantic Segmentation	DADA-seg	SegFormer (MiT-B1)	mIoU	16.6	# 26	Compare
Semantic Segmentation	DDD17	SegFormer-B2	mIoU	71.05	# 4	Compare
Semantic Segmentation	DELIVER	SegFormer	mIoU	57.20	# 7	Compare
Semantic Segmentation	DensePASS	SegFormer (MiT-B1)	mIoU	38.5%	# 16	Compare
Semantic Segmentation	DensePASS	SegFormer (MiT-B2)	mIoU	42.4%	# 12	Compare
Semantic Segmentation	DSEC	SegFormer-B2	mIoU	71.99	# 3	Compare
Semantic Segmentation	EventScape	SegFormer-B4	mIoU	59.86	# 3	Compare
Semantic Segmentation	EventScape	SegFormer-B2	mIoU	58.69	# 4	Compare
Thermal Image Segmentation	MFN Dataset	SegFormer (B4)	mIOU	54.8	# 27	Compare
Thermal Image Segmentation	MFN Dataset	SegFormer (B2)	mIOU	53.2	# 31	Compare
Thermal Image Segmentation	RGB-T-Glass-Segmentation	SegFormer	MAE	0.053	# 11	Compare
Semantic Segmentation	SELMA	SegFormer	mIoU	77.2	# 2	Compare
Semantic Segmentation	SpectralWaste	SegFormer (HYPER)	mIoU	54.3	# 3	Compare
Semantic Segmentation	SpectralWaste	SegFormer (HYPER3)	mIoU	53.5	# 4	Compare
Semantic Segmentation	SpectralWaste	SegFormer (RGB)	mIoU	48.4	# 7	Compare
Semantic Segmentation	SynPASS	SegFomrer	mIoU	37.24%	# 3	Compare
Semantic Segmentation	UPLight	SegFormer-B2 (RGB)	mIoU	89.60	# 4	Compare
Semantic Segmentation	UrbanLF	SegFormer	mIoU (Real)	82.20	# 4	Compare
Semantic Segmentation	UrbanLF	SegFormer	mIoU (Syn)	78.53	# 8	Compare
Semantic Segmentation	ZJU-RGB-P	SegFormer-B2 (RGB)	mIoU	89.6	# 5	Compare

Methods

Add Remove

Absolute Position Encodings • Adam • BPE • Convolution • Dense Connections • Dropout • GELU • Label Smoothing • Layer Normalization • Linear Layer • Mix-FFN • Multi-Head Attention • Position-Wise Feed-Forward Layer • Residual Connection • Scaled Dot-Product Attention • SegFormer • Softmax • Transformer

Edit Social Preview

SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove