Beyond Self-attention: External Attention using Two Linear Layers for Visual Tasks

5 May 2021 · Meng-Hao Guo, Zheng-Ning Liu, Tai-Jiang Mu, Shi-Min Hu

Attention mechanisms, especially self-attention, play an increasingly important role in deep feature representation for visual tasks. Self-attention updates the feature at each position by computing a weighted sum of features, using pairwise affinities across all positions to capture long-range dependencies within a single sample. However, self-attention has quadratic complexity and ignores potential correlations between different samples. This paper proposes a novel attention mechanism, which we call external attention, based on two external, small, learnable, shared memories. It can be implemented simply with two cascaded linear layers and two normalization layers, and it conveniently replaces self-attention in existing popular architectures. External attention has linear complexity and implicitly considers the correlations between all data samples. We further incorporate the multi-head mechanism into external attention to provide an all-MLP architecture, external attention MLP (EAMLP), for image classification. Extensive experiments on image classification, object detection, semantic segmentation, instance segmentation, image generation, and point cloud analysis show that our method gives results comparable or superior to self-attention and some of its variants, at much lower computational and memory cost.
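
Because external attention reduces to two linear layers plus two normalizations, the core of the mechanism fits in a few lines of code. Below is a minimal PyTorch sketch based on the description above; the memory size `s = 64`, the names `mk`/`mv`, and the double-normalization order (softmax over positions, then l1 normalization over memory slots) reflect one reading of the abstract and are illustrative, not the authors' reference implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ExternalAttention(nn.Module):
    """Sketch of external attention: two small learnable shared memories,
    realized as two cascaded linear layers with a doubly-normalized
    attention map in between."""

    def __init__(self, d_model: int, s: int = 64):
        super().__init__()
        self.mk = nn.Linear(d_model, s, bias=False)   # memory unit M_k: d_model -> S
        self.mv = nn.Linear(s, d_model, bias=False)   # memory unit M_v: S -> d_model

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, N, d_model), where N is the number of positions (pixels/points)
        attn = self.mk(x)                                       # (B, N, S) affinities to S memory slots
        attn = F.softmax(attn, dim=1)                           # normalize over positions
        attn = attn / (attn.sum(dim=2, keepdim=True) + 1e-9)    # l1-normalize over memory slots
        return self.mv(attn)                                    # (B, N, d_model)

# Shape check: 4 samples, 1024 positions, 256 channels
x = torch.randn(4, 1024, 256)
out = ExternalAttention(d_model=256)(x)
print(out.shape)  # torch.Size([4, 1024, 256])
```

Since the memory size S is fixed and independent of N, the cost is O(N·S·d), i.e. linear in the number of positions, and because M_k and M_v are shared across the whole dataset they can implicitly capture correlations between samples.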

| Task | Dataset | Model | Metric | Value | Global Rank |
|------|---------|-------|--------|-------|-------------|
| Semantic Segmentation | ADE20K | EANet (ResNet-101) | Validation mIoU | 45.33 | #184 |
| Semantic Segmentation | ADE20K val | EANet (ResNet-101) | mIoU | 45.33 | #75 |
| Semantic Segmentation | Cityscapes val | EANet | mIoU | 81.7% | #34 |
| Image Classification | ImageNet | T2T-ViT-14 | Top 1 Accuracy | 81.7% | #563 |
| Semantic Segmentation | PASCAL VOC 2012 test | EANet (ResNet-101) | Mean IoU | 84% | #16 |
