TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Image Classification	ImageNet	Conformer-B	Top 1 Accuracy	84.1%	# 325
Image Classification	ImageNet	Conformer-B	Number of params	83.3M	# 811
Image Classification	ImageNet	Conformer-B	Hardware Burden	None	# 1
Image Classification	ImageNet	Conformer-B	Operations per network pass	None	# 1
Image Classification	ImageNet	Conformer-B	GFLOPs	46.6	# 418

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/conformer-local-features-coupling-global/image-classification-on-imagenet)](https://paperswithcode.com/sota/image-classification-on-imagenet?p=conformer-local-features-coupling-global)`
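
The badge Markdown above follows a visible pattern: one paper slug and one task-on-dataset slug each appear in both URLs. A minimal sketch that assembles the same string from those two slugs is shown here. The function name `pwc_badge` is hypothetical, and the URL layout is inferred only from the single example above, so it may not hold for other papers.

```python
def pwc_badge(paper_slug: str, task_slug: str) -> str:
    """Build badge Markdown in the same shape as the example above.

    Hypothetical helper: the URL layout is inferred from one example only.
    """
    # Image URL: shields.io endpoint wrapping the paperswithcode badge URL.
    image = ("https://img.shields.io/endpoint.svg?url="
             f"https://paperswithcode.com/badge/{paper_slug}/{task_slug}")
    # Link target: the leaderboard page, with the paper slug as the `p` query.
    link = f"https://paperswithcode.com/sota/{task_slug}?p={paper_slug}"
    return f"[![PWC]({image})]({link})"


badge = pwc_badge("conformer-local-features-coupling-global",
                  "image-classification-on-imagenet")
print(badge)
```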

Conformer: Local Features Coupling Global Representations for Visual Recognition

ICCV 2021 · Zhiliang Peng, Wei Huang, Shanzhi Gu, Lingxi Xie, YaoWei Wang, Jianbin Jiao, Qixiang Ye

Within a Convolutional Neural Network (CNN), convolution operations are good at extracting local features but have difficulty capturing global representations. Within a visual transformer, the cascaded self-attention modules can capture long-distance feature dependencies but unfortunately degrade local feature details. In this paper, we propose a hybrid network structure, termed Conformer, to take advantage of convolutional operations and self-attention mechanisms for enhanced representation learning. Conformer is rooted in the Feature Coupling Unit (FCU), which fuses local features and global representations at different resolutions in an interactive fashion. Conformer adopts a concurrent structure so that local features and global representations are retained to the maximum extent. Experiments show that Conformer, at comparable parameter complexity, outperforms the visual transformer (DeiT-B) by 2.3% on ImageNet. On MSCOCO, it outperforms ResNet-101 by 3.7% and 3.6% mAP for object detection and instance segmentation, respectively, demonstrating its potential as a general backbone network. Code is available at https://github.com/pengzhiliang/Conformer.
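
The coupling idea in the abstract can be illustrated with a toy sketch. The abstract only states that the FCU fuses local features and global representations at different resolutions. The specific operations below (average pooling to turn a feature map into patch tokens, nearest-neighbor upsampling to go back, and additive fusion) are assumptions chosen for illustration, and all function names are hypothetical. Channel alignment and normalization are omitted, and plain Python lists stand in for single-channel tensors.

```python
# Toy illustration of coupling a CNN feature map with transformer patch tokens.
# Assumptions (not stated in the abstract): average pooling, nearest-neighbor
# upsampling, additive fusion, one channel. H and W must be divisible by stride.

def map_to_tokens(fmap, stride):
    """Average-pool an H x W map into (H/stride) * (W/stride) patch tokens."""
    h, w = len(fmap), len(fmap[0])
    tokens = []
    for i in range(0, h, stride):
        for j in range(0, w, stride):
            window = [fmap[y][x]
                      for y in range(i, i + stride)
                      for x in range(j, j + stride)]
            tokens.append(sum(window) / len(window))
    return tokens


def tokens_to_map(tokens, h, w, stride):
    """Nearest-neighbor upsample row-major patch tokens back to an H x W map."""
    cols = w // stride
    return [[tokens[(y // stride) * cols + (x // stride)] for x in range(w)]
            for y in range(h)]


def couple(fmap, tokens, stride):
    """One two-way exchange: local -> global, then global -> local."""
    h, w = len(fmap), len(fmap[0])
    # Local features are pooled to token resolution and added to the tokens.
    pooled = map_to_tokens(fmap, stride)
    new_tokens = [t + p for t, p in zip(tokens, pooled)]
    # Updated tokens are upsampled to map resolution and added to the map.
    upsampled = tokens_to_map(new_tokens, h, w, stride)
    new_fmap = [[fmap[y][x] + upsampled[y][x] for x in range(w)]
                for y in range(h)]
    return new_fmap, new_tokens


fmap = [[1.0, 2.0, 3.0, 4.0],
        [5.0, 6.0, 7.0, 8.0],
        [9.0, 10.0, 11.0, 12.0],
        [13.0, 14.0, 15.0, 16.0]]
tokens = [0.0, 0.0, 0.0, 0.0]
new_fmap, new_tokens = couple(fmap, tokens, stride=2)
print(new_tokens)  # [3.5, 5.5, 11.5, 13.5]
```

Both branches keep their own resolution throughout; only the exchanged signal is resampled, which is the sense in which the two representations are coupled rather than merged.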

Code

pengzhiliang/Conformer official

500

open-mmlab/mmclassification

3,137

vasgaowei/TS-CAM

130

vasgaowei/ts-cam-voc

Tasks

Image Classification

Instance Segmentation

Object Detection

Representation Learning

Semantic Segmentation

Datasets

ImageNet

MS COCO

Results from the Paper

Ranked #322 on Image Classification on ImageNet

Methods

Convolution
