TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Semantic Segmentation	Cityscapes test	CCNet	Mean IoU (class)	81.4%	# 40
Semantic Segmentation	FoodSeg103	CCNet (ResNet-50)	mIoU	35.5	# 7
Thermal Image Segmentation	MFN Dataset	CCNet	mIOU	43.3	# 44

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/ccnet-criss-cross-attention-for-semantic/semantic-segmentation-on-foodseg103)](https://paperswithcode.com/sota/semantic-segmentation-on-foodseg103?p=ccnet-criss-cross-attention-for-semantic)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/ccnet-criss-cross-attention-for-semantic/semantic-segmentation-on-cityscapes)](https://paperswithcode.com/sota/semantic-segmentation-on-cityscapes?p=ccnet-criss-cross-attention-for-semantic)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/ccnet-criss-cross-attention-for-semantic/thermal-image-segmentation-on-mfn-dataset)](https://paperswithcode.com/sota/thermal-image-segmentation-on-mfn-dataset?p=ccnet-criss-cross-attention-for-semantic)`

CCNet: Criss-Cross Attention for Semantic Segmentation

ICCV 2019 · Zilong Huang, Xinggang Wang, Yunchao Wei, Lichao Huang, Humphrey Shi, Wenyu Liu, Thomas S. Huang ·

Contextual information is vital in visual understanding problems, such as semantic segmentation and object detection. We propose a Criss-Cross Network (CCNet) for obtaining full-image contextual information in a very effective and efficient way. Concretely, for each pixel, a novel criss-cross attention module harvests the contextual information of all the pixels on its criss-cross path. By taking a further recurrent operation, each pixel can finally capture the full-image dependencies. Besides, a category consistent loss is proposed to enforce the criss-cross attention module to produce more discriminative features. Overall, CCNet is with the following merits: 1) GPU memory friendly. Compared with the non-local block, the proposed recurrent criss-cross attention module requires 11x less GPU memory usage. 2) High computational efficiency. The recurrent criss-cross attention significantly reduces FLOPs by about 85% of the non-local block. 3) The state-of-the-art performance. We conduct extensive experiments on semantic segmentation benchmarks including Cityscapes, ADE20K, human parsing benchmark LIP, instance segmentation benchmark COCO, video segmentation benchmark CamVid. In particular, our CCNet achieves the mIoU scores of 81.9%, 45.76% and 55.47% on the Cityscapes test set, the ADE20K validation set and the LIP validation set respectively, which are the new state-of-the-art results. The source codes are available at \url{https://github.com/speedinghzl/CCNet}.

PDF Abstract ICCV 2019 PDF ICCV 2019 Abstract

Code

Add Remove Mark official

speedinghzl/CCNet official

1,394

open-mmlab/mmsegmentation

↳ Quickstart in

Colab

7,408

mindspore-courses/External-Attentio…

justld/CCNet_paddle

Tasks

Add Remove

Computational Efficiency

Human Parsing

Instance Segmentation

object-detection

Object Detection

Segmentation

Semantic Segmentation

Thermal Image Segmentation

Video Segmentation

Video Semantic Segmentation

Datasets

MS COCO

Cityscapes

ADE20K

CamVid MFNet

FoodSeg103

Results from the Paper

Edit

Ranked #7 on Semantic Segmentation on FoodSeg103 (using extra training data)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Semantic Segmentation	Cityscapes test	CCNet	Mean IoU (class)	81.4%	# 40	Compare
Semantic Segmentation	FoodSeg103	CCNet (ResNet-50)	mIoU	35.5	# 7	Compare
Thermal Image Segmentation	MFN Dataset	CCNet	mIOU	43.3	# 44	Compare

Methods

Add Remove

1x1 Convolution • CCNet • Non-Local Block • Non-Local Operation • Residual Connection

Edit Social Preview

CCNet: Criss-Cross Attention for Semantic Segmentation

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove