TransFGU: A Top-down Approach to Fine-Grained Unsupervised Semantic Segmentation

2 Dec 2021  ·  Zhaoyuan Yin, Pichao Wang, Fan Wang, Xianzhe Xu, Hanling Zhang, Hao Li, Rong Jin ·

Unsupervised semantic segmentation aims to obtain high-level semantic representations from low-level visual features without manual annotations. Most existing methods are bottom-up approaches that try to group pixels into regions based on their visual cues or certain predefined rules. As a result, it is difficult for these bottom-up approaches to generate fine-grained semantic segmentation in complicated scenes that contain multiple objects, some of which share a similar visual appearance. In contrast, we propose the first top-down unsupervised semantic segmentation framework for fine-grained segmentation in extremely complicated scenarios. Specifically, we first obtain rich high-level structured semantic concept information from large-scale vision data in a self-supervised learning manner, and use this information as a prior to discover the potential semantic categories present in the target dataset. Secondly, the discovered high-level semantic categories are mapped to low-level pixel features by computing the class activation map (CAM) with respect to each discovered semantic representation. Lastly, the obtained CAMs serve as pseudo labels to train the segmentation module and produce the final semantic segmentation. Experimental results on multiple semantic segmentation benchmarks show that our top-down unsupervised segmentation is robust on both object-centric and scene-centric datasets under different levels of semantic granularity, and outperforms all current state-of-the-art bottom-up methods. Our code is available at \url{https://github.com/damo-cv/TransFGU}.
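To make the pipeline described above more concrete, the sketch below illustrates the general idea of mapping discovered high-level categories to pixel-level pseudo labels. This is not the authors' implementation: it assumes patch features from a frozen self-supervised ViT (e.g., DINO ViT-S/8), approximates category discovery with a simple k-means over patch features, and approximates the CAM with cosine similarity between patch features and class prototypes. All function names, shapes, and hyperparameters are illustrative.

```python
# Minimal sketch: discovered class prototypes -> CAM-style maps -> pixel pseudo labels.
import torch
import torch.nn.functional as F

def discover_prototypes(patch_feats, num_classes, iters=10):
    """k-means over L2-normalised patch features -> (num_classes, C) class prototypes."""
    feats = F.normalize(patch_feats.reshape(-1, patch_feats.shape[-1]), dim=-1)
    protos = feats[torch.randperm(feats.shape[0])[:num_classes]].clone()
    for _ in range(iters):
        assign = (feats @ protos.t()).argmax(dim=1)          # nearest-prototype assignment
        for k in range(num_classes):
            members = feats[assign == k]
            if len(members) > 0:
                protos[k] = F.normalize(members.mean(dim=0), dim=0)
    return protos

def cam_pseudo_labels(patch_feats, protos, out_size):
    """Similarity 'CAMs' between patches and prototypes, upsampled and argmax-ed into labels."""
    feats = F.normalize(patch_feats, dim=-1)                 # (B, H, W, C)
    cams = torch.einsum('bhwc,kc->bkhw', feats, protos)      # (B, K, H, W)
    cams = F.interpolate(cams, size=out_size, mode='bilinear', align_corners=False)
    return cams.argmax(dim=1)                                # (B, out_H, out_W) pseudo labels

if __name__ == "__main__":
    # Toy example: 2 images, 28x28 grid of 384-d ViT-S patch tokens (hypothetical values).
    patch_feats = torch.randn(2, 28, 28, 384)
    protos = discover_prototypes(patch_feats, num_classes=27)
    pseudo = cam_pseudo_labels(patch_feats, protos, out_size=(224, 224))
    print(pseudo.shape)  # torch.Size([2, 224, 224])
```

In the full method, such pseudo labels would then supervise a separate segmentation head, which produces the final fine-grained segmentation.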


Results from the Paper


Task                               | Dataset        | Model              | Metric         | Value | Global Rank
Unsupervised Semantic Segmentation | COCO-Stuff-171 | TransFGU (ViT-S/8) | mIoU           | 11.93 | #2
Unsupervised Semantic Segmentation | COCO-Stuff-171 | TransFGU (ViT-S/8) | Pixel Accuracy | 34.32 | #2
Unsupervised Semantic Segmentation | COCO-Stuff-81  | TransFGU (ViT-S/8) | mIoU           | 12.7  | #3
Unsupervised Semantic Segmentation | COCO-Stuff-81  | TransFGU (ViT-S/8) | Pixel Accuracy | 64.3  | #3

Methods


No methods listed for this paper.