TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Few-Shot Semantic Segmentation	COCO-20i (1-shot)	ProtoFormer (ResNet-101)	Mean IoU	47	# 12
Few-Shot Semantic Segmentation	COCO-20i (1-shot)	ProtoFormer (ResNet-101)	FB-IoU	70	# 11
Few-Shot Semantic Segmentation	COCO-20i (1-shot)	ProtoFormer (ResNet-50)	Mean IoU	45.7	# 23
Few-Shot Semantic Segmentation	COCO-20i (1-shot)	ProtoFormer (ResNet-50)	FB-IoU	69.6	# 16
Few-Shot Semantic Segmentation	COCO-20i (5-shot)	ProtoFormer (ResNet-101)	Mean IoU	54.7	# 11
Few-Shot Semantic Segmentation	COCO-20i (5-shot)	ProtoFormer (ResNet-101)	FB-IoU	74.6	# 6
Few-Shot Semantic Segmentation	COCO-20i (5-shot)	ProtoFormer (ResNet-50)	Mean IoU	53.4	# 16
Few-Shot Semantic Segmentation	COCO-20i (5-shot)	ProtoFormer (ResNet-50)	FB-IoU	73.3	# 9
Few-Shot Semantic Segmentation	PASCAL-5i (1-Shot)	ProtoFormer (ResNet-50)	Mean IoU	63.1	# 62
Few-Shot Semantic Segmentation	PASCAL-5i (1-Shot)	ProtoFormer (ResNet-50)	FB-IoU	72.6	# 42
Few-Shot Semantic Segmentation	PASCAL-5i (1-Shot)	ProtoFormer (ResNet-101)	Mean IoU	63.2	# 61
Few-Shot Semantic Segmentation	PASCAL-5i (1-Shot)	ProtoFormer (ResNet-101)	FB-IoU	72.6	# 42
Few-Shot Semantic Segmentation	PASCAL-5i (5-Shot)	ProtoFormer (ResNet-50)	Mean IoU	67.4	# 55
Few-Shot Semantic Segmentation	PASCAL-5i (5-Shot)	ProtoFormer (ResNet-50)	FB-IoU	77.1	# 36
Few-Shot Semantic Segmentation	PASCAL-5i (5-Shot)	ProtoFormer (ResNet-101)	Mean IoU	67	# 57
Few-Shot Semantic Segmentation	PASCAL-5i (5-Shot)	ProtoFormer (ResNet-101)	FB-IoU	76.3	# 38

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/prototype-as-query-for-few-shot-semantic/few-shot-semantic-segmentation-on-coco-20i-5)](https://paperswithcode.com/sota/few-shot-semantic-segmentation-on-coco-20i-5?p=prototype-as-query-for-few-shot-semantic)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/prototype-as-query-for-few-shot-semantic/few-shot-semantic-segmentation-on-coco-20i-1)](https://paperswithcode.com/sota/few-shot-semantic-segmentation-on-coco-20i-1?p=prototype-as-query-for-few-shot-semantic)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/prototype-as-query-for-few-shot-semantic/few-shot-semantic-segmentation-on-pascal-5i-5)](https://paperswithcode.com/sota/few-shot-semantic-segmentation-on-pascal-5i-5?p=prototype-as-query-for-few-shot-semantic)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/prototype-as-query-for-few-shot-semantic/few-shot-semantic-segmentation-on-pascal-5i-1)](https://paperswithcode.com/sota/few-shot-semantic-segmentation-on-pascal-5i-1?p=prototype-as-query-for-few-shot-semantic)`

Prototype as Query for Few Shot Semantic Segmentation

27 Nov 2022 · Leilei Cao, Yibo Guo, Ye Yuan, Qiangguo Jin ·

Few-shot Semantic Segmentation (FSS) was proposed to segment unseen classes in a query image, referring to only a few annotated examples named support images. One of the characteristics of FSS is spatial inconsistency between query and support targets, e.g., texture or appearance. This greatly challenges the generalization ability of methods for FSS, which requires to effectively exploit the dependency of the query image and the support examples. Most existing methods abstracted support features into prototype vectors and implemented the interaction with query features using cosine similarity or feature concatenation. However, this simple interaction may not capture spatial details in query features. To alleviate this limitation, a few methods utilized all pixel-wise support information via computing the pixel-wise correlations between paired query and support features implemented with the attention mechanism of Transformer. These approaches suffer from heavy computation on the dot-product attention between all pixels of support and query features. In this paper, we propose a simple yet effective framework built upon Transformer termed as ProtoFormer to fully capture spatial details in query features. It views the abstracted prototype of the target class in support features as Query and the query features as Key and Value embeddings, which are input to the Transformer decoder. In this way, the spatial details can be better captured and the semantic features of target class in the query image can be focused. The output of the Transformer-based module can be viewed as semantic-aware dynamic kernels to filter out the segmentation mask from the enriched query features. Extensive experiments on PASCAL-$5^{i}$ and COCO-$20^{i}$ show that our ProtoFormer significantly advances the state-of-the-art methods.

PDF Abstract

Code

Add Remove Mark official

leileicao/protoformer official

Tasks

Add Remove

Few-Shot Semantic Segmentation

Datasets

ImageNet

MS COCO

PASCAL-5i

Results from the Paper

Edit

Ranked #11 on Few-Shot Semantic Segmentation on COCO-20i (5-shot)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Few-Shot Semantic Segmentation	COCO-20i (1-shot)	ProtoFormer (ResNet-101)	Mean IoU	47	# 12	Compare
Few-Shot Semantic Segmentation	COCO-20i (1-shot)	ProtoFormer (ResNet-101)	FB-IoU	70	# 11	Compare
Few-Shot Semantic Segmentation	COCO-20i (1-shot)	ProtoFormer (ResNet-50)	Mean IoU	45.7	# 23	Compare
Few-Shot Semantic Segmentation	COCO-20i (1-shot)	ProtoFormer (ResNet-50)	FB-IoU	69.6	# 16	Compare
Few-Shot Semantic Segmentation	COCO-20i (5-shot)	ProtoFormer (ResNet-101)	Mean IoU	54.7	# 11	Compare
Few-Shot Semantic Segmentation	COCO-20i (5-shot)	ProtoFormer (ResNet-101)	FB-IoU	74.6	# 6	Compare
Few-Shot Semantic Segmentation	COCO-20i (5-shot)	ProtoFormer (ResNet-50)	Mean IoU	53.4	# 16	Compare
Few-Shot Semantic Segmentation	COCO-20i (5-shot)	ProtoFormer (ResNet-50)	FB-IoU	73.3	# 9	Compare
Few-Shot Semantic Segmentation	PASCAL-5i (1-Shot)	ProtoFormer (ResNet-50)	Mean IoU	63.1	# 62	Compare
Few-Shot Semantic Segmentation	PASCAL-5i (1-Shot)	ProtoFormer (ResNet-50)	FB-IoU	72.6	# 42	Compare
Few-Shot Semantic Segmentation	PASCAL-5i (1-Shot)	ProtoFormer (ResNet-101)	Mean IoU	63.2	# 61	Compare
Few-Shot Semantic Segmentation	PASCAL-5i (1-Shot)	ProtoFormer (ResNet-101)	FB-IoU	72.6	# 42	Compare
Few-Shot Semantic Segmentation	PASCAL-5i (5-Shot)	ProtoFormer (ResNet-50)	Mean IoU	67.4	# 55	Compare
Few-Shot Semantic Segmentation	PASCAL-5i (5-Shot)	ProtoFormer (ResNet-50)	FB-IoU	77.1	# 36	Compare
Few-Shot Semantic Segmentation	PASCAL-5i (5-Shot)	ProtoFormer (ResNet-101)	Mean IoU	67	# 57	Compare
Few-Shot Semantic Segmentation	PASCAL-5i (5-Shot)	ProtoFormer (ResNet-101)	FB-IoU	76.3	# 38	Compare

Methods

Add Remove

Absolute Position Encodings • Adam • BPE • Dense Connections • Dot-Product Attention • Dropout • Label Smoothing • Layer Normalization • Linear Layer • Multi-Head Attention • Position-Wise Feed-Forward Layer • Residual Connection • Scaled Dot-Product Attention • Softmax • Transformer

Edit Social Preview

Prototype as Query for Few Shot Semantic Segmentation

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove