TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK	REMOVE
Generalized Few-Shot Semantic Segmentation	PASCAL-5i (5-Shot)	PCN (ResNet-50)	Mean IoU	59.66	# 3
Generalized Few-Shot Semantic Segmentation	PASCAL-5i (5-Shot)	PCN (ResNet-50)	Mean Base and Novel	58.47	# 3

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/prediction-calibration-for-generalized-few/generalized-few-shot-semantic-segmentation-on-1)](https://paperswithcode.com/sota/generalized-few-shot-semantic-segmentation-on-1?p=prediction-calibration-for-generalized-few)`

Prediction Calibration for Generalized Few-shot Semantic Segmentation

15 Oct 2022 · Zhihe Lu, Sen He, Da Li, Yi-Zhe Song, Tao Xiang ·

Generalized Few-shot Semantic Segmentation (GFSS) aims to segment each image pixel into either base classes with abundant training examples or novel classes with only a handful of (e.g., 1-5) training images per class. Compared to the widely studied Few-shot Semantic Segmentation FSS, which is limited to segmenting novel classes only, GFSS is much under-studied despite being more practical. Existing approach to GFSS is based on classifier parameter fusion whereby a newly trained novel class classifier and a pre-trained base class classifier are combined to form a new classifier. As the training data is dominated by base classes, this approach is inevitably biased towards the base classes. In this work, we propose a novel Prediction Calibration Network PCN to address this problem. Instead of fusing the classifier parameters, we fuse the scores produced separately by the base and novel classifiers. To ensure that the fused scores are not biased to either the base or novel classes, a new Transformer-based calibration module is introduced. It is known that the lower-level features are useful of detecting edge information in an input image than higher-level features. Thus, we build a cross-attention module that guides the classifier's final prediction using the fused multi-level features. However, transformers are computationally demanding. Crucially, to make the proposed cross-attention module training tractable at the pixel level, this module is designed based on feature-score cross-covariance and episodically trained to be generalizable at inference time. Extensive experiments on PASCAL-$5^{i}$ and COCO-$20^{i}$ show that our PCN outperforms the state-the-the-art alternatives by large margins.

PDF Abstract

Code

Add Remove Mark official

No code implementations yet. Submit your code now

Tasks

Add Remove

Few-Shot Semantic Segmentation

Generalized Few-Shot Semantic Segmentation

Semantic Segmentation

Datasets

MS COCO

PASCAL-5i

Results from the Paper

Edit

Ranked #3 on Generalized Few-Shot Semantic Segmentation on PASCAL-5i (5-Shot)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Result	Benchmark
Generalized Few-Shot Semantic Segmentation	PASCAL-5i (5-Shot)	PCN (ResNet-50)	Mean IoU	59.66	# 3		Compare
Generalized Few-Shot Semantic Segmentation	PASCAL-5i (5-Shot)	PCN (ResNet-50)	Mean Base and Novel	58.47	# 3		Compare

Methods

Add Remove

BASE • Concatenated Skip Connection • Cross-Attention Module • Softmax

Edit Social Preview

Prediction Calibration for Generalized Few-shot Semantic Segmentation

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove