TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Few-Shot Object Detection	MS-COCO (10-shot)	RISF (Resnet-101)	AP	21.9	# 6
Few-Shot Object Detection	MS-COCO (10-shot)	RISF (SWIN-Large)	AP	25.5	# 2
Few-Shot Object Detection	MS-COCO (1-shot)	RISF	AP	11.7	# 2
Few-Shot Object Detection	MS-COCO (30-shot)	RISF (SWIN-Large)	AP	31.9	# 2
Few-Shot Object Detection	MS-COCO (30-shot)	RISF (Resnet-101)	AP	24.4	# 6

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/re-scoring-using-image-language-similarity/few-shot-object-detection-on-ms-coco-10-shot)](https://paperswithcode.com/sota/few-shot-object-detection-on-ms-coco-10-shot?p=re-scoring-using-image-language-similarity)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/re-scoring-using-image-language-similarity/few-shot-object-detection-on-ms-coco-1-shot)](https://paperswithcode.com/sota/few-shot-object-detection-on-ms-coco-1-shot?p=re-scoring-using-image-language-similarity)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/re-scoring-using-image-language-similarity/few-shot-object-detection-on-ms-coco-30-shot)](https://paperswithcode.com/sota/few-shot-object-detection-on-ms-coco-30-shot?p=re-scoring-using-image-language-similarity)`

Re-Scoring Using Image-Language Similarity for Few-Shot Object Detection

1 Nov 2023 · Min Jae Jung, Seung Dae Han, Joohee Kim ·

Few-shot object detection, which focuses on detecting novel objects with few labels, is an emerging challenge in the community. Recent studies show that adapting a pre-trained model or modified loss function can improve performance. In this paper, we explore leveraging the power of Contrastive Language-Image Pre-training (CLIP) and hard negative classification loss in low data setting. Specifically, we propose Re-scoring using Image-language Similarity for Few-shot object detection (RISF) which extends Faster R-CNN by introducing Calibration Module using CLIP (CM-CLIP) and Background Negative Re-scale Loss (BNRL). The former adapts CLIP, which performs zero-shot classification, to re-score the classification scores of a detector using image-class similarities, the latter is modified classification loss considering the punishment for fake backgrounds as well as confusing categories on a generalized few-shot object detection dataset. Extensive experiments on MS-COCO and PASCAL VOC show that the proposed RISF substantially outperforms the state-of-the-art approaches. The code will be available.

PDF Abstract

Code

Add Remove Mark official

INFINIQ-AI1/RISF official

Tasks

Add Remove

Classification

Few-Shot Object Detection

Object

object-detection

Object Detection

Zero-Shot Learning

Datasets

MS COCO

Results from the Paper

Edit

Ranked #2 on Few-Shot Object Detection on MS-COCO (30-shot)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Few-Shot Object Detection	MS-COCO (10-shot)	RISF (Resnet-101)	AP	21.9	# 6	Compare
Few-Shot Object Detection	MS-COCO (10-shot)	RISF (SWIN-Large)	AP	25.5	# 2	Compare
Few-Shot Object Detection	MS-COCO (1-shot)	RISF	AP	11.7	# 2	Compare
Few-Shot Object Detection	MS-COCO (30-shot)	RISF (SWIN-Large)	AP	31.9	# 2	Compare
Few-Shot Object Detection	MS-COCO (30-shot)	RISF (Resnet-101)	AP	24.4	# 6	Compare

Methods

Add Remove

CLIP • Convolution • Faster R-CNN • RoIPool • RPN • Softmax

Edit Social Preview

Re-Scoring Using Image-Language Similarity for Few-Shot Object Detection

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove