Zero-Shot Object Detection

26 papers with code • 7 benchmarks • 6 datasets

Zero-shot object detection (ZSD) is the task of object detection where no visual training data is available for some of the target object classes.
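As a rough illustration of this setting (not any listed paper's method), a detector can label a region by comparing its embedding with class semantic vectors, so classes that had no visual training data can still be predicted. The sketch below assumes made-up 3-dimensional vectors and hypothetical helper names (`cosine`, `classify_region`); real systems use learned region features and word or text embeddings.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def classify_region(region_embedding, class_embeddings):
    """Return the class whose semantic vector is most similar to the region."""
    return max(
        class_embeddings,
        key=lambda name: cosine(region_embedding, class_embeddings[name]),
    )

# Toy semantic vectors (assumed values, for illustration only).
SEEN = {"dog": [1.0, 0.1, 0.0], "car": [0.0, 0.1, 1.0]}   # had visual training data
UNSEEN = {"zebra": [0.7, 0.7, 0.0]}                        # no visual training data

# At test time the label space covers both seen and unseen classes.
all_classes = {**SEEN, **UNSEEN}

# Hypothetical embedding of one detected region, projected into the semantic space.
region = [0.6, 0.8, 0.1]
predicted = classify_region(region, all_classes)
print(predicted)
```

Localization (proposing the region itself) is left out here; the sketch covers only the recognition step that lets an unseen class name be assigned to a box.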

( Image credit: Zero-Shot Object Detection: Learning to Simultaneously Recognize and Localize Novel Concepts )

Benchmarks

These leaderboards are used to track progress in Zero-Shot Object Detection.

Dataset	Best Model
MS-COCO	SeeDS
PASCAL VOC'07	SeeDS
LVIS v1.0 minival	OWLv2 (OWL-ST+FT)
LVIS v1.0 val	OWLv2 (OWL-ST+FT)
ODinW	Grounding DINO
MSCOCO	Grounding DINO (without COCO data)
ImageNet Detection	SUZOD

Libraries

Use these libraries to find Zero-Shot Object Detection models and implementations.

microsoft/GLIP

2 papers

1,952

Most implemented papers

Zero-Shot Object Detection: Learning to Simultaneously Recognize and Localize Novel Concepts

salman-h-khan/ZSD_Release • 16 Mar 2018

We hypothesize that this setting is ill-suited for real-world applications where unseen objects appear only as a part of a complex scene, warranting both the 'recognition' and 'localization' of an unseen category.

GTNet: Generative Transfer Network for Zero-Shot Object Detection

X-BrainLab/GTNet • 19 Jan 2020

FFU and BFU add the IoU variance to the results of CFU, yielding class-specific foreground and background features, respectively.

Background Learnable Cascade for Zero-Shot Object Detection

zhengye1995/BLC • 9 Oct 2020

The major contributions of BLC are as follows: (i) we propose a multi-stage cascade structure named Cascade Semantic R-CNN to progressively refine the visual-semantic alignment for ZSD; (ii) we develop a semantic information flow structure and add it directly between the stages of Cascade Semantic R-CNN to further improve semantic feature learning; (iii) we propose the background learnable region proposal network (BLRPN) to learn an appropriate word vector for the background class and use this learned vector in Cascade Semantic R-CNN, a design that makes the background learnable and reduces the confusion between background and unseen classes.

Zero-shot Object Detection Through Vision-Language Embedding Alignment

Johnathan-Xie/ZSD-YOLO • 24 Sep 2021

Recent approaches have shown that training deep neural networks directly on large-scale image-text pair collections enables zero-shot transfer on various recognition tasks.

Many Heads but One Brain: Fusion Brain -- a Competition and a Single Multimodal Multitask Architecture

sberbank-ai/fusion_brain_aij2021 • 22 Nov 2021

Supporting the current trend in the AI community, we present the AI Journey 2021 Challenge called Fusion Brain, the first competition aimed at building a universal architecture that can process different modalities (in this case, images, texts, and code) and solve multiple vision and language tasks.

Robust Region Feature Synthesizer for Zero-Shot Object Detection

HPL123/RRFS • CVPR 2022

Zero-shot object detection aims at incorporating class semantic vectors to realize the detection of (both seen and) unseen classes given an unconstrained test image.

From Node to Graph: Joint Reasoning on Visual-Semantic Relational Graph for Zero-Shot Detection

witnessai/GRAN • Winter Conference on Applications of Computer Vision (WACV) 2022

Zero-Shot Detection (ZSD), which aims at localizing and recognizing unseen objects in a complicated scene, usually leverages the visual and semantic information of individual objects alone.

Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection

mmaaz60/mvits_for_class_agnostic_od • 7 Jul 2022

Two popular forms of weak supervision used in open-vocabulary detection (OVD) are a pretrained CLIP model and image-level supervision.

Resolving Semantic Confusions for Improved Zero-Shot Detection

sandipan211/ZSD-SC-Resolver • British Machine Vision Conference 2022

Zero-shot detection (ZSD) is a challenging task where we aim to recognize and localize objects simultaneously, even when our model has not been trained with visual samples of a few target ("unseen") classes.

ZBS: Zero-shot Background Subtraction via Instance-level Background Modeling and Foreground Selection

casia-iva-lab/zbs • CVPR 2023

However, previous unsupervised deep learning BGS algorithms perform poorly in sophisticated scenarios such as shadows or night lights, and they cannot detect objects outside the pre-defined categories.
