Open Vocabulary Object Detection

55 papers with code • 4 benchmarks • 6 datasets

Open-vocabulary detection (OVD) aims to generalize beyond the limited number of base classes labeled during the training phase. The goal is to detect novel classes defined by an unbounded (open) vocabulary at inference.
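
As a rough illustration of how a vocabulary can stay open at inference, the sketch below scores one region embedding against text embeddings of class names and picks the best match. It is a minimal toy under stated assumptions, not the method of any paper listed here: the 3-dimensional embeddings are invented, and `classify_region` and `background_threshold` are hypothetical names. A real system would typically take both embeddings from a pretrained vision-language model.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def classify_region(region_emb, text_embs, background_threshold=0.5):
    """Return (label, score) for the class name whose text embedding is
    most similar to the region embedding, or "background" if no class
    reaches the (hypothetical) threshold."""
    best_label, best_score = None, -1.0
    for label, emb in text_embs.items():
        score = cosine(region_emb, emb)
        if score > best_score:
            best_label, best_score = label, score
    if best_score < background_threshold:
        return "background", best_score
    return best_label, best_score


# Toy text embeddings for two base classes (made-up values).
text_embs = {
    "cat": [1.0, 0.0, 0.0],
    "dog": [0.0, 1.0, 0.0],
}
# A novel class is added at inference by supplying its name embedding;
# nothing is retrained.
text_embs["zebra"] = [0.0, 0.0, 1.0]

region = [0.1, 0.2, 0.9]  # made-up embedding of one region proposal
label, score = classify_region(region, text_embs)
print(label, round(score, 3))
```

Because the classifier is only a similarity lookup over name embeddings, extending the vocabulary amounts to adding entries to `text_embs`.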

Benchmarks

These leaderboards track progress in Open Vocabulary Object Detection.

Dataset	Best Model
MSCOCO	Cooperative Foundational Models
LVIS v1.0	DITO
OpenImages-v4	Object-Centric-OVD
Objects365	Object-Centric-OVD

Libraries

Use these libraries to find Open Vocabulary Object Detection models and implementations.

faceonlive/ai-research

2 papers

132

om-ai-lab/OmDet

2 papers

Datasets

Subtasks

Open Vocabulary Attribute Detection

Most implemented papers

Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation

warlockwendell/aggdet • 12 Apr 2024

Specifically, in the region-proposal stage, proposals that contain novel instances showcase lower objectness scores, since they are treated as background proposals during the training phase.

Open-Vocabulary Object Detection Using Captions

alirezazareian/ovr-cnn • CVPR 2021

Weakly supervised and zero-shot learning techniques have been explored to scale object detectors to more categories with less supervision, but they have not been as successful and widely adopted as supervised models.

Open Vocabulary Object Detection with Pseudo Bounding-Box Labels

salesforce/pb-ovd • 18 Nov 2021

To enlarge the set of base classes, we propose a method to automatically generate pseudo bounding-box annotations of diverse objects from large-scale image-caption pairs.

RegionCLIP: Region-based Language-Image Pretraining

microsoft/regionclip • CVPR 2022

However, we show that directly applying such models to recognize image regions for object detection leads to poor performance due to a domain shift: CLIP was trained to match an image as a whole to a text description, without capturing the fine-grained alignment between image regions and text spans.

Detecting Twenty-thousand Classes using Image-level Supervision

facebookresearch/Detic • 7 Jan 2022

For the first time, we train a detector with all the twenty-one-thousand classes of the ImageNet dataset and show that it generalizes to new datasets without finetuning.

Open-Vocabulary One-Stage Detection with Hierarchical Visual-Language Knowledge Distillation

mengqidyangge/hierkd • CVPR 2022

Open-vocabulary object detection aims to detect novel object categories beyond the training set.

Open-Vocabulary DETR with Conditional Matching

yuhangzang/ov-detr • 22 Mar 2022

To this end, we propose a novel open-vocabulary detector based on DETR -- hence the name OV-DETR -- which, once trained, can detect any object given its class name or an exemplar image.

Learning to Prompt for Open-Vocabulary Object Detection with Vision-Language Model

dyabel/detpro • CVPR 2022

In this paper, we introduce a novel method, detection prompt (DetPro), to learn continuous prompt representations for open-vocabulary object detection based on the pre-trained vision-language model.

Localized Vision-Language Matching for Open-vocabulary Object Detection

lmb-freiburg/locov • 12 May 2022

In this work, we propose an open-vocabulary object detection method that, based on image-caption pairs, learns to detect novel object classes along with a given set of known classes.

GLIPv2: Unifying Localization and Vision-Language Understanding

microsoft/GLIP • 12 Jun 2022

We present GLIPv2, a grounded VL understanding model, that serves both localization tasks (e.g., object detection, instance segmentation) and Vision-Language (VL) understanding tasks (e.g., VQA, image captioning).
