Object discovery is the task of identifying previously unseen objects.
Data efficiency and robustness to task-irrelevant perturbations are long-standing challenges for deep reinforcement learning algorithms.
The ability to decompose scenes in terms of abstract building blocks is crucial for general intelligence.
Our key contribution is the collection of a large-scale dataset consisting of 150K human-played games with a total of 800K visual question-answer pairs on 66K images.
This paper tests the hypothesis that modeling a scene in terms of entities and their local interactions, as opposed to modeling the scene globally, provides a significant benefit in generalizing to physical tasks in a combinatorial space the learner has not encountered before.
A range of methods with suitable inductive biases exist to learn interpretable object-centric representations of images without supervision.
Generative latent-variable models are emerging as promising tools in robotics and reinforcement learning.
Patch-level image representation is important for object classification and detection, since it is robust to spatial transformations, scale variations, and cluttered backgrounds.
We present an unsupervised learning framework for decomposing images into layers of automatically discovered object models.