Browse SoTA > Computer Vision > Object Detection

Object Detection

1005 papers with code · Computer Vision

Object detection is the task of detecting instances of objects of a certain class within an image. The state-of-the-art methods can be categorized into two main types: one-stage methods and two stage-methods. One-stage methods prioritize inference speed, and example models include YOLO, SSD and RetinaNet. Two-stage methods prioritize detection accuracy, and example models include Faster R-CNN, Mask R-CNN and Cascade R-CNN.

The most popular benchmark is the MSCOCO dataset. Models are typically evaluated according to a Mean Average Precision metric.

( Image credit: Detectron )

Benchmarks

TREND DATASET BEST METHOD PAPER TITLE PAPER CODE COMPARE

Latest papers with code

LambdaNetworks: Modeling long-range Interactions without Attention

ICLR 2021 lucidrains/lambda-networks

We present a general framework for capturing long-range interactions between an input and structured contextual information (e. g. a pixel surrounded by other pixels).

IMAGE CLASSIFICATION INSTANCE SEGMENTATION OBJECT DETECTION SCENE SEGMENTATION

1,165
01 Jan 2021

Rethinking Learnable Tree Filter for Generic Feature Transform

NeurIPS 2020 StevenGrove/LearnableTreeFilterV2

The Learnable Tree Filter presents a remarkable approach to model structure-preserving relations for semantic segmentation.

INSTANCE SEGMENTATION OBJECT DETECTION SEMANTIC SEGMENTATION

22
01 Dec 2020

Is normalization indispensable for training deep neural network?

NeurIPS 2020 hukkai/rescaling

In this paper, we study what would happen when normalization layers are removed from the network, and show how to train deep neural networks without normalization layers and without performance degradation.

IMAGE CLASSIFICATION MACHINE TRANSLATION OBJECT DETECTION VIDEO CLASSIFICATION

13
01 Dec 2020

Auto Learning Attention

NeurIPS 2020 btma48/AutoLA

Attention modules have been demonstrated effective in strengthening the representation ability of a neural network via reweighting spatial or channel features or stacking both operations sequentially.

IMAGE CLASSIFICATION KEYPOINT DETECTION OBJECT DETECTION

4
01 Dec 2020

Few-Cost Salient Object Detection with Adversarial-Paced Learning

NeurIPS 2020 hb-stone/FC-SOD

To address this problem, this paper proposes to learn the effective salient object detection model based on the manual annotation on a few training images only, thus dramatically alleviating human labor in training models.

OBJECT DETECTION SALIENCY DETECTION SALIENT OBJECT DETECTION

3
01 Dec 2020

Make One-Shot Video Object Segmentation Efficient Again

NeurIPS 2020 dvl-tum/e-osvos

However, recently the VOS community has deemed such a test time optimization and its impact on the test runtime as unfeasible.

OBJECT DETECTION SEMANTIC SEGMENTATION VIDEO OBJECT SEGMENTATION VIDEO SEMANTIC SEGMENTATION YOUTUBE-VOS

3
01 Dec 2020

Real-time gun detection in CCTV: An open problem

1 Dec 2020jossalgon/US-Real-time-gun-detection-in-CCTV-An-open-problem-dataset

This article presents a new dataset obtained from a real CCTV installed in a university and the generation of synthetic images, to which Faster R-CNN was applied using Feature Pyramid Network with ResNet-50 resulting in a weapon detection model able to be used in quasi-real-time CCTV (90 ms of inference time with an NVIDIA GeForce GTX-1080Ti card) improving the state of the art on weapon detection in a two stages training.

SMALL OBJECT DETECTION

1
01 Dec 2020

Monocular 3D Object Detection with Sequential Feature Association and Depth Hint Augmentation

30 Nov 2020gtzly/FADNet

In this work, a single-stage keypoint-based network, named as FADNet, is presented to address the task of monocular 3D object detection.

AUTONOMOUS DRIVING DEPTH ESTIMATION MONOCULAR 3D OBJECT DETECTION

1
30 Nov 2020

Move to See Better: Towards Self-Supervised Amodal Object Detection

30 Nov 2020ayushjain1144/SeeingByMoving

In this paper, we propose a self-supervised framework to improve an object detector in unseen scenarios by moving an agent around in a 3D environment and aggregating multi-view RGB-D information.

OBJECT DETECTION

0
30 Nov 2020
139
26 Nov 2020