Browse > Computer Vision > Object Detection

Object Detection

234 papers with code · Computer Vision

Object detection is a computer technology related to computer vision and image processing that deals with detecting instances of semantic objects of a certain class (such as humans, buildings, or cars) in digital images and videos.

State-of-the-art leaderboards

Greatest papers with code

Pooling Pyramid Network for Object Detection

9 Jul 2018tensorflow/models

We'd like to share a simple tweak of Single Shot Multibox Detector (SSD) family of detectors, which is effective in reducing model size while maintaining the same quality. We share box predictors across all scales, and replace convolution between scales with max pooling.

OBJECT DETECTION

MobileNetV2: Inverted Residuals and Linear Bottlenecks

CVPR 2018 tensorflow/models

In this paper we describe a new mobile architecture, MobileNetV2, that improves the state of the art performance of mobile models on multiple tasks and benchmarks as well as across a spectrum of different model sizes. We also describe efficient ways of applying these mobile models to object detection in a novel framework we call SSDLite.

OBJECT DETECTION SEMANTIC SEGMENTATION

Focal Loss for Dense Object Detection

ICCV 2017 tensorflow/models

We propose to address this class imbalance by reshaping the standard cross entropy loss such that it down-weights the loss assigned to well-classified examples. Our novel Focal Loss focuses training on a sparse set of hard examples and prevents the vast number of easy negatives from overwhelming the detector during training.

#13 best model for Object Detection on COCO

OBJECT DETECTION

MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications

17 Apr 2017tensorflow/models

We present a class of efficient models called MobileNets for mobile and embedded vision applications. MobileNets are based on a streamlined architecture that uses depth-wise separable convolutions to build light weight deep neural networks.

IMAGE CLASSIFICATION OBJECT DETECTION

Mask R-CNN

ICCV 2017 tensorflow/models

Our approach efficiently detects objects in an image while simultaneously generating a high-quality segmentation mask for each instance. We show top results in all three tracks of the COCO suite of challenges, including instance segmentation, bounding-box object detection, and person keypoint detection.

HUMAN PART SEGMENTATION INSTANCE SEGMENTATION KEYPOINT DETECTION MULTI-HUMAN PARSING OBJECT DETECTION SEMANTIC SEGMENTATION

Speed/accuracy trade-offs for modern convolutional object detectors

CVPR 2017 tensorflow/models

The goal of this paper is to serve as a guide for selecting a detection architecture that achieves the right speed/memory/accuracy balance for a given application and platform. On the opposite end in which accuracy is critical, we present a detector that achieves state-of-the-art performance measured on the COCO detection task.

#19 best model for Object Detection on COCO

OBJECT DETECTION

Deep Residual Learning for Image Recognition

CVPR 2016 tensorflow/models

We present a residual learning framework to ease the training of networks that are substantially deeper than those used previously. Deep residual nets are foundations of our submissions to ILSVRC & COCO 2015 competitions, where we also won the 1st places on the tasks of ImageNet detection, ImageNet localization, COCO detection, and COCO segmentation.

IMAGE CLASSIFICATION OBJECT DETECTION

Going Deeper with Convolutions

CVPR 2015 tensorflow/models

We propose a deep convolutional neural network architecture codenamed "Inception", which was responsible for setting the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC 2014). The main hallmark of this architecture is the improved utilization of the computing resources inside the network.

IMAGE CLASSIFICATION OBJECT DETECTION OBJECT RECOGNITION

Group Normalization

ECCV 2018 facebookresearch/detectron

GN's computation is independent of batch sizes, and its accuracy is stable in a wide range of batch sizes. GN can outperform its BN-based counterparts for object detection and segmentation in COCO, and for video classification in Kinetics, showing that GN can effectively replace the powerful BN in a variety of tasks.

OBJECT DETECTION VIDEO CLASSIFICATION

Data Distillation: Towards Omni-Supervised Learning

CVPR 2018 facebookresearch/detectron

We investigate omni-supervised learning, a special regime of semi-supervised learning in which the learner exploits all available labeled data plus internet-scale sources of unlabeled data. Omni-supervised learning is lower-bounded by performance on existing labeled datasets, offering the potential to surpass state-of-the-art fully supervised methods.

KEYPOINT DETECTION OBJECT DETECTION