Object recognition is a computer vision technique for detecting and classifying objects in images or videos. Because it combines object detection and image classification, the state-of-the-art tables are recorded separately for each component task.

Going Deeper with Convolutions

tensorflow/models CVPR 2015

We propose a deep convolutional neural network architecture codenamed "Inception", which was responsible for setting the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC 2014).

Feature Pyramid Grids

open-mmlab/mmdetection 7 Apr 2020

Feature pyramid networks have been widely adopted in the object detection literature to improve feature representations for better handling of variations in scale.

GCNet: Non-local Networks Meet Squeeze-Excitation Networks and Beyond

open-mmlab/mmdetection 25 Apr 2019

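In this paper, we take advantage of this finding [that the global contexts modeled by a non-local network are almost the same for different query positions] to create a simplified network based on a query-independent formulation, which maintains the accuracy of NLNet but with significantly less computation.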

Densely Connected Convolutional Networks

pytorch/vision CVPR 2017

Recent work has shown that convolutional networks can be substantially deeper, more accurate, and efficient to train if they contain shorter connections between layers close to the input and those close to the output.

Distinctive Image Features from Scale-Invariant Keypoints

kornia/kornia 5 Jan 2004

This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene.

DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition

jetpacapp/DeepBeliefSDK 6 Oct 2013

We evaluate whether features extracted from the activation of a deep convolutional network trained in a fully supervised fashion on a large, fixed set of object recognition tasks can be re-purposed to novel generic tasks.

OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks

soumith/convnet-benchmarks 21 Dec 2013

This integrated framework is the winner of the localization task of the ImageNet Large Scale Visual Recognition Challenge 2013 (ILSVRC2013) and obtained very competitive results for the detection and classification tasks.

COCO-Text: Dataset and Benchmark for Text Detection and Recognition in Natural Images

xiaofengShi/CHINESE-OCR 26 Jan 2016

The goal of COCO-Text is to advance the state of the art in text detection and recognition in natural images.

Cutting the Error by Half: Investigation of Very Deep CNN and Advanced Training Strategies for Document Image Classification

microsoft/unilm 11 Apr 2017

We present an exhaustive investigation of recent Deep Learning architectures, algorithms, and strategies for the task of document image classification to finally reduce the error by more than half.

