Real-Time Object Detection

107 papers with code • 7 benchmarks • 8 datasets

Real-Time Object Detection is a computer vision task that involves identifying and locating objects of interest in real-time video sequences with fast inference while maintaining a base level of accuracy.

This is typically solved using algorithms that combine object detection and tracking techniques to accurately detect and track objects in real-time. They use a combination of feature extraction, object proposal generation, and classification to detect and localize objects of interest.

( Image credit: CenterNet )

Benchmarks

Add a Result

These leaderboards are used to track progress in Real-Time Object Detection

Dataset	Best Model	Compare
MS COCO	YOLOv6-L6(1280)	See all
PASCAL VOC 2007	YOLO	See all
Argoverse-HD (Full-Stack, Val)	RL-Based Dynamic-Online Policy	See all
Argoverse-HD (Detection-Only, Val)	YOLOX	See all
Argoverse-HD (Detection-Only, Test)	YOLOX	See all
Argoverse-HD (Full-Stack, Test)	YOLOX	See all
COCO 2017 val	DINO-5scale (12 epochs, 900 queries)	See all

Libraries

Use these libraries to find Real-Time Object Detection models and implementations

PaddlePaddle/PaddleDetection

21 papers

12,008

open-mmlab/mmdetection

8 papers

27,643

AlexeyAB/darknet

8 papers

21,422

osmr/imgclsmob

8 papers

2,916

See all 36 libraries.

Datasets

Most implemented papers

Most implemented Social Latest No code

EfficientDet: Scalable and Efficient Object Detection

google/automl • • CVPR 2020

Model efficiency has become increasingly important in computer vision.

Paper
Code

R-FCN: Object Detection via Region-based Fully Convolutional Networks

daijifeng001/r-fcn • • NeurIPS 2016

In contrast to previous region-based detectors such as Fast/Faster R-CNN that apply a costly per-region subnetwork hundreds of times, our region-based detector is fully convolutional with almost all computation shared on the entire image.

Paper
Code

A ConvNet for the 2020s

facebookresearch/ConvNeXt • • CVPR 2022

The "Roaring 20s" of visual recognition began with the introduction of Vision Transformers (ViTs), which quickly superseded ConvNets as the state-of-the-art image classification model.

Paper
Code

Scaled-YOLOv4: Scaling Cross Stage Partial Network

WongKinYiu/ScaledYOLOv4 • • CVPR 2021

We show that the YOLOv4 object detection neural network based on the CSP approach, scales both up and down and is applicable to small and large networks while maintaining optimal speed and accuracy.

Paper
Code

YOLOX: Exceeding YOLO Series in 2021

Megvii-BaseDetection/YOLOX • • 18 Jul 2021

In this report, we present some experienced improvements to YOLO series, forming a new high-performance detector -- YOLOX.

Paper
Code

End-to-End Object Detection with Transformers

facebookresearch/detr • • ECCV 2020

We present a new method that views object detection as a direct set prediction problem.

Paper
Code

MnasNet: Platform-Aware Neural Architecture Search for Mobile

tensorflow/tpu • • CVPR 2019

In this paper, we propose an automated mobile neural architecture search (MNAS) approach, which explicitly incorporate model latency into the main objective so that the search can identify a model that achieves a good trade-off between accuracy and latency.

Paper
Code

HarDNet: A Low Memory Traffic Network

PingoLH/Pytorch-HarDNet • • ICCV 2019

We propose a Harmonic Densely Connected Network to achieve high efficiency in terms of both low MACs and memory traffic.

Paper
Code

Deformable DETR: Deformable Transformers for End-to-End Object Detection

fundamentalvision/Deformable-DETR • • ICLR 2021

DETR has been recently proposed to eliminate the need for many hand-designed components in object detection while demonstrating good performance.

Paper
Code

YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors

wongkinyiu/yolov7 • • CVPR 2023

YOLOv7 surpasses all known object detectors in both speed and accuracy in the range from 5 FPS to 160 FPS and has the highest accuracy 56. 8% AP among all known real-time object detectors with 30 FPS or higher on GPU V100.

Paper
Code

Real-Time Object Detection

Benchmarks Add a Result

Libraries

Datasets

Most implemented papers

Content

Benchmarks

Add a Result