2D Object Detection

84 papers with code • 14 benchmarks • 59 datasets

This task has no description! Would you like to contribute one?

Benchmarks

Add a Result

These leaderboards are used to track progress in 2D Object Detection

Dataset	Best Model	Compare
SARDet-100K	MSFA (F-RCNN+ConvNext-T)	See all
CeyMo	TransMind	See all
Clear Weather	HRFuser-T	See all
Dense Fog	HRFuser-T	See all
BDD100K val	InternImage-H	See all
ExDark	MAET	See all
RF100	GLIP	See all
TRR360D	rotated-retinanet-rbox-r360_r50_fpn_6x.py	See all
FishEye8K	Yolov8x (640x640)	See all
VisDrone	CZ-Det	See all
RADIATE	TempoRadar	See all
CLCXray	LACLS	See all
RadioGalaxyNET Dataset	Gal-DINO	See all
ETDII Dataset	SCAResNet	See all

Show all 14 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find 2D Object Detection models and implementations

PaddlePaddle/PaddleDetection

12 papers

12,085

open-mmlab/mmdetection

9 papers

27,845

alibaba/EasyCV

4 papers

1,685

AlexeyAB/darknet

3 papers

21,460

See all 19 libraries.

Datasets

Subtasks

Open Vocabulary Object Detection

Semi-Supervised Object Detection

Novel Object Detection

Long-tailed Object Detection

Hand Detection

Vessel Detection

medical image detection

Drivable Area Detection

2D Cyclist Detection

SAR Ship Detection

2D Tiny Object Detection

Intention-oriented Object Detection

Most implemented papers

Most implemented Social Latest No code

RandomRooms: Unsupervised Pre-training from Synthetic Shapes and Randomized Layouts for 3D Object Detection

wyf-accept/backtoreality • • ICCV 2021

In particular, we propose to generate random layouts of a scene by making use of the objects in the synthetic CAD dataset and learn the 3D scene representation by applying object-level contrastive learning on two random scenes generated from the same set of synthetic objects.

Paper
Code

Frustum-PointPillars: A Multi-Stage Approach for 3D Object Detection using RGB Camera and LiDAR

anshulpaigwar/Frustum-Pointpillars • • 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW) 2021

We train our network on the KITTI dataset and perform experiments to show the effectiveness of our network.

Paper
Code

Grounded Language-Image Pre-training

microsoft/GLIP • • CVPR 2022

The unification brings two benefits: 1) it allows GLIP to learn from both detection and grounding data to improve both tasks and bootstrap a good grounding model; 2) GLIP can leverage massive image-text pairs by generating grounding boxes in a self-training fashion, making the learned representation semantic-rich.

Paper
Code

Detecting Overlapping Objects in X-ray Security Imagery by a Label-aware Mechanism

greysonphoenix/clcxray • IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY 2022

One of the key challenges to the X-ray security check is to detect the overlapped items in backpacks or suitcases in the X-ray images.

Paper
Code

Multitask AET with Orthogonal Tangent Regularity for Dark Object Detection

cuiziteng/maet • • ICCV 2021

To enhance object detection in a dark environment, we propose a novel multitask auto encoding transformation (MAET) model which is able to explore the intrinsic pattern behind illumination translation.

Paper
Code

InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

opengvlab/internimage • • CVPR 2023

Compared to the great progress of large-scale vision transformers (ViTs) in recent years, large-scale models based on convolutional neural networks (CNNs) are still in an early state.

Paper
Code

Object Detection with Transformers: A Review

mindgarage-shan/transformer_object_detection_survey • 7 Jun 2023

The astounding performance of transformers in natural language processing (NLP) has motivated researchers to explore their applications in computer vision tasks.

Paper
Code

OpenAgents: An Open Platform for Language Agents in the Wild

xlang-ai/openagents • 16 Oct 2023

Language agents show potential in being capable of utilizing natural language for varied and intricate tasks in diverse environments, particularly when built upon large language models (LLMs).

Paper
Code