Monocular 3D Object Detection

77 papers with code • 15 benchmarks • 5 datasets

Monocular 3D Object Detection is the task to draw 3D bounding box around objects in a single 2D RGB image. It is localization task but without any extra information like depth or other sensors or multiple-images.

Benchmarks

Add a Result

These leaderboards are used to track progress in Monocular 3D Object Detection

Dataset	Best Model	Compare
KITTI Cars Moderate	CIE	See all
SUN RGB-D	IM3D	See all
KITTI Cars Hard	CIE	See all
KITTI Pedestrian Hard	DD3D	See all
KITTI Cars Easy	CIE	See all
KITTI Pedestrian Easy	CMKD	See all
KITTI Pedestrian Moderate	CMKD	See all
Google Objectron	Lin2021	See all
KITTI Cyclist Easy	CMKD	See all
KITTI Cyclist Moderate	CMKD	See all
KITTI Cyclist Hard	CMKD	See all
KITTI Pedestrians Moderate val	CubifAE-3D	See all
Virtual KITTI 2	CubifAE-3D	See all
CoPerception-UAVs	Where2comm	See all
OPV2V	Where2comm	See all

Show all 15 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Monocular 3D Object Detection models and implementations

open-mmlab/mmdetection3d

3 papers

4,834

PaddlePaddle/Paddle3D

2 papers

541

Owen-Liuyuxuan/visualDet3D

2 papers

358

Datasets

Latest papers with no code

Most implemented Social Latest No code

MonoGAE: Roadside Monocular 3D Object Detection with Ground-Aware Embeddings

no code yet • 30 Sep 2023

We discover that most existing monocular 3D object detectors rely on the ego-vehicle prior assumption that the optical axis of the camera is parallel to the ground.

Paper
Add Code

Polygon Intersection-over-Union Loss for Viewpoint-Agnostic Monocular 3D Vehicle Detection

no code yet • 13 Sep 2023

Monocular 3D object detection is a challenging task because depth information is difficult to obtain from 2D images.

Paper
Add Code

S$^3$-MonoDETR: Supervised Shape&Scale-perceptive Deformable Transformer for Monocular 3D Object Detection

no code yet • 2 Sep 2023

These methods typically use visual and depth representations to generate query points on objects, whose quality plays a decisive role in the detection accuracy.

Paper
Add Code

MonoNext: A 3D Monocular Object Detection with ConvNext

no code yet • 1 Aug 2023

Autonomous driving perception tasks rely heavily on cameras as the primary sensor for Object Detection, Semantic Segmentation, Instance Segmentation, and Object Tracking.

Paper
Add Code

Monocular 3D Object Detection with LiDAR Guided Semi Supervised Active Learning

no code yet • 17 Jul 2023

We propose a novel semi-supervised active learning (SSAL) framework for monocular 3D object detection with LiDAR guidance (MonoLiG), which leverages all modalities of collected data during model development.

Paper
Add Code

Understanding Depth Map Progressively: Adaptive Distance Interval Separation for Monocular 3d Object Detection

no code yet • 19 Jun 2023

In this paper, we propose a framework named the Adaptive Distance Interval Separation Network (ADISN) that adopts a novel perspective on understanding depth maps, as a form that lies between LiDAR and images.

Paper
Add Code

Weakly Supervised 3D Object Detection with Multi-Stage Generalization

no code yet • 8 Jun 2023

We devise the DoubleClustering algorithm to obtain object clusters from reconstructed scene-level points, and further enhance the model's detection capabilities by developing three stages of generalization: progressing from complete to partial, static to dynamic, and close to distant.

Paper
Add Code

Monocular 2D Camera-based Proximity Monitoring for Human-Machine Collision Warning on Construction Sites

no code yet • 29 May 2023

This study preliminarily reveals the potential and feasibility of proximity monitoring using only a 2D camera, providing a new promising and economical way for early warning of human-machine collisions.

Paper
Add Code

MonoTDP: Twin Depth Perception for Monocular 3D Object Detection in Adverse Scenes

no code yet • 18 May 2023

3D object detection plays a crucial role in numerous intelligent vision systems.

Paper
Add Code

Monocular 3D Object Detection with Bounding Box Denoising in 3D by Perceiver

no code yet • ICCV 2023

Motivated by a new and strong observation that this challenge can be remedied by a 3D-space local-grid search scheme in an ideal case, we propose a stage-wise approach, which combines the information flow from 2D-to-3D (3D bounding box proposal generation with a single 2D image) and 3D-to-2D (proposal verification by denoising with 3D-to-2D contexts) in a top-down manner.

Paper
Add Code

Monocular 3D Object Detection

Benchmarks Add a Result

Libraries

Datasets

Latest papers with no code

Content

Benchmarks

Add a Result