Monocular 3D Object Detection

77 papers with code • 15 benchmarks • 5 datasets

Monocular 3D Object Detection is the task to draw 3D bounding box around objects in a single 2D RGB image. It is localization task but without any extra information like depth or other sensors or multiple-images.

Libraries

Use these libraries to find Monocular 3D Object Detection models and implementations

Latest papers with no code

MonoGAE: Roadside Monocular 3D Object Detection with Ground-Aware Embeddings

no code yet • 30 Sep 2023

We discover that most existing monocular 3D object detectors rely on the ego-vehicle prior assumption that the optical axis of the camera is parallel to the ground.

Polygon Intersection-over-Union Loss for Viewpoint-Agnostic Monocular 3D Vehicle Detection

no code yet • 13 Sep 2023

Monocular 3D object detection is a challenging task because depth information is difficult to obtain from 2D images.

S$^3$-MonoDETR: Supervised Shape&Scale-perceptive Deformable Transformer for Monocular 3D Object Detection

no code yet • 2 Sep 2023

These methods typically use visual and depth representations to generate query points on objects, whose quality plays a decisive role in the detection accuracy.

MonoNext: A 3D Monocular Object Detection with ConvNext

no code yet • 1 Aug 2023

Autonomous driving perception tasks rely heavily on cameras as the primary sensor for Object Detection, Semantic Segmentation, Instance Segmentation, and Object Tracking.

Monocular 3D Object Detection with LiDAR Guided Semi Supervised Active Learning

no code yet • 17 Jul 2023

We propose a novel semi-supervised active learning (SSAL) framework for monocular 3D object detection with LiDAR guidance (MonoLiG), which leverages all modalities of collected data during model development.

Understanding Depth Map Progressively: Adaptive Distance Interval Separation for Monocular 3d Object Detection

no code yet • 19 Jun 2023

In this paper, we propose a framework named the Adaptive Distance Interval Separation Network (ADISN) that adopts a novel perspective on understanding depth maps, as a form that lies between LiDAR and images.

Weakly Supervised 3D Object Detection with Multi-Stage Generalization

no code yet • 8 Jun 2023

We devise the DoubleClustering algorithm to obtain object clusters from reconstructed scene-level points, and further enhance the model's detection capabilities by developing three stages of generalization: progressing from complete to partial, static to dynamic, and close to distant.

Monocular 2D Camera-based Proximity Monitoring for Human-Machine Collision Warning on Construction Sites

no code yet • 29 May 2023

This study preliminarily reveals the potential and feasibility of proximity monitoring using only a 2D camera, providing a new promising and economical way for early warning of human-machine collisions.

MonoTDP: Twin Depth Perception for Monocular 3D Object Detection in Adverse Scenes

no code yet • 18 May 2023

3D object detection plays a crucial role in numerous intelligent vision systems.

Monocular 3D Object Detection with Bounding Box Denoising in 3D by Perceiver

no code yet • ICCV 2023

Motivated by a new and strong observation that this challenge can be remedied by a 3D-space local-grid search scheme in an ideal case, we propose a stage-wise approach, which combines the information flow from 2D-to-3D (3D bounding box proposal generation with a single 2D image) and 3D-to-2D (proposal verification by denoising with 3D-to-2D contexts) in a top-down manner.