Monocular 3D Object Detection

76 papers with code • 15 benchmarks • 5 datasets

Monocular 3D Object Detection is the task to draw 3D bounding box around objects in a single 2D RGB image. It is localization task but without any extra information like depth or other sensors or multiple-images.

Benchmarks

Add a Result

These leaderboards are used to track progress in Monocular 3D Object Detection

Dataset	Best Model	Compare
KITTI Cars Moderate	CIE	See all
SUN RGB-D	IM3D	See all
KITTI Cars Hard	CIE	See all
KITTI Pedestrian Hard	DD3D	See all
KITTI Cars Easy	CIE	See all
KITTI Pedestrian Easy	CMKD	See all
KITTI Pedestrian Moderate	CMKD	See all
Google Objectron	Lin2021	See all
KITTI Cyclist Easy	CMKD	See all
KITTI Cyclist Moderate	CMKD	See all
KITTI Cyclist Hard	CMKD	See all
KITTI Pedestrians Moderate val	CubifAE-3D	See all
Virtual KITTI 2	CubifAE-3D	See all
CoPerception-UAVs	Where2comm	See all
OPV2V	Where2comm	See all

Show all 15 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Monocular 3D Object Detection models and implementations

open-mmlab/mmdetection3d

3 papers

4,808

PaddlePaddle/Paddle3D

2 papers

534

Owen-Liuyuxuan/visualDet3D

2 papers

358

Datasets

Latest papers with no code

Most implemented Social Latest No code

VFMM3D: Releasing the Potential of Image by Vision Foundation Model for Monocular 3D Object Detection

no code yet • 15 Apr 2024

Therefore, an effective solution involves transforming monocular images into LiDAR-like representations and employing a LiDAR-based 3D object detector to predict the 3D coordinates of objects.

Paper
Add Code

MOSE: Boosting Vision-based Roadside 3D Object Detection with Scene Cues

no code yet • 8 Apr 2024

3D object detection based on roadside cameras is an additional way for autonomous driving to alleviate the challenges of occlusion and short perception range from vehicle cameras.

Paper
Add Code

MonoTAKD: Teaching Assistant Knowledge Distillation for Monocular 3D Object Detection

no code yet • 7 Apr 2024

Subsequently, we introduce the cross-modal residual distillation to transfer the 3D spatial cues.

Paper
Add Code

Decoupled Pseudo-labeling for Semi-Supervised Monocular 3D Object Detection

no code yet • 26 Mar 2024

Additionally, we present a DepthGradient Projection (DGP) module to mitigate optimization conflicts caused by noisy depth supervision of pseudo-labels, effectively decoupling the depth gradient and removing conflicting gradients.

Paper
Add Code

Multi-task Learning for Real-time Autonomous Driving Leveraging Task-adaptive Attention Generator

no code yet • 6 Mar 2024

Real-time processing is crucial in autonomous driving systems due to the imperative of instantaneous decision-making and rapid response.

Paper
Add Code

UniMODE: Unified Monocular 3D Object Detection

no code yet • 28 Feb 2024

To address these challenges, we build a detector based on the bird's-eye-view (BEV) detection paradigm, where the explicit feature projection is beneficial to addressing the geometry learning ambiguity when employing multiple scenarios of data to train detectors.

Paper
Add Code

You Only Look Bottom-Up for Monocular 3D Object Detection

no code yet • 27 Jan 2024

Monocular 3D Object Detection is an essential task for autonomous driving.

Paper
Add Code

Depth-discriminative Metric Learning for Monocular 3D Object Detection

no code yet • NeurIPS 2023

Moreover, we introduce an auxiliary head for object-wise depth estimation, which enhances depth quality while maintaining the inference time.

Paper
Add Code

Rotation Matters: Generalized Monocular 3D Object Detection for Various Camera Systems

no code yet • 9 Oct 2023

In this paper, we conduct extensive experiments to analyze the factors that cause performance degradation.

Paper
Add Code

Every Dataset Counts: Scaling up Monocular 3D Object Detection with Joint Datasets Training

no code yet • 2 Oct 2023

Monocular 3D object detection plays a crucial role in autonomous driving.

Paper
Add Code

Monocular 3D Object Detection

Benchmarks Add a Result

Libraries

Datasets

Latest papers with no code

Content

Benchmarks

Add a Result