3D Object Detection From Monocular Images

11 papers with code • 3 benchmarks • 3 datasets

This is the task of detecting 3D objects from monocular images (as opposed to LiDAR based counterparts). It is usually associated with autonomous driving based tasks.

( Image credit: Orthographic Feature Transform for Monocular 3D Object Detection )

Benchmarks

Add a Result

These leaderboards are used to track progress in 3D Object Detection From Monocular Images

Dataset	Best Model	Compare
KITTI-360	SeaBird + PanopticBEV	See all
Waymo Open Dataset	DEVIANT	See all
nuScenes Cars	MonoDIS	See all

Datasets

Most implemented papers

Most implemented Social Latest No code

Deep Hough Voting for 3D Object Detection in Point Clouds

facebookresearch/votenet • • ICCV 2019

Current 3D object detection methods are heavily influenced by 2D detectors.

Paper
Code

M3D-RPN: Monocular 3D Region Proposal Network for Object Detection

garrickbrazil/M3D-RPN • • ICCV 2019

Understanding the world in 3D is a critical component of urban autonomous driving.

Paper
Code

DEVIANT: Depth EquiVarIAnt NeTwork for Monocular 3D Object Detection

abhi1kumar/deviant • • 21 Jul 2022

As a result, DEVIANT is equivariant to the depth translations in the projective manifold whereas vanilla networks are not.

Paper
Code

Orthographic Feature Transform for Monocular 3D Object Detection

tom-roddick/oft • • 20 Nov 2018

This allows us to reason holistically about the spatial configuration of the scene in a domain where scale is consistent and distances between objects are meaningful.

Paper
Code

Delving into Localization Errors for Monocular 3D Object Detection

xinzhuma/monodle • • CVPR 2021

Estimating 3D bounding boxes from monocular images is an essential component in autonomous driving, while accurate 3D object detection from this kind of data is very challenging.

Paper
Code

GrooMeD-NMS: Grouped Mathematically Differentiable NMS for Monocular 3D Object Detection

abhi1kumar/groomed_nms • • CVPR 2021

In this paper, we present and integrate GrooMeD-NMS -- a novel Grouped Mathematically Differentiable NMS for monocular 3D object detection, such that the network is trained end-to-end with a loss on the boxes after NMS.

Paper
Code

Geometry Uncertainty Projection Network for Monocular 3D Object Detection

supermhp/gupnet • • ICCV 2021

In this paper, we propose a Geometry Uncertainty Projection Network (GUP Net) to tackle the error amplification problem at both inference and training stages.

Paper
Code