TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Monocular 3D Object Detection	KITTI Cars Hard	CubifAE-3D	AP Hard	6.42	# 6
Monocular 3D Object Detection	KITTI Cars Moderate	CubifAE-3D	AP Medium	7.94	# 25
Monocular 3D Object Detection	KITTI Pedestrian Hard	CubifAE-3D	AP Hard	4.82	# 4
Monocular 3D Object Detection	KITTI Pedestrians Moderate val	CubifAE-3D	AP Medium	5.43	# 1
Monocular 3D Object Detection	Virtual KITTI 2	CubifAE-3D	mAP@0.3	86.6	# 1
Monocular 3D Object Detection	Virtual KITTI 2	CubifAE-3D	mAP@0.5	66.7	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/cubifae-3d-monocular-camera-space/monocular-3d-object-detection-on-kitti)](https://paperswithcode.com/sota/monocular-3d-object-detection-on-kitti?p=cubifae-3d-monocular-camera-space)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/cubifae-3d-monocular-camera-space/monocular-3d-object-detection-on-virtual)](https://paperswithcode.com/sota/monocular-3d-object-detection-on-virtual?p=cubifae-3d-monocular-camera-space)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/cubifae-3d-monocular-camera-space/monocular-3d-object-detection-on-kitti-1)](https://paperswithcode.com/sota/monocular-3d-object-detection-on-kitti-1?p=cubifae-3d-monocular-camera-space)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/cubifae-3d-monocular-camera-space/monocular-3d-object-detection-on-kitti-cars-1)](https://paperswithcode.com/sota/monocular-3d-object-detection-on-kitti-cars-1?p=cubifae-3d-monocular-camera-space)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/cubifae-3d-monocular-camera-space/monocular-3d-object-detection-on-kitti-cars)](https://paperswithcode.com/sota/monocular-3d-object-detection-on-kitti-cars?p=cubifae-3d-monocular-camera-space)`

CubifAE-3D: Monocular Camera Space Cubification for Auto-Encoder based 3D Object Detection

7 Jun 2020 · Shubham Shrivastava, Punarjay Chakravarty

We introduce a method for 3D object detection using a single monocular image. Starting from a synthetic dataset, we pre-train an RGB-to-Depth Auto-Encoder (AE). The embedding learnt from this AE is then used to train a 3D Object Detector (3DOD) CNN, which regresses the parameters of 3D object poses from the latent embedding that the AE encoder generates from the RGB image. We show that we can pre-train the AE once using paired RGB and depth images from simulation data, and subsequently train only the 3DOD network using real data comprising RGB images and 3D object pose labels (without the requirement of dense depth). Our 3DOD network utilizes a particular 'cubification' of 3D space around the camera, where each cuboid is tasked with predicting N object poses, along with their class and confidence values. The AE pre-training and this method of dividing the 3D space around the camera into cuboids give our method its name: CubifAE-3D. We demonstrate results for monocular 3D object detection in the Autonomous Vehicle (AV) use-case with the Virtual KITTI 2 and the KITTI datasets.
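The 'cubification' idea in the abstract can be illustrated with a minimal sketch: camera space is divided into a regular grid of cuboids, and each object center is assigned to the cuboid that contains it, with at most N objects per cuboid. The extent, grid resolution, axis convention, value of N, and the names `BOUNDS`, `GRID`, `N_PER_CUBOID`, `cuboid_index`, and `assign_objects` are all assumptions made for illustration; the abstract does not specify them, and this is not the authors' implementation.

```python
from collections import defaultdict

# Hypothetical camera-space extent in metres (x right, y down, z forward)
# and grid resolution. Illustrative values only, not taken from the paper.
BOUNDS = ((-40.0, 40.0), (-5.0, 5.0), (0.0, 100.0))
GRID = (4, 1, 4)       # number of cuboids along x, y, z
N_PER_CUBOID = 2       # "N" object slots per cuboid (assumed value)


def cuboid_index(center, bounds=BOUNDS, grid=GRID):
    """Return the (i, j, k) cuboid containing `center`, or None if outside."""
    index = []
    for value, (lo, hi), cells in zip(center, bounds, grid):
        if not lo <= value < hi:
            return None
        cell = int((value - lo) / (hi - lo) * cells)
        index.append(min(cell, cells - 1))  # guard against float rounding
    return tuple(index)


def assign_objects(centers, bounds=BOUNDS, grid=GRID, n=N_PER_CUBOID):
    """Group object centers by cuboid, keeping at most `n` per cuboid."""
    slots = defaultdict(list)
    for center in centers:
        idx = cuboid_index(center, bounds, grid)
        if idx is not None and len(slots[idx]) < n:
            slots[idx].append(center)
    return dict(slots)
```

In a sketch like this, a detector head would emit a fixed number of pose, class, and confidence predictions per cuboid, and a mapping of this kind would decide which ground-truth object supervises which slot. How the paper handles objects beyond the N-th in a cuboid is not stated in the abstract; here they are simply dropped.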

Code

No code implementations yet.

Tasks

3D Object Detection

3D Object Detection From Monocular Images

Autonomous Vehicles

Monocular 3D Object Detection

Object

Object Detection

Datasets

KITTI

nuScenes

Virtual KITTI

Virtual KITTI 2

Results from the Paper

Ranked #1 on Monocular 3D Object Detection on KITTI Pedestrians Moderate val
