TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
3D Object Detection	DAIR-V2X-I	ImVoxelNet	AP\|R40(moderate)	37.6	# 9
3D Object Detection	DAIR-V2X-I	ImVoxelNet	AP\|R40(easy)	44.8	# 9
3D Object Detection	DAIR-V2X-I	ImVoxelNet	AP\|R40(hard)	37.6	# 9
3D Object Detection	ScanNetV2	ImVoxelNet (RGB only)	mAP@0.25	48.1	# 25
3D Object Detection	ScanNetV2	ImVoxelNet (RGB only)	mAP@0.5	22.7	# 25
Monocular 3D Object Detection	SUN RGB-D	ImVoxelNet	AP@0.15 (10 / NYU-37)	42.69	# 2
Monocular 3D Object Detection	SUN RGB-D	ImVoxelNet	AP@0.15 (NYU-37)	21.08	# 2
Monocular 3D Object Detection	SUN RGB-D	ImVoxelNet	AP@0.15 (10 / PNet-30)	48.74	# 1
Room Layout Estimation	SUN RGB-D	ImVoxelNet	IoU	59.3	# 2
Room Layout Estimation	SUN RGB-D	ImVoxelNet	Camera Pitch	2.63	# 1
Room Layout Estimation	SUN RGB-D	ImVoxelNet	Camera Roll	1.96	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/imvoxelnet-image-to-voxels-projection-for/monocular-3d-object-detection-on-sun-rgb-d)](https://paperswithcode.com/sota/monocular-3d-object-detection-on-sun-rgb-d?p=imvoxelnet-image-to-voxels-projection-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/imvoxelnet-image-to-voxels-projection-for/room-layout-estimation-on-sun-rgb-d)](https://paperswithcode.com/sota/room-layout-estimation-on-sun-rgb-d?p=imvoxelnet-image-to-voxels-projection-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/imvoxelnet-image-to-voxels-projection-for/3d-object-detection-on-dair-v2x-i)](https://paperswithcode.com/sota/3d-object-detection-on-dair-v2x-i?p=imvoxelnet-image-to-voxels-projection-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/imvoxelnet-image-to-voxels-projection-for/3d-object-detection-on-scannetv2)](https://paperswithcode.com/sota/3d-object-detection-on-scannetv2?p=imvoxelnet-image-to-voxels-projection-for)`

ImVoxelNet: Image to Voxels Projection for Monocular and Multi-View General-Purpose 3D Object Detection

2 Jun 2021 · Danila Rukhovich, Anna Vorontsova, Anton Konushin ·

In this paper, we introduce the task of multi-view RGB-based 3D object detection as an end-to-end optimization problem. To address this problem, we propose ImVoxelNet, a novel fully convolutional method of 3D object detection based on monocular or multi-view RGB images. The number of monocular images in each multi-view input can variate during training and inference; actually, this number might be unique for each multi-view input. ImVoxelNet successfully handles both indoor and outdoor scenes, which makes it general-purpose. Specifically, it achieves state-of-the-art results in car detection on KITTI (monocular) and nuScenes (multi-view) benchmarks among all methods that accept RGB images. Moreover, it surpasses existing RGB-based 3D object detection methods on the SUN RGB-D dataset. On ScanNet, ImVoxelNet sets a new benchmark for multi-view 3D object detection. The source code and the trained models are available at https://github.com/saic-vul/imvoxelnet.

PDF Abstract

Code

Add Remove Mark official

saic-vul/imvoxelnet official

260

open-mmlab/mmdetection3d

4,785

chetanmreddy/voxelnet_chetan

Tasks

Add Remove

3D Object Detection

Monocular 3D Object Detection

Monocular 3D Object Detection (10 / NYU-37)

Monocular 3D Object Detection (10 / PNet-30)

Object

object-detection

Object Detection

Room Layout Estimation

Datasets

KITTI

nuScenes

ScanNet

SUN RGB-D DAIR-V2X

Results from the Paper

Edit

Ranked #2 on Monocular 3D Object Detection on SUN RGB-D

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
3D Object Detection	DAIR-V2X-I	ImVoxelNet	AP\|R40(moderate)	37.6	# 9	Compare
			AP\|R40(easy)	44.8	# 9	Compare
			AP\|R40(hard)	37.6	# 9	Compare
3D Object Detection	ScanNetV2	ImVoxelNet (RGB only)	mAP@0.25	48.1	# 25	Compare
3D Object Detection	ScanNetV2	ImVoxelNet (RGB only)	mAP@0.5	22.7	# 25	Compare
Monocular 3D Object Detection	SUN RGB-D	ImVoxelNet	AP@0.15 (10 / NYU-37)	42.69	# 2	Compare
			AP@0.15 (NYU-37)	21.08	# 2	Compare
			AP@0.15 (10 / PNet-30)	48.74	# 1	Compare
Room Layout Estimation	SUN RGB-D	ImVoxelNet	IoU	59.3	# 2	Compare
			Camera Pitch	2.63	# 1	Compare
			Camera Roll	1.96	# 1	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

ImVoxelNet: Image to Voxels Projection for Monocular and Multi-View General-Purpose 3D Object Detection

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove