TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
3D Object Detection	nuScenes	BEVDet4D	NDS	0.569	# 227
3D Object Detection	nuScenes	BEVDet4D	mAP	0.451	# 234
3D Object Detection	nuScenes	BEVDet4D	mATE	0.511	# 140
3D Object Detection	nuScenes	BEVDet4D	mASE	0.241	# 212
3D Object Detection	nuScenes	BEVDet4D	mAOE	0.386	# 166
3D Object Detection	nuScenes	BEVDet4D	mAVE	0.301	# 206
3D Object Detection	nuScenes	BEVDet4D	mAAE	0.121	# 268
3D Object Detection	nuScenes Camera Only	BEVDet4D	NDS	56.9	# 17
3D Object Detection	nuScenes Camera Only	BEVDet4D	Future Frame	false	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/bevdet4d-exploit-temporal-cues-in-multi/3d-object-detection-on-nuscenes-camera-only)](https://paperswithcode.com/sota/3d-object-detection-on-nuscenes-camera-only?p=bevdet4d-exploit-temporal-cues-in-multi)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/bevdet4d-exploit-temporal-cues-in-multi/3d-object-detection-on-nuscenes)](https://paperswithcode.com/sota/3d-object-detection-on-nuscenes?p=bevdet4d-exploit-temporal-cues-in-multi)`

BEVDet4D: Exploit Temporal Cues in Multi-camera 3D Object Detection

31 Mar 2022 · JunJie Huang, Guan Huang ·

Single frame data contains finite information which limits the performance of the existing vision-based multi-camera 3D object detection paradigms. For fundamentally pushing the performance boundary in this area, a novel paradigm dubbed BEVDet4D is proposed to lift the scalable BEVDet paradigm from the spatial-only 3D space to the spatial-temporal 4D space. We upgrade the naive BEVDet framework with a few modifications just for fusing the feature from the previous frame with the corresponding one in the current frame. In this way, with negligible additional computing budget, we enable BEVDet4D to access the temporal cues by querying and comparing the two candidate features. Beyond this, we simplify the task of velocity prediction by removing the factors of ego-motion and time in the learning target. As a result, BEVDet4D with robust generalization performance reduces the velocity error by up to -62.9%. This makes the vision-based methods, for the first time, become comparable with those relied on LiDAR or radar in this aspect. On challenge benchmark nuScenes, we report a new record of 54.5% NDS with the high-performance configuration dubbed BEVDet4D-Base, which surpasses the previous leading method BEVDet-Base by +7.3% NDS. The source code is publicly available for further research at https://github.com/HuangJunJie2017/BEVDet .

PDF Abstract

Code

Add Remove Mark official

HuangJunJie2017/BEVDet official

1,261

Tasks

Add Remove

3D Object Detection

object-detection

Object Detection

Datasets

nuScenes

Results from the Paper

Edit

Ranked #17 on 3D Object Detection on nuScenes Camera Only

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
3D Object Detection	nuScenes	BEVDet4D	NDS	0.569	# 227	Compare
			mAP	0.451	# 234	Compare
			mATE	0.511	# 140	Compare
			mASE	0.241	# 212	Compare
			mAOE	0.386	# 166	Compare
			mAVE	0.301	# 206	Compare
			mAAE	0.121	# 268	Compare
3D Object Detection	nuScenes Camera Only	BEVDet4D	NDS	56.9	# 17	Compare
3D Object Detection	nuScenes Camera Only	BEVDet4D	Future Frame	false	# 1	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

BEVDet4D: Exploit Temporal Cues in Multi-camera 3D Object Detection

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove