FocalFormer3D : Focusing on Hard Instance for 3D Object Detection

8 Aug 2023  ·  Yilun Chen, Zhiding Yu, Yukang Chen, Shiyi Lan, Animashree Anandkumar, Jiaya Jia, Jose Alvarez ·

False negatives (FN) in 3D object detection, {\em e.g.}, missing predictions of pedestrians, vehicles, or other obstacles, can lead to potentially dangerous situations in autonomous driving. While being fatal, this issue is understudied in many current 3D detection methods. In this work, we propose Hard Instance Probing (HIP), a general pipeline that identifies \textit{FN} in a multi-stage manner and guides the models to focus on excavating difficult instances. For 3D object detection, we instantiate this method as FocalFormer3D, a simple yet effective detector that excels at excavating difficult objects and improving prediction recall. FocalFormer3D features a multi-stage query generation to discover hard objects and a box-level transformer decoder to efficiently distinguish objects from massive object candidates. Experimental results on the nuScenes and Waymo datasets validate the superior performance of FocalFormer3D. The advantage leads to strong performance on both detection and tracking, in both LiDAR and multi-modal settings. Notably, FocalFormer3D achieves a 70.5 mAP and 73.9 NDS on nuScenes detection benchmark, while the nuScenes tracking benchmark shows 72.1 AMOTA, both ranking 1st place on the nuScenes LiDAR leaderboard. Our code is available at \url{https://github.com/NVlabs/FocalFormer3D}.

PDF Abstract

Datasets


Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
3D Object Detection nuScenes FocalFormer3D-F NDS 0.75 # 8
mAP 0.72 # 11
mATE 0.25 # 304
mASE 0.24 # 213
mAOE 0.33 # 298
mAVE 0.23 # 319
mAAE 0.13 # 103
3D Object Detection nuScenes FocalFormer3D-TTA NDS 0.74 # 15
mAP 0.71 # 18
mATE 0.24 # 335
mASE 0.24 # 213
mAOE 0.32 # 312
mAVE 0.20 # 352
mAAE 0.13 # 103
3D Object Detection nuScenes FocalFormer3D-L NDS 0.73 # 25
mAP 0.69 # 41
mATE 0.25 # 304
mASE 0.24 # 213
mAOE 0.34 # 275
mAVE 0.22 # 330
mAAE 0.13 # 103

Methods


No methods listed for this paper. Add relevant methods here