PointPillars: Fast Encoders for Object Detection from Point Clouds

Object detection in point clouds is an important aspect of many robotics applications such as autonomous driving. In this paper we consider the problem of encoding a point cloud into a format appropriate for a downstream detection pipeline. Recent literature suggests two types of encoders; fixed encoders tend to be fast but sacrifice accuracy, while encoders that are learned from data are more accurate, but slower. In this work we propose PointPillars, a novel encoder which utilizes PointNets to learn a representation of point clouds organized in vertical columns (pillars). While the encoded features can be used with any standard 2D convolutional detection architecture, we further propose a lean downstream network. Extensive experimentation shows that PointPillars outperforms previous encoders with respect to both speed and accuracy by a large margin. Despite only using lidar, our full detection pipeline significantly outperforms the state of the art, even among fusion methods, with respect to both the 3D and bird's eye view KITTI benchmarks. This detection performance is achieved while running at 62 Hz: a 2 - 4 fold runtime improvement. A faster version of our method matches the state of the art at 105 Hz. These benchmarks suggest that PointPillars is an appropriate encoding for object detection in point clouds.

PDF Abstract CVPR 2019 PDF CVPR 2019 Abstract
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
3D Object Detection DAIR-V2X-I PointPillars AP|R40(moderate) 54.0 # 6
AP|R40(easy) 63.1 # 7
AP|R40(hard) 54.0 # 6
Robust 3D Object Detection KITTI-C PointPillars mean Corruption Error (mCE) 110.67% # 5
Robust 3D Object Detection KITTI-C SECOND mean Corruption Error (mCE) 95.93% # 3
Birds Eye View Object Detection KITTI Cars Easy PointPillars AP 88.35% # 8
3D Object Detection KITTI Cars Easy PointPillars AP 79.05% # 23
Birds Eye View Object Detection KITTI Cars Hard PointPillars AP 79.83 # 5
3D Object Detection KITTI Cars Moderate PointPillars AP 74.99% # 22
Birds Eye View Object Detection KITTI Cars Moderate PointPillars AP 86.1% # 6
3D Object Detection KITTI Cyclists Easy PointPillars AP 75.78% # 7
3D Object Detection KITTI Cyclists Hard PointPillars AP 52.92% # 8
3D Object Detection KITTI Cyclists Moderate PointPillars AP 59.07% # 8
Birds Eye View Object Detection KITTI Cyclists Moderate PointPillars AP 62.25% # 3
3D Object Detection KITTI Pedestrians Moderate PointPillars AP 41.92% # 10
Birds Eye View Object Detection KITTI Pedestrians Moderate PointPillars AP 50.23% # 4
3D Object Detection nuScenes LiDAR only PointPillar NDS 45.3 # 6
mAP 30.5 # 6

Methods