Learning Efficient Single-stage Pedestrian Detectors by Asymptotic Localization Fitting

Though Faster R-CNN based two-stage detectors have witnessed significant boost in pedestrian detection accuracy, it is still slow for practical applications. One solution is to simplify this working flow as a single-stage detector. However, current single-stage detectors (e.g. SSD) have not presented competitive accuracy on common pedestrian detection benchmarks. This paper is towards a successful pedestrian detector enjoying the speed of SSD while maintaining the accuracy of Faster R-CNN. Specifically, a structurally simple but effective module called emph{Asymptotic Localization Fitting} (ALF) is proposed, which stacks a series of predictors to directly evolve the default anchor boxes of SSD step by step into improving detection results. As a result, during training the latter predictors enjoy more and better-quality positive samples, meanwhile harder negatives could be mined with increasing IoU thresholds. On top of this, an efficient single-stage pedestrian detection architecture (denoted as ALFNet) is designed, achieving state-of-the-art performance on CityPersons and Caltech, two of the largest pedestrian detection benchmarks, and hence resulting in an attractive pedestrian detector in both accuracy and speed. Code is available at href{https://github.com/VideoObjectSearch/ALFNet}{https://github.com/VideoObjectSearch/ALFNet}.

PDF Abstract

Datasets


Results from the Paper


Ranked #11 on Pedestrian Detection on Caltech (using extra training data)

     Get a GitHub badge
Task Dataset Model Metric Name Metric Value Global Rank Uses Extra
Training Data
Benchmark
Pedestrian Detection Caltech ALFNet + CityPersons dataset Reasonable Miss Rate 4.5 # 11
Pedestrian Detection Caltech ALFNet Reasonable Miss Rate 6.1 # 17

Results from Other Papers


Task Dataset Model Metric Name Metric Value Rank Uses Extra
Training Data
Source Paper Compare
Pedestrian Detection CityPersons ALFNet Reasonable MR^-2 12.0 # 15
Heavy MR^-2 51.9 # 13
Partial MR^-2 11.4 # 8
Bare MR^-2 8.4 # 10
Small MR^-2 19.0 # 9
Medium MR^-2 5.7 # 3
Large MR^-2 6.6 # 3
Test Time 0.27 # 3

Methods