TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Pedestrian Detection	Caltech	CSP + CityPersons dataset	Reasonable Miss Rate	3.8	# 8
Pedestrian Detection	Caltech	CSP	Reasonable Miss Rate	4.5	# 11
Pedestrian Detection	CityPersons	CSP (with offset) + ResNet-50	Reasonable MR^-2	11.0	# 14
Pedestrian Detection	CityPersons	CSP (with offset) + ResNet-50	Heavy MR^-2	49.3	# 12
Pedestrian Detection	CityPersons	CSP (with offset) + ResNet-50	Partial MR^-2	10.4	# 6
Pedestrian Detection	CityPersons	CSP (with offset) + ResNet-50	Bare MR^-2	7.3	# 8
Pedestrian Detection	CityPersons	CSP (with offset) + ResNet-50	Small MR^-2	16.0	# 8
Pedestrian Detection	CityPersons	CSP (with offset) + ResNet-50	Medium MR^-2	3.7	# 2
Pedestrian Detection	CityPersons	CSP (with offset) + ResNet-50	Large MR^-2	6.5	# 2
Pedestrian Detection	CityPersons	CSP (with offset) + ResNet-50	Test Time	0.33s/img	# 4
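
The MR^-2 columns are not defined on this page. In the pedestrian detection literature the name usually denotes the log-average miss rate over false positives per image (FPPI) in the range [10^-2, 10^0], sampled at nine points evenly spaced in log space. The sketch below assumes that convention; the function name `log_average_miss_rate` and its arguments are hypothetical, and it is not the evaluation code behind the numbers above.

```python
import math

def log_average_miss_rate(fppi, miss_rate, num_points=9):
    """Log-average miss rate over FPPI in [1e-2, 1e0] (assumed MR^-2 convention).

    fppi and miss_rate describe one detector curve, sorted by ascending FPPI.
    """
    # Reference FPPI values, evenly spaced in log space between 1e-2 and 1e0.
    refs = [10 ** (-2 + 2 * i / (num_points - 1)) for i in range(num_points)]
    sampled = []
    for ref in refs:
        # Assumption: if the curve has no point at or below this FPPI,
        # count the miss rate there as 1.0.
        mr_at_ref = 1.0
        for f, m in zip(fppi, miss_rate):
            if f <= ref:
                mr_at_ref = m  # ascending FPPI, so the last match wins
        # Clamp to avoid log(0) for a perfect detector.
        sampled.append(max(mr_at_ref, 1e-10))
    # Geometric mean of the sampled miss rates.
    return math.exp(sum(math.log(m) for m in sampled) / len(sampled))
```

Under this reading, a table entry such as 11.0 would correspond to a returned value of 0.11, since the table appears to report percentages.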

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/high-level-semantic-feature-detectiona-new/pedestrian-detection-on-caltech)](https://paperswithcode.com/sota/pedestrian-detection-on-caltech?p=high-level-semantic-feature-detectiona-new)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/high-level-semantic-feature-detectiona-new/pedestrian-detection-on-citypersons)](https://paperswithcode.com/sota/pedestrian-detection-on-citypersons?p=high-level-semantic-feature-detectiona-new)`

Center and Scale Prediction: Anchor-free Approach for Pedestrian and Face Detection

CVPR 2019 · Wei Liu, Irtiza Hasan, Shengcai Liao

Object detection has traditionally relied on sliding-window classifiers and, in modern deep learning approaches, on anchor-box-based predictions. Both require tedious box configuration. In this paper, we offer a new perspective in which object detection is framed as a high-level semantic feature detection task. Like detectors for edges, corners, blobs, and other features, the proposed detector scans the whole image for feature points, a task to which convolution is naturally suited. Unlike these traditional low-level features, however, the proposed detector targets a higher level of abstraction: we look for the central points of objects, and modern deep models are already capable of such high-level semantic abstraction. In addition, as in blob detection, we predict the scale of each central point, which is also a straightforward convolution. Pedestrian and face detection is thus simplified to a center and scale prediction task performed through convolutions, which gives the proposed method a box-free setting. Though structurally simple, it achieves competitive accuracy on several challenging pedestrian and face detection benchmarks. Furthermore, a cross-dataset evaluation demonstrates the superior generalization ability of the proposed method. Code and models are available at https://github.com/liuwei16/CSP and https://github.com/hasanirtiza/Pedestron.
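
To make the center and scale idea concrete, the following is a minimal sketch of how a center heatmap and a scale map might be decoded into boxes. It is not the authors' implementation. The function name `decode_centers`, the stride of 4, the log-height encoding of scale, the fixed width-to-height ratio of 0.41, and the score threshold are all assumptions made for illustration, and a 3x3 local-maximum check stands in for the non-maximum suppression a real detector would use.

```python
import math

def decode_centers(center_map, log_height_map, stride=4,
                   aspect_ratio=0.41, score_thresh=0.5):
    """Decode boxes (x1, y1, x2, y2, score) from per-cell predictions.

    center_map[r][c]     : probability that cell (r, c) is an object center
    log_height_map[r][c] : assumed log of the object height in feature-map cells
    """
    rows, cols = len(center_map), len(center_map[0])
    boxes = []
    for r in range(rows):
        for c in range(cols):
            score = center_map[r][c]
            if score < score_thresh:
                continue
            # Keep only peaks: the cell must be >= every in-bounds 3x3 neighbour.
            is_peak = all(
                score >= center_map[rr][cc]
                for rr in range(max(0, r - 1), min(rows, r + 2))
                for cc in range(max(0, c - 1), min(cols, c + 2))
            )
            if not is_peak:
                continue
            # Scale: map the predicted log-height back to image pixels.
            height = math.exp(log_height_map[r][c]) * stride
            # Width follows from the assumed fixed aspect ratio.
            width = aspect_ratio * height
            # Center of the cell in image coordinates.
            cx = (c + 0.5) * stride
            cy = (r + 0.5) * stride
            boxes.append((cx - width / 2, cy - height / 2,
                          cx + width / 2, cy + height / 2, score))
    return boxes
```

No anchor boxes appear anywhere in this decoding: each box comes from one center location and one scale value, which is the sense in which the abstract calls the setting box-free.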

Code

liuwei16/CSP official

748

hasanirtiza/Pedestron official

677

Tasks

Face Detection

Object Detection

Pedestrian Detection

Datasets

ImageNet

ssd

WIDER FACE

FDDB

CrowdHuman

CityPersons

Results from the Paper

Ranked #8 on Pedestrian Detection on Caltech (using extra training data)

Methods

1x1 Convolution • Average Pooling • Batch Normalization • Bottleneck Residual Block • Convolution • Global Average Pooling • Kaiming Initialization • Max Pooling • ReLU • Residual Block • Residual Connection • ResNet
