TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Object Detection	COCO test-dev	SNIPER (ResNet-101)	box mAP	46.1	# 124
Object Detection	COCO test-dev	SNIPER (ResNet-101)	AP50	67.0	# 64
Object Detection	COCO test-dev	SNIPER (ResNet-101)	AP75	51.6	# 61
Object Detection	COCO test-dev	SNIPER (ResNet-101)	APS	29.6	# 54
Object Detection	COCO test-dev	SNIPER (ResNet-101)	APM	48.9	# 68
Object Detection	COCO test-dev	SNIPER (ResNet-101)	APL	58.1	# 68
Object Detection	COCO test-dev	SNIPER (ResNet-101)	Hardware Burden	29G	# 1
Object Detection	COCO test-dev	SNIPER (ResNet-101)	Operations per network pass	None	# 1
Object Detection	COCO test-dev	SNIPER (ResNet-50)	box mAP	43.5	# 152
Object Detection	COCO test-dev	SNIPER (ResNet-50)	AP50	65.0	# 77
Object Detection	COCO test-dev	SNIPER (ResNet-50)	AP75	48.6	# 83
Object Detection	COCO test-dev	SNIPER (ResNet-50)	APS	26.1	# 86
Object Detection	COCO test-dev	SNIPER (ResNet-50)	APM	46.3	# 98
Object Detection	COCO test-dev	SNIPER (ResNet-50)	APL	56.0	# 91
Object Detection	COCO test-dev	SNIPER (ResNet-50)	Hardware Burden	29G	# 1
Object Detection	COCO test-dev	SNIPER (ResNet-50)	Operations per network pass	None	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/sniper-efficient-multi-scale-training/object-detection-on-coco)](https://paperswithcode.com/sota/object-detection-on-coco?p=sniper-efficient-multi-scale-training)`

SNIPER: Efficient Multi-Scale Training

NeurIPS 2018 · Bharat Singh, Mahyar Najibi, Larry S. Davis ·

We present SNIPER, an algorithm for performing efficient multi-scale training in instance level visual recognition tasks. Instead of processing every pixel in an image pyramid, SNIPER processes context regions around ground-truth instances (referred to as chips) at the appropriate scale. For background sampling, these context-regions are generated using proposals extracted from a region proposal network trained with a short learning schedule. Hence, the number of chips generated per image during training adaptively changes based on the scene complexity. SNIPER only processes 30% more pixels compared to the commonly used single scale training at 800x1333 pixels on the COCO dataset. But, it also observes samples from extreme resolutions of the image pyramid, like 1400x2000 pixels. As SNIPER operates on resampled low resolution chips (512x512 pixels), it can have a batch size as large as 20 on a single GPU even with a ResNet-101 backbone. Therefore it can benefit from batch-normalization during training without the need for synchronizing batch-normalization statistics across GPUs. SNIPER brings training of instance level recognition tasks like object detection closer to the protocol for image classification and suggests that the commonly accepted guideline that it is important to train on high resolution images for instance level visual recognition tasks might not be correct. Our implementation based on Faster-RCNN with a ResNet-101 backbone obtains an mAP of 47.6% on the COCO dataset for bounding box detection and can process 5 images per second during inference with a single GPU. Code is available at https://github.com/MahyarNajibi/SNIPER/.

PDF Abstract NeurIPS 2018 PDF NeurIPS 2018 Abstract

Code

Add Remove Mark official

MahyarNajibi/SNIPER official

2,681

PaddlePaddle/PaddleDetection

12,128

Hwang64/PSIS

starimpact/arm_SNIPER

Tasks

Add Remove

object-detection

Object Detection

Region Proposal

Datasets

MS COCO

Results from the Paper

Edit

Ranked #124 on Object Detection on COCO test-dev

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Object Detection	COCO test-dev	SNIPER (ResNet-101)	box mAP	46.1	# 124	Compare
			AP50	67.0	# 64	Compare
			AP75	51.6	# 61	Compare
			APS	29.6	# 54	Compare
			APM	48.9	# 68	Compare
			APL	58.1	# 68	Compare
			Hardware Burden	29G	# 1	Compare
			Operations per network pass	None	# 1	Compare
Object Detection	COCO test-dev	SNIPER (ResNet-50)	box mAP	43.5	# 152	Compare
			AP50	65.0	# 77	Compare
			AP75	48.6	# 83	Compare
			APS	26.1	# 86	Compare
			APM	46.3	# 98	Compare
			APL	56.0	# 91	Compare
			Hardware Burden	29G	# 1	Compare
			Operations per network pass	None	# 1	Compare

Methods

Add Remove

1x1 Convolution • Average Pooling • Batch Normalization • Bottleneck Residual Block • Convolution • Faster R-CNN • Global Average Pooling • Kaiming Initialization • Max Pooling • ReLU • Residual Block • Residual Connection • ResNet • RoIPool • RPN • SNIPER • Softmax • Weight Decay

Edit Social Preview

SNIPER: Efficient Multi-Scale Training

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove