TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Video Saliency Detection	MSU Video Saliency Prediction	DINet	SIM	0.592	# 6
Video Saliency Detection	MSU Video Saliency Prediction	DINet	CC	0.671	# 6
Video Saliency Detection	MSU Video Saliency Prediction	DINet	NSS	1.85	# 5
Video Saliency Detection	MSU Video Saliency Prediction	DINet	AUC-J	0.858	# 2
Video Saliency Detection	MSU Video Saliency Prediction	DINet	KLDiv	0.575	# 6
Video Saliency Detection	MSU Video Saliency Prediction	DINet	FPS	4.85	# 4

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/a-dilated-inception-network-for-visual/video-saliency-detection-on-msu-video)](https://paperswithcode.com/sota/video-saliency-detection-on-msu-video?p=a-dilated-inception-network-for-visual)`

A Dilated Inception Network for Visual Saliency Prediction

7 Apr 2019 · Sheng Yang, Guosheng Lin, Qiuping Jiang, Weisi Lin ·

Recently, with the advent of deep convolutional neural networks (DCNN), the improvements in visual saliency prediction research are impressive. One possible direction to approach the next improvement is to fully characterize the multi-scale saliency-influential factors with a computationally-friendly module in DCNN architectures. In this work, we proposed an end-to-end dilated inception network (DINet) for visual saliency prediction. It captures multi-scale contextual features effectively with very limited extra parameters. Instead of utilizing parallel standard convolutions with different kernel sizes as the existing inception module, our proposed dilated inception module (DIM) uses parallel dilated convolutions with different dilation rates which can significantly reduce the computation load while enriching the diversity of receptive fields in feature maps. Moreover, the performance of our saliency model is further improved by using a set of linear normalization-based probability distribution distance metrics as loss functions. As such, we can formulate saliency prediction as a probability distribution prediction task for global saliency inference instead of a typical pixel-wise regression problem. Experimental results on several challenging saliency benchmark datasets demonstrate that our DINet with proposed loss functions can achieve state-of-the-art performance with shorter inference time.

PDF Abstract

Code

Add Remove Mark official

ysyscool/DINet official

Tasks

Add Remove

Saliency Prediction

Video Saliency Detection

Datasets

Places205

SALICON

MSU Video Saliency Prediction

Results from the Paper

Add Remove

Ranked #6 on Video Saliency Detection on MSU Video Saliency Prediction

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Video Saliency Detection	MSU Video Saliency Prediction	DINet	SIM	0.592	# 6	Compare
			CC	0.671	# 6	Compare
			NSS	1.85	# 5	Compare
			AUC-J	0.858	# 2	Compare
			KLDiv	0.575	# 6	Compare
			FPS	4.85	# 4	Compare

Methods

Add Remove

1x1 Convolution • Convolution • DCNN • Inception Module • Max Pooling

Edit Social Preview

A Dilated Inception Network for Visual Saliency Prediction

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit Add Remove

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Add Remove

Methods

Add Remove