TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Video Saliency Detection	MSU Video Saliency Prediction	GASP	SIM	0.557	# 10
Video Saliency Detection	MSU Video Saliency Prediction	GASP	CC	0.613	# 10
Video Saliency Detection	MSU Video Saliency Prediction	GASP	NSS	1.57	# 10
Video Saliency Detection	MSU Video Saliency Prediction	GASP	AUC-J	0.810	# 12
Video Saliency Detection	MSU Video Saliency Prediction	GASP	KLDiv	0.687	# 10
Video Saliency Detection	MSU Video Saliency Prediction	GASP	FPS	3.77	# 6

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/gasp-gated-attention-for-saliency-prediction-1/video-saliency-detection-on-msu-video)](https://paperswithcode.com/sota/video-saliency-detection-on-msu-video?p=gasp-gated-attention-for-saliency-prediction-1)`

GASP: Gated Attention For Saliency Prediction

International Joint Conference on Artificial Intelligence 2021 · Fares Abawi, Tom Weber, Stefan Wermter ·

Saliency prediction refers to the computational task of modeling overt attention. Social cues greatly influence our attention, consequently altering our eye movements and behavior. To emphasize the efficacy of such features, we present a neural model for integrating social cues and weighting their influences. Our model consists of two stages. During the first stage, we detect two social cues by following gaze, estimating gaze direction, and recognizing affect. These features are then transformed into spatiotemporal maps through image processing operations. The transformed representations are propagated to the second stage (GASP) where we explore various techniques of late fusion for integrating social cues and introduce two sub-networks for directing attention to relevant stimuli. Our experiments indicate that fusion approaches achieve better results for static integration methods, whereas non-fusion approaches for which the influence of each modality is unknown, result in better outcomes when coupled with recurrent models for dynamic saliency prediction. We show that gaze direction and affective representations contribute a prediction to ground-truth correspondence improvement of at least 5% compared to dynamic saliency models without social cues. Furthermore, affective representations improve GASP, supporting the necessity of considering affect-biased attention in predicting saliency.

PDF Abstract International Joint 2021 PDF International Joint 2021 Abstract

Code

Add Remove Mark official

knowledgetechnologyuhh/gasp official

Tasks

Add Remove

Saliency Prediction

Video Saliency Detection

Video Saliency Prediction

Datasets

AVE

MSU Video Saliency Prediction

Results from the Paper

Edit

Ranked #10 on Video Saliency Detection on MSU Video Saliency Prediction

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Video Saliency Detection	MSU Video Saliency Prediction	GASP	SIM	0.557	# 10	Compare
			CC	0.613	# 10	Compare
			NSS	1.57	# 10	Compare
			AUC-J	0.810	# 12	Compare
			KLDiv	0.687	# 10	Compare
			FPS	3.77	# 6	Compare

Methods

Add Remove

1x1 Convolution • Attention Gate • Average Pooling • Convolution • Dense Connections • Gated Convolution • GLU • LSTM • ReLU • Sigmoid Activation • Squeeze-and-Excitation Block • Tanh Activation

Edit Social Preview

GASP: Gated Attention For Saliency Prediction

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove