TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Video Summarization	SumMe	M-AVS	F1-score (Canonical)	44.4	# 5
Video Summarization	SumMe	M-AVS	F1-score (Augmented)	46.1	# 4
Video Summarization	TvSum	M-AVS	F1-score (Canonical)	61.0	# 4
Video Summarization	TvSum	M-AVS	F1-score (Augmented)	61.8	# 4

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/video-summarization-with-attention-based/video-summarization-on-tvsum)](https://paperswithcode.com/sota/video-summarization-on-tvsum?p=video-summarization-with-attention-based)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/video-summarization-with-attention-based/video-summarization-on-summe)](https://paperswithcode.com/sota/video-summarization-on-summe?p=video-summarization-with-attention-based)`

Video Summarization with Attention-Based Encoder-Decoder Networks

31 Aug 2017 · Zhong Ji, Kailin Xiong, Yanwei Pang, Xuelong. Li ·

This paper addresses the problem of supervised video summarization by formulating it as a sequence-to-sequence learning problem, where the input is a sequence of original video frames, the output is a keyshot sequence. Our key idea is to learn a deep summarization network with attention mechanism to mimic the way of selecting the keyshots of human. To this end, we propose a novel video summarization framework named Attentive encoder-decoder networks for Video Summarization (AVS), in which the encoder uses a Bidirectional Long Short-Term Memory (BiLSTM) to encode the contextual information among the input video frames. As for the decoder, two attention-based LSTM networks are explored by using additive and multiplicative objective functions, respectively. Extensive experiments are conducted on three video summarization benchmark datasets, i.e., SumMe, and TVSum. The results demonstrate the superiority of the proposed AVS-based approaches against the state-of-the-art approaches,with remarkable improvements from 0.8% to 3% on two datasets,respectively..

PDF Abstract

Code

Add Remove Mark official

No code implementations yet. Submit your code now

Tasks

Add Remove

Supervised Video Summarization

Video Summarization

Datasets

TVSum

SumMe

Results from the Paper

Edit

Ranked #4 on Video Summarization on TvSum (using extra training data)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Video Summarization	SumMe	M-AVS	F1-score (Canonical)	44.4	# 5	Compare
Video Summarization	SumMe	M-AVS	F1-score (Augmented)	46.1	# 4	Compare
Video Summarization	TvSum	M-AVS	F1-score (Canonical)	61.0	# 4	Compare
Video Summarization	TvSum	M-AVS	F1-score (Augmented)	61.8	# 4	Compare

Methods

Add Remove

LSTM • Sigmoid Activation • Tanh Activation

Edit Social Preview

Video Summarization with Attention-Based Encoder-Decoder Networks

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove