No Metrics Are Perfect: Adversarial Reward Learning for Visual Storytelling

Though impressive results have been achieved in visual captioning, the task of generating abstract stories from photo streams is still a little-tapped problem. Unlike captions, stories have more expressive language styles and contain many imaginary concepts that do not appear in the images, which poses challenges to behavioral cloning algorithms. Furthermore, due to the limitations of automatic metrics in evaluating story quality, reinforcement learning methods with hand-crafted rewards also face difficulties in gaining an overall performance boost. Therefore, we propose an Adversarial REward Learning (AREL) framework to learn an implicit reward function from human demonstrations, and then optimize policy search with the learned reward function. Though automatic evaluation indicates only a slight performance boost over state-of-the-art (SOTA) methods in cloning expert behaviors, human evaluation shows that our approach achieves significant improvement in generating more human-like stories than SOTA systems.
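The abstract describes an alternating scheme: a reward model is fit to score human-written stories above policy samples, and the policy is then updated by policy search against that learned reward. The toy NumPy sketch below illustrates only this alternation, not the authors' model: the four-token vocabulary, the per-token linear reward, the bag-of-tokens policy, and all learning rates are simplifying assumptions made up for this example.

```python
import numpy as np

rng = np.random.default_rng(0)
V = 4  # toy vocabulary size (an assumption for this sketch)
T = 5  # story length in tokens

def sample_human(n):
    # Stand-in "human demonstrations": humans strongly prefer token 1.
    return rng.choice(V, size=(n, T), p=[0.05, 0.85, 0.05, 0.05])

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

policy_logits = np.zeros(V)  # policy: per-token categorical distribution
reward_w = np.zeros(V)       # learned implicit reward: one scalar per token

for step in range(500):
    human = sample_human(32)
    p = softmax(policy_logits)
    fake = rng.choice(V, size=(32, T), p=p)  # stories sampled from the policy

    # Reward-model update: raise the reward of human stories, lower the
    # reward of policy samples (the adversarial objective, at its simplest).
    grad_r = (np.bincount(human.ravel(), minlength=V) / human.size
              - np.bincount(fake.ravel(), minlength=V) / fake.size)
    reward_w += 0.1 * grad_r

    # Policy update: REINFORCE, using the learned reward as the return.
    returns = reward_w[fake].sum(axis=1)   # learned reward of each sampled story
    baseline = returns.mean()              # baseline for variance reduction
    grad_p = np.zeros(V)
    for story, R in zip(fake, returns):
        counts = np.bincount(story, minlength=V)
        grad_p += (R - baseline) * (counts - T * p)  # grad of log-prob of story
    policy_logits += 0.05 * grad_p / len(fake)

# After training, the policy's modal token matches the human-preferred token.
best_token = int(np.argmax(softmax(policy_logits)))
print(best_token)
```

The equilibrium of the alternation is a policy whose samples the reward model can no longer separate from the demonstrations, which is why no hand-crafted metric has to be specified as the reward.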

PDF Abstract ACL 2018

Datasets

VIST
Results from the Paper


Task: Visual Storytelling    Dataset: VIST    (global rank on the benchmark in parentheses)

Model        BLEU-1      BLEU-2      BLEU-3      BLEU-4      METEOR      CIDEr      ROUGE-L
GAN          62.8 (#11)  38.8 (#8)   23.0 (#11)  14.0 (#15)  35.0 (#21)  9.0 (#16)  29.5 (#22)
XE-ss        62.3 (#12)  38.2 (#10)  22.5 (#12)  13.7 (#17)  34.8 (#23)  8.7 (#18)  29.7 (#19)
AREL-t-100   63.8 (#8)   39.1 (#7)   23.2 (#9)   14.1 (#13)  35.0 (#21)  9.4 (#13)  29.5 (#22)
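The BLEU-1 column above measures unigram overlap between generated and reference stories. For intuition about what that number means, here is a minimal sentence-level BLEU-1 (clipped unigram precision times a brevity penalty); the example sentences are invented, and the benchmark scores are computed at corpus level over the whole VIST test set rather than per sentence.

```python
from collections import Counter
import math

def bleu1(reference_tokens, hypothesis_tokens):
    """Sentence-level BLEU-1: clipped unigram precision times brevity penalty."""
    ref_counts = Counter(reference_tokens)
    hyp_counts = Counter(hypothesis_tokens)
    # Clip each hypothesis unigram count by its count in the reference.
    clipped = sum(min(c, ref_counts[w]) for w, c in hyp_counts.items())
    precision = clipped / max(len(hypothesis_tokens), 1)
    # Brevity penalty discourages trivially short hypotheses.
    if len(hypothesis_tokens) >= len(reference_tokens):
        bp = 1.0
    else:
        bp = math.exp(1 - len(reference_tokens) / len(hypothesis_tokens))
    return bp * precision

ref = "we had a great time at the beach".split()
hyp = "we had a fun time at the beach".split()
print(round(bleu1(ref, hyp), 3))  # 7 of 8 unigrams match -> 0.875
```

BLEU-2 through BLEU-4 extend the same idea to bigram through 4-gram precision, which is why the scores in the table fall sharply as n grows.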

Methods


No methods listed for this paper.