TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK	REMOVE
Machine Translation	IWSLT2014 German-English	Minimum Risk Training [Edunov2017]	BLEU score	32.84	# 29
Machine Translation	IWSLT2015 German-English	ConvS2S+Risk	BLEU score	32.93	# 4

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/classical-structured-prediction-losses-for/machine-translation-on-iwslt2015-german)](https://paperswithcode.com/sota/machine-translation-on-iwslt2015-german?p=classical-structured-prediction-losses-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/classical-structured-prediction-losses-for/machine-translation-on-iwslt2014-german)](https://paperswithcode.com/sota/machine-translation-on-iwslt2014-german?p=classical-structured-prediction-losses-for)`

Classical Structured Prediction Losses for Sequence to Sequence Learning

NAACL 2018 · Sergey Edunov, Myle Ott, Michael Auli, David Grangier, Marc'Aurelio Ranzato ·

There has been much recent work on training neural attention models at the sequence-level using either reinforcement learning-style methods or by optimizing the beam. In this paper, we survey a range of classical objective functions that have been widely used to train linear models for structured prediction and apply them to neural sequence to sequence models. Our experiments show that these losses can perform surprisingly well by slightly outperforming beam search optimization in a like for like setup. We also report new state of the art results on both IWSLT'14 German-English translation as well as Gigaword abstractive summarization. On the larger WMT'14 English-French translation task, sequence-level training achieves 41.5 BLEU which is on par with the state of the art.

PDF Abstract NAACL 2018 PDF NAACL 2018 Abstract

Code

Add Remove Mark official

pytorch/fairseq official

29,255

Tasks

Add Remove

Abstractive Text Summarization

Machine Translation

Reinforcement Learning (RL)

Structured Prediction

Translation

Datasets

Add Datasets introduced or used in this paper

Results from the Paper

Edit

Ranked #4 on Machine Translation on IWSLT2015 German-English

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Result	Benchmark
Machine Translation	IWSLT2014 German-English	Minimum Risk Training [Edunov2017]	BLEU score	32.84	# 29		Compare
Machine Translation	IWSLT2015 German-English	ConvS2S+Risk	BLEU score	32.93	# 4		Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Classical Structured Prediction Losses for Sequence to Sequence Learning

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove