TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Machine Translation	IWSLT2014 German-English	Data Diversification	BLEU score	37.2	# 11
Machine Translation	WMT2014 English-German	Data Diversification - Transformer	BLEU score	30.7	# 9
Machine Translation	WMT2014 English-German	Data Diversification - Transformer	Hardware Burden	None	# 1
Machine Translation	WMT2014 English-German	Data Diversification - Transformer	Operations per network pass	None	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/data-diversification-an-elegant-strategy-for/machine-translation-on-wmt2014-english-german)](https://paperswithcode.com/sota/machine-translation-on-wmt2014-english-german?p=data-diversification-an-elegant-strategy-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/data-diversification-an-elegant-strategy-for/machine-translation-on-iwslt2014-german)](https://paperswithcode.com/sota/machine-translation-on-iwslt2014-german?p=data-diversification-an-elegant-strategy-for)`

Data Diversification: A Simple Strategy For Neural Machine Translation

NeurIPS 2020 · Xuan-Phi Nguyen, Shafiq Joty, Wu Kui, Ai Ti Aw ·

We introduce Data Diversification: a simple but effective strategy to boost neural machine translation (NMT) performance. It diversifies the training data by using the predictions of multiple forward and backward models and then merging them with the original dataset on which the final NMT model is trained. Our method is applicable to all NMT models. It does not require extra monolingual data like back-translation, nor does it add more computations and parameters like ensembles of models. Our method achieves state-of-the-art BLEU scores of 30.7 and 43.7 in the WMT'14 English-German and English-French translation tasks, respectively. It also substantially improves on 8 other translation tasks: 4 IWSLT tasks (English-German and English-French) and 4 low-resource translation tasks (English-Nepali and English-Sinhala). We demonstrate that our method is more effective than knowledge distillation and dual learning, it exhibits strong correlation with ensembles of models, and it trades perplexity off for better BLEU score. We have released our source code at https://github.com/nxphi47/data_diversification

PDF Abstract NeurIPS 2020 PDF NeurIPS 2020 Abstract

Code

Add Remove Mark official

nxphi47/data_diversification official

nxphi47/multiagent_crosstranslate

Tasks

Add Remove

Knowledge Distillation

Machine Translation

NMT

Translation

Datasets

WMT 2014

Results from the Paper

Edit

Ranked #9 on Machine Translation on WMT2014 English-German

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Machine Translation	IWSLT2014 German-English	Data Diversification	BLEU score	37.2	# 11	Compare
Machine Translation	WMT2014 English-German	Data Diversification - Transformer	BLEU score	30.7	# 9	Compare
			Hardware Burden	None	# 1	Compare
			Operations per network pass	None	# 1	Compare

Methods

Add Remove

Knowledge Distillation

Edit Social Preview

Data Diversification: A Simple Strategy For Neural Machine Translation

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove