TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Machine Translation	WMT2016 English-German	Exploiting Mono at Scale (single)	SacreBLEU	40.9	# 1
Machine Translation	WMT2016 German-English	Exploiting Mono at Scale (single)	SacreBLEU	47.5	# 1
Machine Translation	WMT2019 English-German	Exploiting Mono at Scale (single)	SacreBLEU	43.8	# 1
Machine Translation	WMT2019 German-English	Exploiting Mono at Scale (single)	SacreBLEU	41.9	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/exploiting-monolingual-data-at-scale-for/machine-translation-on-wmt2016-english-german)](https://paperswithcode.com/sota/machine-translation-on-wmt2016-english-german?p=exploiting-monolingual-data-at-scale-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/exploiting-monolingual-data-at-scale-for/machine-translation-on-wmt2016-german-english)](https://paperswithcode.com/sota/machine-translation-on-wmt2016-german-english?p=exploiting-monolingual-data-at-scale-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/exploiting-monolingual-data-at-scale-for/machine-translation-on-wmt2019-english-german)](https://paperswithcode.com/sota/machine-translation-on-wmt2019-english-german?p=exploiting-monolingual-data-at-scale-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/exploiting-monolingual-data-at-scale-for/machine-translation-on-wmt2019-german-english)](https://paperswithcode.com/sota/machine-translation-on-wmt2019-german-english?p=exploiting-monolingual-data-at-scale-for)`

Exploiting Monolingual Data at Scale for Neural Machine Translation

IJCNLP 2019 · Lijun Wu, Yiren Wang, Yingce Xia, Tao Qin, Jian-Huang Lai, Tie-Yan Liu ·

While target-side monolingual data has been proven to be very useful to improve neural machine translation (briefly, NMT) through back translation, source-side monolingual data is not well investigated. In this work, we study how to use both the source-side and target-side monolingual data for NMT, and propose an effective strategy leveraging both of them. First, we generate synthetic bitext by translating monolingual data from the two domains into the other domain using the models pretrained on genuine bitext. Next, a model is trained on a noised version of the concatenated synthetic bitext where each source sequence is randomly corrupted. Finally, the model is fine-tuned on the genuine bitext and a clean version of a subset of the synthetic bitext without adding any noise. Our approach achieves state-of-the-art results on WMT16, WMT17, WMT18 English$\leftrightarrow$German translations and WMT19 German$\to$French translations, which demonstrate the effectiveness of our method. We also conduct a comprehensive study on how each part in the pipeline works.

PDF Abstract

Code

Add Remove Mark official

No code implementations yet. Submit your code now

Tasks

Add Remove

Machine Translation

NMT

Translation

Datasets

WMT 2016

Results from the Paper

Add Remove

Ranked #1 on Machine Translation on WMT2016 English-German (SacreBLEU metric, using extra training data)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Machine Translation	WMT2016 English-German	Exploiting Mono at Scale (single)	SacreBLEU	40.9	# 1	Compare
Machine Translation	WMT2016 German-English	Exploiting Mono at Scale (single)	SacreBLEU	47.5	# 1	Compare
Machine Translation	WMT2019 English-German	Exploiting Mono at Scale (single)	SacreBLEU	43.8	# 1	Compare
Machine Translation	WMT2019 German-English	Exploiting Mono at Scale (single)	SacreBLEU	41.9	# 1	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Exploiting Monolingual Data at Scale for Neural Machine Translation

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit Add Remove

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Add Remove

Methods

Add Remove