MTet: Multi-domain Translation for English-Vietnamese

We are excited to introduce a new larger and better quality Machine Translation dataset, MTet, which stands for Multi-domain Translation for English and VieTnamese. In our new release, we extend our previous dataset (v1.0) by adding more high-quality English-Vietnamese sentence pairs on various domains. In addition, we also show our new larger Transformer models can achieve state-of-the-art results on multiple test sets.

PDF

Datasets


  Add Datasets introduced or used in this paper

Results from the Paper


 Ranked #1 on Machine Translation on IWSLT2015 English-Vietnamese (using extra training data)

     Get a GitHub badge
Task Dataset Model Metric Name Metric Value Global Rank Uses Extra
Training Data
Benchmark
Machine Translation IWSLT2015 English-Vietnamese Transformer Tall 18 BLEU 40.2 # 1

Methods