MLQE (MultiLingual Quality Estimation)

Introduced by Fomicheva et al. in Unsupervised Quality Estimation for Neural Machine Translation

The MLQE dataset is a dataset for sentence-level Machine Translation Quality Estimation. It consists of 6 language pairs representing NMT training in high, medium, and low-resource scenarios. The corpus is extracted from Wikipedia, and 10K segments per language pair are annotated.

Source: https://github.com/facebookresearch/mlqe

Papers


Paper Code Results Date Stars

Dataset Loaders


Tasks


Similar Datasets


License


  • Unknown

Modalities


Languages