Just Ask! Evaluating Machine Translation by Asking and Answering Questions

In this paper, we show that automatically generated questions and answers can be used to evaluate the quality of Machine Translation (MT) systems. Building on recent work on the evaluation of abstractive text summarization, we propose a new metric for system-level MT evaluation, compare it with other state-of-the-art solutions, and demonstrate its robustness through experiments on multiple translation directions.
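
To make the idea concrete, below is a minimal sketch of how a QA-based MT score could be computed, assuming question-answer pairs have already been generated from the reference translation. The Hugging Face question-answering pipeline and the token-level F1 scoring used here are illustrative choices, not the exact components described in the paper.

```python
# Illustrative sketch of QA-based MT evaluation (not the authors' exact pipeline).
# Assumption: (question, gold_answer) pairs were generated offline from the reference.
from collections import Counter
from transformers import pipeline

qa_model = pipeline("question-answering")  # default extractive QA model


def token_f1(prediction: str, gold: str) -> float:
    """Token-level F1 overlap between a predicted answer and the gold answer."""
    pred_tokens = prediction.lower().split()
    gold_tokens = gold.lower().split()
    common = Counter(pred_tokens) & Counter(gold_tokens)
    overlap = sum(common.values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(gold_tokens)
    return 2 * precision * recall / (precision + recall)


def qa_mt_score(mt_output: str, qa_pairs: list[tuple[str, str]]) -> float:
    """Answer each question against the MT hypothesis and compare with the
    answer expected from the reference; return the mean F1."""
    scores = []
    for question, gold_answer in qa_pairs:
        predicted = qa_model(question=question, context=mt_output)["answer"]
        scores.append(token_f1(predicted, gold_answer))
    return sum(scores) / len(scores) if scores else 0.0


# Hypothetical usage with QA pairs derived from a reference sentence.
qa_pairs = [("Who visited Paris?", "the delegation"),
            ("When did the visit take place?", "last week")]
print(qa_mt_score("The delegation visited Paris last week.", qa_pairs))
```

Averaging such sentence-level scores over a test set would yield a system-level score, which is the granularity of evaluation the paper targets.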
