An automatic model and Gold Standard for translation alignment of Ancient Greek

LREC 2022 · Tariq Yousef, Chiara Palladino, Farnoosh Shamsian, Anise d’Orange Ferreira, Michel Ferreira dos Reis ·

This paper illustrates a workflow for developing and evaluating automatic translation alignment models for Ancient Greek. We designed an annotation Style Guide and a gold standard for the alignment of Ancient Greek-English and Ancient Greek-Portuguese, measured inter-annotator agreement and used the resulting dataset to evaluate the performance of various translation alignment models. We proposed a fine-tuning strategy that employs unsupervised training with mono- and bilingual texts and supervised training using manually aligned sentences. The results indicate that the fine-tuned model based on XLM-Roberta is superior in performance, and it achieved good results on language pairs that were not part of the training data.

PDF Abstract