no code implementations • ACL 2017 • Miloš Stanojević, Khalil Sima'an
MT evaluation metrics are tested for correlation with human judgments either at the sentence- or the corpus-level.
no code implementations • COLING 2016 • Joachim Daiber, Miloš Stanojević, Khalil Sima'an
In this paper we explore the novel idea of building a single universal reordering model from English to a large number of target languages.
no code implementations • COLING 2016 • Miloš Stanojević, Khalil Sima'an
Subsequently we define example tight metrics and empirically test them in word order evaluation.
1 code implementation • TACL 2016 • Hoang Cuong, Khalil Sima'an, Ivan Titov
Existing work on domain adaptation for statistical machine translation has consistently assumed access to a small sample from the test distribution (target domain) at training time.
no code implementations • LREC 2014 • Jasmijn Bastings, Khalil Sima'an
PARSEVAL, the default paradigm for evaluating constituency parsers, calculates parsing success (Precision/Recall) as a function of the number of matching labeled brackets across the test set.
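The PARSEVAL computation described in this excerpt can be illustrated with a minimal sketch. It is not taken from the paper: the function name `parseval` and the representation of each tree as a list of `(label, start, end)` brackets are assumptions made for illustration, and the sketch omits refinements that standard scorers apply (for example, ignoring punctuation or the root bracket).

```python
from collections import Counter


def parseval(gold_trees, pred_trees):
    """Labeled bracket precision and recall, pooled over a test set.

    Each tree is a list of (label, start, end) tuples (an assumed,
    illustrative representation of its labeled brackets).
    """
    matched = gold_total = pred_total = 0
    for gold, pred in zip(gold_trees, pred_trees):
        gold_counts = Counter(gold)
        pred_counts = Counter(pred)
        # Multiset intersection: a repeated bracket can only match
        # as many times as it occurs in both trees.
        matched += sum((gold_counts & pred_counts).values())
        gold_total += sum(gold_counts.values())
        pred_total += sum(pred_counts.values())
    precision = matched / pred_total if pred_total else 0.0
    recall = matched / gold_total if gold_total else 0.0
    return precision, recall


# One gold tree with three brackets; the prediction recovers two of them.
gold = [[("S", 0, 3), ("NP", 0, 1), ("VP", 1, 3)]]
pred = [[("S", 0, 3), ("NP", 0, 1)]]
precision, recall = parseval(gold, pred)
```

In this toy case both predicted brackets appear in the gold tree, so precision is 2/2, while recall is 2/3 because the gold `VP` bracket is missing. Counts are summed over all sentences before dividing, which matches the "across the test set" wording in the excerpt.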