Demetr

Introduced by Karpinska et al. in DEMETR: Diagnosing Evaluation Metrics for Translation

Demetr is a diagnostic dataset with 31K English examples (translated from 10 source languages) for evaluating the sensitivity of MT evaluation metrics to 35 different linguistic perturbations spanning semantic, syntactic, and morphological error categories.

Source: DEMETR: Diagnosing Evaluation Metrics for Translation

Papers


Paper Code Results Date Stars

Dataset Loaders


No data loaders found. You can submit your data loader here.

Tasks


Similar Datasets