This shared task will examine automatic evaluation metrics for machine translation. The goals of the shared metrics task are:

  • To achieve the strongest correlation with human judgement of translation quality;
  • To illustrate the suitability of an automatic evaluation metric as a surrogate for human evaluation;
  • To address problems associated with comparison with a single reference translation;
  • To move automatic evaluation beyond system-level ranking to finer-grained sentence-level ranking.

All datasets for this task are available here.

