PheMT

Introduced by Fujii et al. in PheMT: A Phenomenon-wise Dataset for Machine Translation Robustness on User-Generated Contents

PheMT is a phenomenon-wise dataset designed for evaluating the robustness of Japanese-English machine translation systems. The dataset is based on the MTNT dataset, with additional annotations for four linguistic phenomena common in user-generated content (UGC): Proper Noun, Abbreviated Noun, Colloquial Expression, and Variant.

Source: https://github.com/cl-tohoku/PheMT

Dataset Loaders

cl-tohoku/PheMT

Tasks

Machine Translation

Similar Datasets

MTNT

Source: Fujii et al.

License

Unknown

Modalities

Texts

Languages

English
Japanese
