Kor-Lang8 (Lang-8 Korean Corpus)

Introduced by Yoon et al. in Towards standardizing Korean Grammatical Error Correction: Datasets and Annotation

Kor-Lang8 is a Korean grammatical error correction (GEC) dataset extracted from the NAIST Lang-8 Learner Corpora by the language label. It contains more than 109K sentence pairs.

Source: Towards standardizing Korean Grammatical Error Correction: Datasets and Annotation

Papers


Paper Code Results Date Stars

Dataset Loaders


No data loaders found. You can submit your data loader here.

Tasks


Similar Datasets


License


  • Unknown

Modalities


Languages