KorQuAD1.0: Korean QA Dataset for Machine Reading Comprehension

16 Sep 2019  ·  Seungyoung Lim, Myungji Kim, Jooyoul Lee

Machine Reading Comprehension (MRC) is a task that requires a machine to understand natural language and answer questions by reading a document. It is at the core of automatic response technologies such as chatbots and automated customer support systems. We present the Korean Question Answering Dataset (KorQuAD), a large-scale Korean dataset for the extractive machine reading comprehension task. It consists of 70,000+ human-generated question-answer pairs on Korean Wikipedia articles. We release KorQuAD1.0 and launch a challenge at https://KorQuAD.github.io to encourage the development of multilingual natural language processing research.
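
To make "extractive" concrete: in this kind of task the answer is a span copied from the document rather than freely generated text. The sketch below is a minimal illustration, assuming a SQuAD-style record with a context, a question, an answer string, and a character offset. The class name, field names, and the English toy passage are hypothetical and are not taken from the KorQuAD release.

```python
from dataclasses import dataclass


@dataclass
class ExtractiveExample:
    """One question-answer pair over a passage (hypothetical schema)."""
    context: str       # passage the answer must be extracted from
    question: str      # human-written question about the passage
    answer_text: str   # answer string, expected to occur in the context
    answer_start: int  # character offset of the answer within the context


def answer_span(example: ExtractiveExample) -> str:
    """Slice the context at the recorded offset to recover the answer span."""
    end = example.answer_start + len(example.answer_text)
    return example.context[example.answer_start:end]


def is_consistent(example: ExtractiveExample) -> bool:
    """Check that the recorded offset really points at the answer text."""
    return answer_span(example) == example.answer_text


# Toy passage for illustration only; not an item from the dataset.
context = "KorQuAD is a Korean dataset built from Korean Wikipedia articles."
answer = "Korean Wikipedia articles"
example = ExtractiveExample(
    context=context,
    question="What is KorQuAD built from?",
    answer_text=answer,
    answer_start=context.index(answer),
)
```

A consistency check of this sort is a common sanity step before training or scoring, since an offset that does not match the answer string usually indicates a preprocessing or encoding problem.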

Datasets


Introduced in the Paper:

KorQuAD

Used in the Paper:

SQuAD, HotpotQA
