LogiQA consists of 8,678 QA instances, covering multiple types of deductive reasoning. Results show that state-of-the-art neural models perform by far worse than human ceiling. The dataset can also serve as a benchmark for reinvestigating logical AI under the deep learning NLP setting.
Source: LogiQA: A Challenge Dataset for Machine Reading Comprehension with Logical ReasoningPaper | Code | Results | Date | Stars |
---|