CODAH: An Adversarially Authored Question-Answer Dataset for Common Sense

8 Apr 2019Michael ChenMike D'ArcyAlisa LiuJared FernandezDoug Downey

Commonsense reasoning is a critical AI capability, but it is difficult to construct challenging datasets that test common sense. Recent neural question answering systems, based on large pre-trained models of language, have already achieved near-human-level performance on commonsense knowledge benchmarks... (read more)

PDF Abstract

Code


No code implementations yet. Submit your code now

Evaluation Results from the Paper


 SOTA for Question Answering on CODAH (using extra training data)

     Get a GitHub badge
TASK DATASET MODEL METRIC NAME METRIC VALUE GLOBAL RANK USES EXTRA
TRAINING DATA
COMPARE
Question Answering CODAH BERT Large Accuracy 69.6 # 1
Common Sense Reasoning CODAH BERT Large Accuracy 69.6 # 1