QA-GNN: Reasoning with Language Models and Knowledge Graphs for Question Answering

The problem of answering questions using knowledge from pre-trained language models (LMs) and knowledge graphs (KGs) presents two challenges: given a QA context (question and answer choice), methods need to (i) identify relevant knowledge from large KGs, and (ii) perform joint reasoning over the QA context and KG. In this work, we propose a new model, QA-GNN, which addresses these challenges through two key innovations: (i) relevance scoring, where we use LMs to estimate the importance of KG nodes relative to the given QA context, and (ii) joint reasoning, where we connect the QA context and KG to form a joint graph and mutually update their representations through graph neural networks. We evaluate our model on QA benchmarks in the commonsense (CommonsenseQA, OpenBookQA) and biomedical (MedQA-USMLE) domains. QA-GNN outperforms existing LM and LM+KG models and can perform interpretable, structured reasoning, e.g., correctly handling negation in questions.
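To make innovation (i) concrete, below is a minimal sketch of relevance scoring: each candidate KG node is scored by the length-normalized log-likelihood of its surface form conditioned on the QA context, using a pre-trained LM. The choice of GPT-2, the prompt format, and the `relevance_score` helper are illustrative assumptions for this sketch, not the paper's exact implementation.

```python
# Sketch of QA-GNN-style relevance scoring: how relevant is a KG node to
# a QA context, judged by a pre-trained LM? (Assumptions: GPT-2 as the LM,
# plain text concatenation as the conditioning format.)
import torch
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")
model.eval()

def relevance_score(qa_context: str, node_text: str) -> float:
    """Length-normalized log-likelihood of the node's text given the QA context."""
    context_ids = tokenizer.encode(qa_context)
    node_ids = tokenizer.encode(" " + node_text)  # leading space for GPT-2 BPE
    input_ids = torch.tensor([context_ids + node_ids])
    with torch.no_grad():
        logits = model(input_ids).logits
    # Logits at position t predict the token at position t+1, so score the
    # node tokens using the logits of the positions immediately before them.
    log_probs = torch.log_softmax(logits[0, :-1], dim=-1)
    node_positions = range(len(context_ids) - 1, len(context_ids) + len(node_ids) - 1)
    total = sum(log_probs[t, input_ids[0, t + 1]].item() for t in node_positions)
    return total / len(node_ids)

qa = "Where would you find a picture frame hanging? Answer: wall."
for node in ["wall", "picture", "banana"]:
    print(f"{node}: {relevance_score(qa, node):.3f}")
```

In the full model, these scores are used to weight or prune retrieved KG nodes before innovation (ii): the QA context is added as an extra node connected to the retrieved entities, and a GNN performs message passing over this joint graph.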

Published at NAACL 2021.

Results from the Paper


| Task                   | Dataset       | Model                  | Metric       | Value | Global Rank |
|------------------------|---------------|------------------------|--------------|-------|-------------|
| Common Sense Reasoning | CommonsenseQA | QA-GNN                 | Accuracy (%) | 76.1  | #13         |
| Question Answering     | OpenBookQA    | QA-GNN                 | Accuracy (%) | 82.8  | #20         |
| Question Answering     | OpenBookQA    | AristoRoBERTa + QA-GNN | Accuracy (%) | 82.8  | #20         |
| Question Answering     | OpenBookQA    | AristoRoBERTa          | Accuracy (%) | 77.8  | #25         |
| Riddle Sense           | RiddleSense   | QA-GNN                 | Accuracy (%) | 67.0  | #2          |
