TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Common Sense Reasoning	CommonsenseQA	QA-GNN	Accuracy	76.1	# 12
Question Answering	OpenBookQA	QA-GNN	Accuracy	82.8	# 17
Question Answering	OpenBookQA	AristoRoBERTa + QA-GNN	Accuracy	82.8	# 17
Question Answering	OpenBookQA	AristoRoBERTa	Accuracy	77.8	# 21
Riddle Sense	RiddleSense	QAGNN	Accuracy (%)	67	# 2

QA-GNN: Reasoning with Language Models and Knowledge Graphs for Question Answering

NAACL 2021 · Michihiro Yasunaga, Hongyu Ren, Antoine Bosselut, Percy Liang, Jure Leskovec

The problem of answering questions using knowledge from pre-trained language models (LMs) and knowledge graphs (KGs) presents two challenges: given a QA context (question and answer choice), methods need to (i) identify relevant knowledge from large KGs, and (ii) perform joint reasoning over the QA context and KG. In this work, we propose a new model, QA-GNN, which addresses the above challenges through two key innovations: (i) relevance scoring, where we use LMs to estimate the importance of KG nodes relative to the given QA context, and (ii) joint reasoning, where we connect the QA context and KG to form a joint graph, and mutually update their representations through graph neural networks. We evaluate our model on QA benchmarks in the commonsense (CommonsenseQA, OpenBookQA) and biomedical (MedQA-USMLE) domains. QA-GNN outperforms existing LM and LM+KG models, and exhibits capabilities to perform interpretable and structured reasoning, e.g., correctly handling negation in questions.
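The two ideas in the abstract, relevance scoring and joint reasoning over a combined graph, can be illustrated with a small toy sketch. This is not the authors' implementation: the abstract says a language model scores KG nodes and a graph neural network updates the representations, whereas the sketch below substitutes a word-overlap score (`toy_relevance`) for the LM and a single relevance-weighted neighbor average (`message_pass`) for the GNN. The `CONTEXT` node id, the function names, the scalar features, and the example question are all assumptions made for illustration.

```python
import math
from typing import Dict, Iterable, List, Set, Tuple

CONTEXT = "<qa_context>"  # hypothetical id for the extra QA-context node


def toy_relevance(qa_context: str, entity: str) -> float:
    """Assumed stand-in for an LM score: share of entity words found in the context."""
    context_words = set(qa_context.lower().split())
    entity_words = entity.lower().split()
    return sum(w in context_words for w in entity_words) / len(entity_words)


def build_joint_graph(
    entities: List[str], kg_edges: Iterable[Tuple[str, str]]
) -> Dict[str, Set[str]]:
    """Undirected adjacency sets: KG edges plus a context node linked to every entity."""
    neighbors: Dict[str, Set[str]] = {name: set() for name in entities}
    neighbors[CONTEXT] = set(entities)
    for name in entities:
        neighbors[name].add(CONTEXT)
    for u, v in kg_edges:
        neighbors[u].add(v)
        neighbors[v].add(u)
    return neighbors


def message_pass(
    neighbors: Dict[str, Set[str]],
    features: Dict[str, float],
    relevance: Dict[str, float],
) -> Dict[str, float]:
    """One round: each node takes a softmax(relevance)-weighted mean of neighbor features."""
    updated: Dict[str, float] = {}
    for node, nbrs in neighbors.items():
        if not nbrs:  # isolated node: keep its own feature
            updated[node] = features[node]
            continue
        weights = {m: math.exp(relevance[m]) for m in nbrs}
        total = sum(weights.values())
        updated[node] = sum(weights[m] * features[m] for m in nbrs) / total
    return updated


# Made-up example, not taken from any benchmark.
qa_context = "where would you find a revolving door at a bank"
entities = ["revolving door", "bank", "holiday"]
kg_edges = [("revolving door", "bank")]

relevance = {e: toy_relevance(qa_context, e) for e in entities}
relevance[CONTEXT] = 1.0  # assumption: the context is fully relevant to itself
graph = build_joint_graph(entities, kg_edges)
features = dict(relevance)  # scalar features initialized from relevance, for simplicity
updated = message_pass(graph, features, relevance)
```

Because the QA context is itself a node in the joint graph, one pass updates the context from the entities and the entities from the context, which is the sense of "mutually update" in this toy setting. Low-relevance entities such as `holiday` contribute less to their neighbors' updates.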

Code

michiyasunaga/qagnn official · 603 stars

worksheets/0xf215deb0 official

rucaibox/safe

CMSC470-Team/Model

Tasks

Common Sense Reasoning

Graph Representation Learning

Knowledge Graphs

Language Modelling

Multi-hop Question Answering

Negation

Question Answering

Riddle Sense

Datasets

ConceptNet

OpenBookQA

CommonsenseQA

RiddleSense

Results from the Paper

Ranked #2 on the Riddle Sense task (RiddleSense dataset)

Methods

Absolute Position Encodings • Adam • BPE • Dense Connections • Dropout • GAT • Label Smoothing • Layer Normalization • Linear Layer • Multi-Head Attention • Position-Wise Feed-Forward Layer • Residual Connection • Scaled Dot-Product Attention • Softmax • Transformer
