TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Common Sense Reasoning	CommonsenseQA	DRAGON	Accuracy	78.2	# 8
Question Answering	MedQA	DRAGON + BioLinkBERT	Accuracy	47.5	# 13
Riddle Sense	RiddleSense	DRAGON	Accuracy (%)	71.3	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/deep-bidirectional-language-knowledge-graph/riddle-sense-on-riddle-sense)](https://paperswithcode.com/sota/riddle-sense-on-riddle-sense?p=deep-bidirectional-language-knowledge-graph)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/deep-bidirectional-language-knowledge-graph/common-sense-reasoning-on-commonsenseqa)](https://paperswithcode.com/sota/common-sense-reasoning-on-commonsenseqa?p=deep-bidirectional-language-knowledge-graph)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/deep-bidirectional-language-knowledge-graph/question-answering-on-medqa-usmle)](https://paperswithcode.com/sota/question-answering-on-medqa-usmle?p=deep-bidirectional-language-knowledge-graph)`

Deep Bidirectional Language-Knowledge Graph Pretraining

17 Oct 2022 · Michihiro Yasunaga, Antoine Bosselut, Hongyu Ren, Xikun Zhang, Christopher D Manning, Percy Liang, Jure Leskovec ·

Pretraining a language model (LM) on text has been shown to help various downstream NLP tasks. Recent works show that a knowledge graph (KG) can complement text data, offering structured background knowledge that provides a useful scaffold for reasoning. However, these works are not pretrained to learn a deep fusion of the two modalities at scale, limiting the potential to acquire fully joint representations of text and KG. Here we propose DRAGON (Deep Bidirectional Language-Knowledge Graph Pretraining), a self-supervised approach to pretraining a deeply joint language-knowledge foundation model from text and KG at scale. Specifically, our model takes pairs of text segments and relevant KG subgraphs as input and bidirectionally fuses information from both modalities. We pretrain this model by unifying two self-supervised reasoning tasks, masked language modeling and KG link prediction. DRAGON outperforms existing LM and LM+KG models on diverse downstream tasks including question answering across general and biomedical domains, with +5% absolute gain on average. In particular, DRAGON achieves notable performance on complex reasoning about language and knowledge (+10% on questions involving long contexts or multi-step reasoning) and low-resource QA (+8% on OBQA and RiddleSense), and new state-of-the-art results on various BioNLP tasks. Our code and trained models are available at https://github.com/michiyasunaga/dragon.

PDF Abstract

Code

Add Remove Mark official

michiyasunaga/dragon official

288

Tasks

Add Remove

Common Sense Reasoning

Knowledge Graphs

Language Modelling

Link Prediction

Masked Language Modeling

Question Answering

Riddle Sense

Datasets

ConceptNet

HellaSwag

PIQA

OpenBookQA

CommonsenseQA

BookCorpus

PubMedQA

MedQA

CosmosQA

SIQA RiddleSense

Results from the Paper

Edit

Ranked #1 on Riddle Sense on RiddleSense

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Common Sense Reasoning	CommonsenseQA	DRAGON	Accuracy	78.2	# 8	Compare
Question Answering	MedQA	DRAGON + BioLinkBERT	Accuracy	47.5	# 13	Compare
Riddle Sense	RiddleSense	DRAGON	Accuracy (%)	71.3	# 1	Compare

Methods

Add Remove

Absolute Position Encodings • Adam • BPE • Dense Connections • Dropout • Label Smoothing • Layer Normalization • Linear Layer • Multi-Head Attention • Position-Wise Feed-Forward Layer • Residual Connection • Scaled Dot-Product Attention • Softmax • Transformer

Edit Social Preview

Deep Bidirectional Language-Knowledge Graph Pretraining

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove