TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Question Answering	SQuAD1.1	BiDAF + Self Attention (single model)	EM	72.139	# 144
Question Answering	SQuAD1.1	BiDAF + Self Attention (single model)	F1	81.048	# 146
Question Answering	TriviaQA	S-Norm	EM	66.37	# 28
Question Answering	TriviaQA	S-Norm	F1	71.32	# 6

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/simple-and-effective-multi-paragraph-reading/question-answering-on-triviaqa)](https://paperswithcode.com/sota/question-answering-on-triviaqa?p=simple-and-effective-multi-paragraph-reading)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/simple-and-effective-multi-paragraph-reading/question-answering-on-squad11)](https://paperswithcode.com/sota/question-answering-on-squad11?p=simple-and-effective-multi-paragraph-reading)`

Simple and Effective Multi-Paragraph Reading Comprehension

ACL 2018 · Christopher Clark, Matt Gardner ·

We consider the problem of adapting neural paragraph-level question answering models to the case where entire documents are given as input. Our proposed solution trains models to produce well calibrated confidence scores for their results on individual paragraphs. We sample multiple paragraphs from the documents during training, and use a shared-normalization training objective that encourages the model to produce globally correct output. We combine this method with a state-of-the-art pipeline for training models on document QA data. Experiments demonstrate strong performance on several document QA datasets. Overall, we are able to achieve a score of 71.3 F1 on the web portion of TriviaQA, a large improvement from the 56.7 F1 of the previous best system.

PDF Abstract ACL 2018 PDF ACL 2018 Abstract

Code

Add Remove Mark official

allenai/document-qa official

435

Tasks

Add Remove

Question Answering

Reading Comprehension

TriviaQA

Datasets

SQuAD

TriviaQA

Results from the Paper

Edit

Ranked #28 on Question Answering on TriviaQA (using extra training data)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Question Answering	SQuAD1.1	BiDAF + Self Attention (single model)	EM	72.139	# 144	Compare
Question Answering	SQuAD1.1	BiDAF + Self Attention (single model)	F1	81.048	# 146	Compare
Question Answering	TriviaQA	S-Norm	EM	66.37	# 28	Compare
Question Answering	TriviaQA	S-Norm	F1	71.32	# 6	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Simple and Effective Multi-Paragraph Reading Comprehension

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove