End-to-End Training of Multi-Document Reader and Retriever for Open-Domain Question Answering

We present an end-to-end differentiable training method for retrieval-augmented open-domain question answering systems that combine information from multiple retrieved documents when generating answers. We model retrieval decisions as latent variables over sets of relevant documents. Since marginalizing over sets of retrieved documents is computationally hard, we approximate this using an expectation-maximization algorithm. We iteratively estimate the value of our latent variable (the set of relevant documents for a given question) and then use this estimate to update the retriever and reader parameters. We hypothesize that such end-to-end training allows training signals to flow from the reader to the retriever more effectively than stage-wise training. This results in a retriever that selects more relevant documents for a question and a reader that is trained on more accurate documents to generate an answer. Experiments on three benchmark datasets demonstrate that our proposed method outperforms all existing approaches of comparable size by 2-3 absolute exact match points, achieving new state-of-the-art results. Our results also demonstrate the feasibility of learning to retrieve to improve answer generation without explicit supervision of retrieval decisions.
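The core training objective described above, marginalizing the reader's answer likelihood over the retrieved documents weighted by the retriever's relevance distribution, can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function names are hypothetical, and it assumes per-document retriever scores and reader answer log-likelihoods are already computed for the top-K retrieved documents.

```python
import numpy as np

def logsumexp(x):
    """Numerically stable log(sum(exp(x)))."""
    m = np.max(x)
    return m + np.log(np.sum(np.exp(x - m)))

def marginal_nll(retriever_scores, reader_answer_loglik):
    """Negative marginal log-likelihood over K retrieved documents.

    retriever_scores:     (K,) unnormalized relevance scores s(q, z_k)
    reader_answer_loglik: (K,) reader log p(a | q, z_k)

    loss = -log sum_k p(a | q, z_k) * p(z_k | q),
    where p(z_k | q) is a softmax of the scores over the K
    retrieved documents. Gradients of this single scalar would
    reach both the reader and the retriever in end-to-end training.
    """
    log_prior = retriever_scores - logsumexp(retriever_scores)
    log_joint = reader_answer_loglik + log_prior
    return -logsumexp(log_joint)

# Example: 4 documents with a uniform retriever prior and the reader
# assigning probability 0.5 to the answer under every document; the
# marginal likelihood is then 0.5 regardless of K.
loss = marginal_nll(np.zeros(4), np.full(4, np.log(0.5)))
```

In the paper's full method this marginalization is embedded in an EM-style loop: the current model estimates which retrieved documents are relevant, and that estimate is used to update both sets of parameters.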

NeurIPS 2021
| Task | Dataset | Model | Metric | Value | Rank |
|---|---|---|---|---|---|
| Question Answering | Natural Questions | EMDR² | EM | 52.5 | #7 |
| Open-Domain Question Answering | Natural Questions (short) | EMDR² | Exact Match | 52.5 | #1 |
| Question Answering | TriviaQA | EMDR² | EM | 71.4 | #21 |
| Open-Domain Question Answering | WebQuestions | EMDR² | Exact Match | 48.7 | #4 |
