Retrieval as Attention: End-to-end Learning of Retrieval and Reading within a Single Transformer

5 Dec 2022  ·  Zhengbao Jiang, Luyu Gao, Jun Araki, Haibo Ding, Zhiruo Wang, Jamie Callan, Graham Neubig

Systems for knowledge-intensive tasks such as open-domain question answering (QA) usually consist of two stages: efficient retrieval of relevant documents from a large corpus and detailed reading of the selected documents to generate answers. Retrievers and readers are typically modeled separately, which makes implementation cumbersome and end-to-end training and adaptation difficult. In this paper, we revisit this design and eschew the separate architecture and training in favor of a single Transformer that performs Retrieval as Attention (ReAtt), trained end-to-end solely with supervision from the end QA task. We demonstrate for the first time that a single model trained end-to-end can achieve both competitive retrieval and QA performance, matching or slightly outperforming state-of-the-art separately trained retrievers and readers. Moreover, end-to-end adaptation significantly boosts performance on out-of-domain datasets in both supervised and unsupervised settings, making our model a simple and adaptable solution for knowledge-intensive tasks. Code and models are available at https://github.com/jzbjyb/ReAtt.
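
To make the abstract's central idea concrete, below is a minimal, self-contained sketch of retrieval-as-attention scoring in PyTorch. This is an illustration under assumptions, not the paper's implementation: the toy embedding stands in for a shared Transformer encoder, and the max-then-mean aggregation of token-level attention logits into a document score is one plausible choice; see the linked repository for the actual model.

```python
# Minimal sketch of retrieval-as-attention scoring (hypothetical names;
# the real ReAtt uses a full Transformer and trains these scores end-to-end).
import torch

torch.manual_seed(0)
VOCAB, DIM = 1000, 64

# Stand-in for a shared Transformer encoder, so the example runs
# without external checkpoints.
embed = torch.nn.Embedding(VOCAB, DIM)
proj_q = torch.nn.Linear(DIM, DIM)  # question-side projection (the "Q" of attention)
proj_k = torch.nn.Linear(DIM, DIM)  # document-side projection (the "K" of attention)

def score(question_ids: torch.Tensor, doc_ids: torch.Tensor) -> torch.Tensor:
    """Score one document via cross-attention between question and document tokens."""
    q = proj_q(embed(question_ids))  # (q_len, DIM)
    k = proj_k(embed(doc_ids))       # (d_len, DIM)
    att = q @ k.T / DIM ** 0.5       # token-level attention logits
    # Aggregate token-level attention into one relevance score:
    # max over document tokens, mean over question tokens (one plausible choice).
    return att.max(dim=1).values.mean()

question = torch.randint(0, VOCAB, (8,))
corpus = [torch.randint(0, VOCAB, (32,)) for _ in range(5)]

scores = torch.stack([score(question, d) for d in corpus])
top2 = scores.topk(2).indices  # documents handed to the reading component
print("retrieval scores:", [round(s, 3) for s in scores.tolist()])
print("documents passed to the reader:", top2.tolist())
```

Because the document score is a differentiable function of the same attention machinery a Transformer uses for reading, a QA loss (e.g. cross-entropy on the answer) can backpropagate into the retrieval scores, which is what allows retrieval to be learned from answer supervision alone.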

| Task | Dataset | Model | Metric | Value | Global Rank |
|------|---------|-------|--------|-------|-------------|
| Passage Retrieval | Natural Questions | ReAtt | Precision@20 | 86.00 | 1 |
| Passage Retrieval | Natural Questions | ReAtt | Precision@100 | 90.40 | 1 |
| Question Answering | Natural Questions | ReAtt | EM | 54.7 | 5 |
