TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Open-Domain Question Answering	Natural Questions	BPR (linear scan; l=1000)	Exact Match	41.6	# 5
Question Answering	Natural Questions (long)	BPR (linear scan; l=1000)	EM	41.6	# 6
Open-Domain Question Answering	TQA	BPR (linear scan; l=1000)	Exact Match	56.8	# 2

Efficient Passage Retrieval with Hashing for Open-domain Question Answering

ACL 2021 · Ikuya Yamada, Akari Asai, Hannaneh Hajishirzi

Most state-of-the-art open-domain question answering systems use a neural retrieval model to encode passages into continuous vectors and extract them from a knowledge source. However, such retrieval models often require large memory to run because of the massive size of their passage index. In this paper, we introduce Binary Passage Retriever (BPR), a memory-efficient neural retrieval model that integrates a learning-to-hash technique into the state-of-the-art Dense Passage Retriever (DPR) to represent the passage index using compact binary codes rather than continuous vectors. BPR is trained with a multi-task objective over two tasks: efficient candidate generation based on binary codes and accurate reranking based on continuous vectors. Compared with DPR, BPR substantially reduces the memory cost from 65GB to 2GB without a loss of accuracy on two standard open-domain question answering benchmarks: Natural Questions and TriviaQA. Our code and trained models are available at https://github.com/studio-ousia/bpr.
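The abstract's two ideas, a compact binary index and a two-stage search over it, can be illustrated with a small sketch. This is not the authors' implementation (see the linked repository for that). It assumes that hashing at inference time is a per-dimension sign function, that candidates are generated by a linear scan over Hamming distances, and that reranking scores the continuous query vector against the binary passage codes. The passage count (21,015,324) and dimension (768) used in the memory estimate are also assumptions, not stated on this page. All function names are hypothetical.

```python
def index_size_bytes(num_passages, dim, bits_per_dim):
    """Size of a flat index storing `bits_per_dim` bits for every dimension."""
    return num_passages * dim * bits_per_dim // 8


def binarize(vec):
    """Hash a continuous vector to a {-1, +1} code via the sign of each dimension."""
    return [1 if x >= 0 else -1 for x in vec]


def hamming(a, b):
    """Number of positions at which two binary codes differ."""
    return sum(1 for x, y in zip(a, b) if x != y)


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def retrieve(query_vec, passage_codes, num_candidates, top_k):
    """Two-stage search: Hamming candidate generation, then reranking."""
    query_code = binarize(query_vec)
    # Stage 1: linear scan over the binary codes, keep the nearest candidates.
    candidates = sorted(
        range(len(passage_codes)),
        key=lambda i: hamming(query_code, passage_codes[i]),
    )[:num_candidates]
    # Stage 2: rerank candidates using the continuous query vector.
    reranked = sorted(
        candidates,
        key=lambda i: dot(query_vec, passage_codes[i]),
        reverse=True,
    )
    return reranked[:top_k]


# Memory estimate under the assumed corpus size and dimension.
dense = index_size_bytes(21_015_324, 768, 32)   # float32 vectors
binary = index_size_bytes(21_015_324, 768, 1)   # 1 bit per dimension
print(f"dense: {dense / 1e9:.1f} GB, binary: {binary / 1e9:.1f} GB")

# Toy index of four 4-bit passage codes.
passage_codes = [
    [1, 1, 1, 1],
    [1, 1, -1, 1],
    [-1, -1, -1, -1],
    [1, -1, 1, 1],
]
query = [0.9, 0.2, 0.7, 0.4]
top = retrieve(query, passage_codes, num_candidates=3, top_k=2)
print(top)  # → [0, 3]
```

In the toy example, passages 1 and 3 are tied on Hamming distance, and the reranking step breaks the tie in favour of passage 3 because the query's continuous values weight the third dimension more heavily than the second. Under the assumed corpus size, the estimate comes out at roughly 64.6 GB for float32 vectors and 2.0 GB for binary codes, in line with the 65GB and 2GB figures quoted in the abstract.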

Code

studio-ousia/bpr (official) · 161 stars

Tasks

Natural Questions

Open-Domain Question Answering

Passage Retrieval

Question Answering

Retrieval

TriviaQA

Datasets

Natural Questions

TriviaQA

TQA

Results from the Paper

Ranked #2 on Open-Domain Question Answering on TQA


