TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Language Modelling	WikiText-103	kNN-LM w/ Adaptive Coefficient	Validation perplexity	15.72	# 2
Language Modelling	WikiText-103	kNN-LM w/ Adaptive Coefficient	Test perplexity	15.5	# 9
Language Modelling	WikiText-103	kNN-LM w/ Adaptive Coefficient	Number of params	247M	# 19

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/you-can-t-pick-your-neighbors-or-can-you-when/language-modelling-on-wikitext-103)](https://paperswithcode.com/sota/language-modelling-on-wikitext-103?p=you-can-t-pick-your-neighbors-or-can-you-when)`

You can't pick your neighbors, or can you? When and how to rely on retrieval in the $k$NN-LM

28 Oct 2022 · Andrew Drozdov, Shufan Wang, Razieh Rahimi, Andrew McCallum, Hamed Zamani, Mohit Iyyer

Retrieval-enhanced language models (LMs), which condition their predictions on text retrieved from large external datastores, have recently shown significant perplexity improvements compared to standard LMs. One such approach, the $k$NN-LM, interpolates any existing LM's predictions with the output of a $k$-nearest neighbors model and requires no additional training. In this paper, we explore the importance of lexical and semantic matching in the context of items retrieved by the $k$NN-LM. We find two trends: (1) the presence of large overlapping $n$-grams between the datastore and evaluation set is an important factor in strong performance, even when the datastore is derived from the training data; and (2) the $k$NN-LM is most beneficial when retrieved items have high semantic similarity with the query. Based on our analysis, we define a new formulation of the $k$NN-LM that uses retrieval quality to assign the interpolation coefficient. We empirically measure the effectiveness of our approach on two English language modeling datasets, WikiText-103 and PG-19. Our re-formulation of the $k$NN-LM is beneficial in both cases, and leads to a nearly 4% improvement in perplexity on the WikiText-103 test set.
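The interpolation described in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the softmax over negative distances follows the usual $k$NN-LM setup, while the bucketed mapping from retrieval similarity to the coefficient, the function names, the thresholds, and the toy numbers are all assumptions made here for illustration.

```python
import math


def knn_distribution(neighbors, temperature=1.0):
    """Turn (next_token, distance) pairs into a distribution over tokens.

    Weights are a softmax over negative distances; neighbors that share
    the same next token have their weights summed.
    """
    weights = [math.exp(-dist / temperature) for _, dist in neighbors]
    total = sum(weights)
    p_knn = {}
    for (token, _), weight in zip(neighbors, weights):
        p_knn[token] = p_knn.get(token, 0.0) + weight / total
    return p_knn


def adaptive_coefficient(top_similarity, buckets):
    """Hypothetical rule: pick lambda from the highest bucket reached.

    `buckets` is a list of (similarity_threshold, lambda) pairs sorted by
    ascending threshold. The values are illustrative, not from the paper.
    """
    lam = 0.0
    for threshold, value in buckets:
        if top_similarity >= threshold:
            lam = value
    return lam


def interpolate(p_lm, p_knn, lam):
    """Mix the two distributions: lam * p_kNN + (1 - lam) * p_LM."""
    vocab = set(p_lm) | set(p_knn)
    return {
        tok: lam * p_knn.get(tok, 0.0) + (1.0 - lam) * p_lm.get(tok, 0.0)
        for tok in vocab
    }


# Toy example with made-up numbers.
p_lm = {"cat": 0.5, "dog": 0.3, "bird": 0.2}
neighbors = [("dog", 0.1), ("dog", 0.2), ("cat", 1.5)]
buckets = [(0.0, 0.1), (0.5, 0.3), (0.9, 0.6)]

lam = adaptive_coefficient(top_similarity=0.95, buckets=buckets)
p_final = interpolate(p_lm, knn_distribution(neighbors), lam)
print(lam, p_final)
```

With a fixed coefficient, every query would trust retrieval equally; in this sketch a query whose best neighbor is a poor match falls into a lower bucket and leans more on the base LM.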

Code

iesl/knnlm-retrieval-quality official

Tasks

Language Modelling

Retrieval

Semantic Similarity

Semantic Textual Similarity

Datasets

WikiText-2

WikiText-103

PG-19

Results from the Paper

Ranked #9 on Language Modelling on WikiText-103

Methods

Test
