Dynamic Evaluation of Neural Sequence Models

We present a methodology for using dynamic evaluation to improve neural sequence models. Models are adapted to recent history via a gradient-descent-based mechanism, causing them to assign higher probabilities to re-occurring sequential patterns. In our comparisons, dynamic evaluation outperforms existing adaptation approaches. It improves the state-of-the-art word-level perplexities on the Penn Treebank and WikiText-2 datasets to 51.1 and 44.3 respectively, and the state-of-the-art character-level cross-entropies on the text8 and Hutter Prize datasets to 1.19 bits/char and 1.08 bits/char respectively.
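To make the mechanism concrete, the sketch below shows the simple gradient-descent variant of dynamic evaluation described in the abstract: the test stream is processed in consecutive segments, each segment is scored with the current weights, and a gradient step on that segment then adapts the weights before the next segment is scored. This is a minimal illustration, not the paper's full update rule (which additionally uses RMS gradient scaling and decay toward the original parameters); the names `model` (a recurrent language model returning logits and a hidden state) and `segments` (an ordered iterable of input/target pairs) are hypothetical placeholders.

import math
import torch
import torch.nn.functional as F

def dynamic_eval(model, segments, lr=1e-4):
    """Score a test stream segment by segment, adapting the model weights
    to each segment after it has been scored (plain-SGD dynamic evaluation)."""
    optimizer = torch.optim.SGD(model.parameters(), lr=lr)
    total_nll, total_tokens = 0.0, 0
    hidden = None
    for inputs, targets in segments:  # ordered slices of the test stream
        logits, hidden = model(inputs, hidden)
        loss = F.cross_entropy(logits.view(-1, logits.size(-1)), targets.view(-1))

        # Score the segment *before* adapting, so evaluation stays honest.
        total_nll += loss.item() * targets.numel()
        total_tokens += targets.numel()

        # Then take one gradient step on the segment just seen, so that
        # re-occurring patterns receive higher probability later on.
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()

        # Detach the recurrent state so gradients do not flow across segments.
        hidden = tuple(h.detach() for h in hidden) if isinstance(hidden, tuple) else hidden.detach()
    return math.exp(total_nll / total_tokens)  # perplexity over the full stream

Because adaptation only ever uses segments that have already been scored, the reported perplexity remains a valid evaluation of the model's predictions on unseen text.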


Results from the Paper


Language Modelling on Hutter Prize (mLSTM + dynamic eval)
  Bit per Character (BPC): 1.08 (global rank #10)
  Number of params: 46M (global rank #10)

Language Modelling on Penn Treebank, Word Level (AWD-LSTM + dynamic eval)
  Validation perplexity: 51.6 (global rank #11)
  Test perplexity: 51.1 (global rank #14)
  Number of params: 24M (global rank #7)

Language Modelling on Text8 (mLSTM + dynamic eval)
  Bit per Character (BPC): 1.19 (global rank #15)
  Number of params: 45M (global rank #8)

Language Modelling on WikiText-2 (AWD-LSTM + dynamic eval)
  Validation perplexity: 46.4 (global rank #10)
  Test perplexity: 44.3 (global rank #18)
  Number of params: 33M (global rank #23)
