Direct Output Connection for a High-Rank Language Model

EMNLP 2018 · Sho Takase, Jun Suzuki, Masaaki Nagata

This paper proposes a state-of-the-art recurrent neural network (RNN) language model that combines probability distributions computed not only from the final RNN layer but also from middle layers. The proposed method raises the expressive power of a language model based on the matrix factorization interpretation of language modeling introduced by Yang et al. (2018). It improves on the current state-of-the-art language model and achieves the best scores on the Penn Treebank and WikiText-2, the standard benchmark datasets. Moreover, we show that the proposed method also contributes to two application tasks: machine translation and headline generation. Our code is publicly available at: https://github.com/nttcslab-nlp/doc_lm.
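The core idea, a direct output connection (DOC), is to compute a word distribution from the hidden state of each RNN layer (not only the final one) and combine these distributions with learned mixture weights, generalizing the mixture-of-softmaxes view of Yang et al. (2018). The sketch below is a minimal PyTorch illustration of that mixing step, not the authors' released implementation; the class name, the choice of predicting the mixture weights from the final hidden state, and all hyperparameters are assumptions made for illustration.

```python
import torch
import torch.nn as nn


class DirectOutputConnectionSketch(nn.Module):
    """Illustrative DOC-style output layer: word distributions are computed
    from the hidden states of several RNN layers and combined with learned
    mixture weights. Names and design details are assumptions for
    illustration, not the authors' released code."""

    def __init__(self, hidden_size: int, vocab_size: int, num_layers: int,
                 softmaxes_per_layer: int = 1):
        super().__init__()
        self.softmaxes_per_layer = softmaxes_per_layer
        self.num_dists = num_layers * softmaxes_per_layer
        # One projection onto the vocabulary per component distribution.
        self.projections = nn.ModuleList(
            [nn.Linear(hidden_size, vocab_size) for _ in range(self.num_dists)]
        )
        # Mixture weights predicted from the final layer's hidden state.
        self.prior = nn.Linear(hidden_size, self.num_dists)

    def forward(self, hidden_states):
        # hidden_states: list of (batch, hidden_size) tensors, one per RNN
        # layer, ordered from the first middle layer to the final layer.
        pi = torch.softmax(self.prior(hidden_states[-1]), dim=-1)  # (batch, K)

        components = []
        for layer_idx, h in enumerate(hidden_states):
            for k in range(self.softmaxes_per_layer):
                proj = self.projections[layer_idx * self.softmaxes_per_layer + k]
                components.append(torch.softmax(proj(h), dim=-1))
        components = torch.stack(components, dim=1)  # (batch, K, vocab)

        # Weighted sum of component distributions -> final word distribution.
        return torch.einsum("bk,bkv->bv", pi, components)


# Toy usage: three RNN layers, one softmax per layer.
if __name__ == "__main__":
    batch, hidden, vocab = 4, 32, 100
    layer_states = [torch.randn(batch, hidden) for _ in range(3)]
    doc = DirectOutputConnectionSketch(hidden, vocab, num_layers=3)
    probs = doc(layer_states)
    print(probs.shape, probs.sum(dim=-1))  # (4, 100); each row sums to ~1
```

In the full model reported below, this output layer is built on top of an AWD-LSTM language model (hence the AWD-LSTM-DOC entries in the results); see the linked repository for the complete training setup.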

| Task | Dataset | Model | Metric | Value | Global Rank |
|---|---|---|---|---|---|
| Constituency Parsing | Penn Treebank | LSTM Encoder-Decoder + LSTM-LM | F1 score | 94.47 | #15 |
| Language Modelling | Penn Treebank (Word Level) | AWD-LSTM-DOC x5 | Validation perplexity | 48.63 | #8 |
| | | | Test perplexity | 47.17 | #8 |
| | | | Params | 185M | #4 |
| Language Modelling | Penn Treebank (Word Level) | AWD-LSTM-DOC | Validation perplexity | 54.12 | #14 |
| | | | Test perplexity | 52.38 | #16 |
| | | | Params | 23M | #19 |
| Language Modelling | WikiText-2 | AWD-LSTM-DOC | Validation perplexity | 60.29 | #17 |
| | | | Test perplexity | 58.03 | #24 |
| | | | Number of params | 37M | #9 |
| Language Modelling | WikiText-2 | AWD-LSTM-DOC x5 | Validation perplexity | 54.19 | #12 |
| | | | Test perplexity | 53.09 | #20 |
| | | | Number of params | 185M | #6 |

Methods


No methods listed for this paper.