Recurrent Batch Normalization

30 Mar 2016  ·  Tim Cooijmans, Nicolas Ballas, César Laurent, Çağlar Gülçehre, Aaron Courville ·

We propose a reparameterization of LSTM that brings the benefits of batch normalization to recurrent neural networks. Whereas previous works only apply batch normalization to the input-to-hidden transformation of RNNs, we demonstrate that it is both possible and beneficial to batch-normalize the hidden-to-hidden transition, thereby reducing internal covariate shift between time steps. We evaluate our proposal on various sequential problems such as sequence classification, language modeling and question answering. Our empirical results show that our batch-normalized LSTM consistently leads to faster convergence and improved generalization.
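The reparameterization described in the abstract amounts to normalizing the input-to-hidden and hidden-to-hidden pre-activations separately before summing them, and additionally normalizing the cell state before the output nonlinearity. The sketch below illustrates one recurrent step of such a batch-normalized LSTM cell. It assumes PyTorch; the class name `BNLSTMCell`, the sizes, and the use of a single shared set of batch-norm statistics (rather than the per-time-step statistics discussed in the paper) are illustrative simplifications, not the authors' reference implementation.

```python
# Minimal sketch of one batch-normalized LSTM step, assuming PyTorch.
# Names and hyperparameters here are illustrative; per-time-step BN
# statistics and the training loop are omitted for brevity.
import torch
import torch.nn as nn


class BNLSTMCell(nn.Module):
    def __init__(self, input_size, hidden_size):
        super().__init__()
        self.hidden_size = hidden_size
        # Separate affine maps for the input-to-hidden and hidden-to-hidden paths.
        self.W_x = nn.Linear(input_size, 4 * hidden_size, bias=False)
        self.W_h = nn.Linear(hidden_size, 4 * hidden_size, bias=False)
        self.bias = nn.Parameter(torch.zeros(4 * hidden_size))
        # Batch norm applied to each pre-activation stream and to the cell state.
        self.bn_x = nn.BatchNorm1d(4 * hidden_size)
        self.bn_h = nn.BatchNorm1d(4 * hidden_size)
        self.bn_c = nn.BatchNorm1d(hidden_size)
        # The paper recommends a small initial BN scale (gamma around 0.1)
        # so the tanh/sigmoid nonlinearities do not saturate early in training.
        for bn in (self.bn_x, self.bn_h, self.bn_c):
            nn.init.constant_(bn.weight, 0.1)

    def forward(self, x_t, state):
        h_prev, c_prev = state
        # Normalize the two pre-activation streams independently, then sum.
        gates = self.bn_x(self.W_x(x_t)) + self.bn_h(self.W_h(h_prev)) + self.bias
        i, f, g, o = gates.chunk(4, dim=1)
        c_t = torch.sigmoid(f) * c_prev + torch.sigmoid(i) * torch.tanh(g)
        # The cell state is also normalized before the output nonlinearity.
        h_t = torch.sigmoid(o) * torch.tanh(self.bn_c(c_t))
        return h_t, c_t


# Usage: unroll the cell over a (time, batch, feature) sequence.
if __name__ == "__main__":
    cell = BNLSTMCell(input_size=32, hidden_size=64)
    x = torch.randn(10, 8, 32)  # 10 time steps, batch of 8
    h = torch.zeros(8, 64)
    c = torch.zeros(8, 64)
    for t in range(x.size(0)):
        h, c = cell(x[t], (h, c))
    print(h.shape)  # torch.Size([8, 64])
```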


Results from the Paper


| Task | Dataset | Model | Metric Name | Metric Value | Global Rank |
|------|---------|-------|-------------|--------------|-------------|
| Sequential Image Classification | Sequential MNIST | BN LSTM | Unpermuted Accuracy | 99% | #16 |
| Sequential Image Classification | Sequential MNIST | BN LSTM | Permuted Accuracy | 95.4% | #22 |
| Language Modelling | Text8 | BN LSTM | Bit per Character (BPC) | 1.36 | #20 |
| Language Modelling | Text8 | BN LSTM | Number of params | 16M | #17 |
