A Simple Way to Initialize Recurrent Networks of Rectified Linear Units

3 Apr 2015 · Quoc V. Le, Navdeep Jaitly, Geoffrey E. Hinton

Learning long-term dependencies in recurrent networks is difficult due to vanishing and exploding gradients. To overcome this difficulty, researchers have developed sophisticated optimization techniques and network architectures. In this paper, we propose a simpler solution that uses recurrent neural networks composed of rectified linear units. Key to our solution is the use of the identity matrix, or a scaled version of it, to initialize the recurrent weight matrix. We find that our solution is comparable to LSTM on our four benchmarks: two toy problems involving long-range temporal structure, a large language modeling problem, and a benchmark speech recognition problem.
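The initialization the abstract describes is easy to reproduce in any framework. Below is a minimal sketch in PyTorch (an assumption; the paper does not tie the method to a particular library): a vanilla RNN with ReLU units whose recurrent weight matrix starts as the identity and whose biases start at zero. The hidden size, scale factor, and the pixel-by-pixel MNIST input shape are illustrative choices, not values prescribed by the paper.

```python
import torch
import torch.nn as nn

hidden_size = 100

# Vanilla RNN with ReLU nonlinearity (the "iRNN" recipe).
rnn = nn.RNN(input_size=1, hidden_size=hidden_size,
             nonlinearity='relu', batch_first=True)

with torch.no_grad():
    # Recurrent weights: identity matrix. The paper also allows a scaled
    # identity (e.g. multiply by a small constant) for some tasks.
    rnn.weight_hh_l0.copy_(torch.eye(hidden_size))
    # Biases: zero.
    rnn.bias_ih_l0.zero_()
    rnn.bias_hh_l0.zero_()

# Usage example: sequences of 784 scalar inputs, as in pixel-by-pixel MNIST.
x = torch.randn(32, 784, 1)   # (batch, time, features)
output, h_n = rnn(x)
```

At initialization, a hidden state passed through the identity recurrence with zero bias and ReLU activation is simply copied forward, which is what lets gradients propagate over long time spans before training reshapes the weights.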

| Task | Dataset | Model | Metric | Value | Global Rank |
|---|---|---|---|---|---|
| Sequential Image Classification | Sequential MNIST | iRNN | Unpermuted Accuracy | 97% | #21 |
| Sequential Image Classification | Sequential MNIST | iRNN | Permuted Accuracy | 82% | #27 |
