Recurrent Neural Networks

Long Short-Term Memory

An LSTM (Long Short-Term Memory network) is a type of recurrent neural network that mitigates the vanishing gradient problem of vanilla RNNs by adding a memory cell regulated by input, forget, and output gates. Intuitively, the cell state is updated additively rather than through repeated matrix multiplications: the forget gate decides how much of the previous cell state to keep, and the input gate decides how much new information to write, so gradients can flow backward through many time steps without vanishing as quickly.
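The gating described above can be sketched as a single LSTM step in NumPy. This is a minimal illustration, not any particular library's implementation; the parameter layout (the four gate blocks stacked in `W`, `U`, `b`) is an assumption made for compactness.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_step(x, h_prev, c_prev, W, U, b):
    """One LSTM time step.

    W (4H x D), U (4H x H), and b (4H,) hold the parameters of the four
    gates stacked in the order [input, forget, cell-candidate, output].
    (Hypothetical layout chosen for this sketch.)
    """
    H = h_prev.shape[0]
    z = W @ x + U @ h_prev + b       # all four pre-activations at once
    i = sigmoid(z[0:H])              # input gate: how much new info to write
    f = sigmoid(z[H:2*H])            # forget gate: how much old state to keep
    g = np.tanh(z[2*H:3*H])          # candidate cell update
    o = sigmoid(z[3*H:4*H])          # output gate: how much state to expose
    c = f * c_prev + i * g           # additive cell-state update (key to
                                     # gradient flow: no repeated squashing)
    h = o * np.tanh(c)               # new hidden state
    return h, c

# Usage: one step with random parameters.
rng = np.random.default_rng(0)
D, H = 3, 4
W = rng.standard_normal((4 * H, D))
U = rng.standard_normal((4 * H, H))
b = np.zeros(4 * H)
h, c = lstm_step(rng.standard_normal(D), np.zeros(H), np.zeros(H), W, U, b)
```

Note that the cell update `c = f * c_prev + i * g` is the additive path: when the forget gate saturates near 1, the gradient with respect to `c_prev` passes through almost unchanged.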


Introduced by Hochreiter and Schmidhuber (1997).

Task                      Papers   Share
Time Series                   53   8.23%
Language Modelling            41   6.37%
Sentiment Analysis            20   3.11%
Machine Translation           18   2.80%
Text Generation               17   2.64%
Time Series Forecasting       16   2.48%
Text Classification           15   2.33%
Question Answering            12   1.86%
Speech Recognition            12   1.86%