TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Time Series Classification	EigenWorms	LEM	% Test Accuracy	92.3	# 1
Sequential Image Classification	noise padded CIFAR-10	LEM	% Test Accuracy	60.5	# 3
Sequential Image Classification	Sequential MNIST	LEM	Unpermuted Accuracy	99.5%	# 5
Sequential Image Classification	Sequential MNIST	LEM	Permuted Accuracy	96.6%	# 18

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/long-expressive-memory-for-sequence-modeling-1/time-series-classification-on-eigenworms)](https://paperswithcode.com/sota/time-series-classification-on-eigenworms?p=long-expressive-memory-for-sequence-modeling-1)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/long-expressive-memory-for-sequence-modeling-1/sequential-image-classification-on-noise)](https://paperswithcode.com/sota/sequential-image-classification-on-noise?p=long-expressive-memory-for-sequence-modeling-1)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/long-expressive-memory-for-sequence-modeling-1/sequential-image-classification-on-sequential)](https://paperswithcode.com/sota/sequential-image-classification-on-sequential?p=long-expressive-memory-for-sequence-modeling-1)`

Long Expressive Memory for Sequence Modeling

ICLR 2022 · T. Konstantin Rusch, Siddhartha Mishra, N. Benjamin Erichson, Michael W. Mahoney

We propose a novel method called Long Expressive Memory (LEM) for learning long-term sequential dependencies. LEM is gradient-based, efficiently processes sequential tasks with very long-term dependencies, and is expressive enough to learn complicated input-output maps. To derive LEM, we consider a system of multiscale ordinary differential equations, as well as a suitable time-discretization of this system. For LEM, we derive rigorous bounds showing that it mitigates the exploding and vanishing gradients problem, a well-known challenge for gradient-based recurrent sequential learning methods. We also prove that LEM can approximate a large class of dynamical systems to high accuracy. Our empirical results, ranging from image and time-series classification through dynamical systems prediction to speech recognition and language modeling, demonstrate that LEM outperforms state-of-the-art recurrent neural networks, gated recurrent units, and long short-term memory models.
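The abstract does not spell out the update equations, so the following is only a minimal sketch of what a time-discretized, two-time-scale gated recurrence of this kind could look like. The exact update form, the names `init_params`, `lem_step`, and `run_lem`, and the random initialization are illustrative assumptions, not the authors' implementation; the paper and the official tk-rusch/lem repository give the actual definition.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def affine(W, state, V, u, b):
    """Return W @ state + V @ u + b using plain Python lists."""
    return [
        sum(w * s for w, s in zip(W_row, state))
        + sum(v * x for v, x in zip(V_row, u))
        + b_i
        for W_row, V_row, b_i in zip(W, V, b)
    ]


def init_params(hidden, inputs, seed=0):
    """Hypothetical initialization: small uniform weights, zero biases."""
    rng = random.Random(seed)

    def mat(rows, cols):
        return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]

    # One (W, V, b) triple each for the two learned time steps and the two states.
    return {
        name: (mat(hidden, hidden), mat(hidden, inputs), [0.0] * hidden)
        for name in ("dt1", "dt2", "z", "y")
    }


def lem_step(params, y, z, u, dt=1.0):
    """One assumed LEM-style update of the state pair (y, z) for input u."""

    def pre(name, state):
        W, V, b = params[name]
        return affine(W, state, V, u, b)

    # Input- and state-dependent time steps, each in (0, dt): the "multiscale" part.
    dt1 = [dt * sigmoid(a) for a in pre("dt1", y)]
    dt2 = [dt * sigmoid(a) for a in pre("dt2", y)]
    # Each state moves toward a tanh target at its own learned rate.
    z_new = [(1 - d) * zi + d * math.tanh(a) for d, zi, a in zip(dt1, z, pre("z", y))]
    y_new = [(1 - d) * yi + d * math.tanh(a) for d, yi, a in zip(dt2, y, pre("y", z_new))]
    return y_new, z_new


def run_lem(params, sequence, hidden, dt=1.0):
    """Feed a sequence of input vectors through the cell, starting from zero states."""
    y, z = [0.0] * hidden, [0.0] * hidden
    for u in sequence:
        y, z = lem_step(params, y, z, u, dt)
    return y


params = init_params(hidden=4, inputs=1)
sequence = [[math.sin(0.1 * t)] for t in range(200)]
final_state = run_lem(params, sequence, hidden=4)
```

In this sketch, with `dt` at most 1, every update is a convex combination of the previous state and a `tanh` output, so the hidden states stay within [-1, 1] regardless of sequence length. That boundedness is a property of the sketch and is not a restatement of the paper's gradient bounds.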

Code

tk-rusch/lem official

Tasks

Language Modelling

Sequential Image Classification

Speech Recognition

Time Series

Time Series Analysis

Time Series Classification

Datasets

CIFAR-10

MNIST

Penn Treebank

EigenWorms

Results from the Paper

Ranked #1 on Time Series Classification on EigenWorms
