TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Sequential Image Classification	Sequential CIFAR-10	LSSL	Unpermuted Accuracy	84.65%	# 5
Sequential Image Classification	Sequential MNIST	LSSL	Unpermuted Accuracy	99.53%	# 4
Sequential Image Classification	Sequential MNIST	LSSL	Permuted Accuracy	98.76%	# 2

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/combining-recurrent-convolutional-and/sequential-image-classification-on-sequential)](https://paperswithcode.com/sota/sequential-image-classification-on-sequential?p=combining-recurrent-convolutional-and)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/combining-recurrent-convolutional-and/sequential-image-classification-on-sequential-1)](https://paperswithcode.com/sota/sequential-image-classification-on-sequential-1?p=combining-recurrent-convolutional-and)`

Combining Recurrent, Convolutional, and Continuous-time Models with Linear State-Space Layers

NeurIPS 2021 · Albert Gu, Isys Johnson, Karan Goel, Khaled Saab, Tri Dao, Atri Rudra, Christopher Ré ·

Recurrent neural networks (RNNs), temporal convolutions, and neural differential equations (NDEs) are popular families of deep learning models for time-series data, each with unique strengths and tradeoffs in modeling power and computational efficiency. We introduce a simple sequence model inspired by control systems that generalizes these approaches while addressing their shortcomings. The Linear State-Space Layer (LSSL) maps a sequence $u \mapsto y$ by simply simulating a linear continuous-time state-space representation $\dot{x} = Ax + Bu, y = Cx + Du$. Theoretically, we show that LSSL models are closely related to the three aforementioned families of models and inherit their strengths. For example, they generalize convolutions to continuous-time, explain common RNN heuristics, and share features of NDEs such as time-scale adaptation. We then incorporate and generalize recent theory on continuous-time memorization to introduce a trainable subset of structured matrices $A$ that endow LSSLs with long-range memory. Empirically, stacking LSSL layers into a simple deep neural network obtains state-of-the-art results across time series benchmarks for long dependencies in sequential image classification, real-world healthcare regression tasks, and speech. On a difficult speech classification task with length-16000 sequences, LSSL outperforms prior approaches by 24 accuracy points, and even outperforms baselines that use hand-crafted features on 100x shorter sequences.

PDF Abstract NeurIPS 2021 PDF NeurIPS 2021 Abstract

Code

Add Remove Mark official

hazyresearch/state-spaces official

2,089

ag1988/dss

Tasks

Add Remove

Computational Efficiency

Image Classification

Memorization

Sequential Image Classification

Time Series

Time Series Analysis

Datasets

MNIST

CelebA

Results from the Paper

Edit

Ranked #2 on Sequential Image Classification on Sequential MNIST

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Sequential Image Classification	Sequential CIFAR-10	LSSL	Unpermuted Accuracy	84.65%	# 5	Compare
Sequential Image Classification	Sequential MNIST	LSSL	Unpermuted Accuracy	99.53%	# 4	Compare
Sequential Image Classification	Sequential MNIST	LSSL	Permuted Accuracy	98.76%	# 2	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Combining Recurrent, Convolutional, and Continuous-time Models with Linear State-Space Layers

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove