TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Paraphrase Identification	Quora Question Pairs	Bi-CAS-LSTM	Accuracy	88.6	# 14
Natural Language Inference	SNLI	300D 2-layer Bi-CAS-LSTM	% Test Accuracy	87	# 47
Sentiment Analysis	SST-2 Binary classification	Bi-CAS-LSTM	Accuracy	91.3	# 54
Sentiment Analysis	SST-5 Fine-grained classification	Bi-CAS-LSTM	Accuracy	53.6	# 10

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/cell-aware-stacked-lstms-for-modeling/sentiment-analysis-on-sst-5-fine-grained)](https://paperswithcode.com/sota/sentiment-analysis-on-sst-5-fine-grained?p=cell-aware-stacked-lstms-for-modeling)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/cell-aware-stacked-lstms-for-modeling/paraphrase-identification-on-quora-question)](https://paperswithcode.com/sota/paraphrase-identification-on-quora-question?p=cell-aware-stacked-lstms-for-modeling)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/cell-aware-stacked-lstms-for-modeling/natural-language-inference-on-snli)](https://paperswithcode.com/sota/natural-language-inference-on-snli?p=cell-aware-stacked-lstms-for-modeling)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/cell-aware-stacked-lstms-for-modeling/sentiment-analysis-on-sst-2-binary)](https://paperswithcode.com/sota/sentiment-analysis-on-sst-2-binary?p=cell-aware-stacked-lstms-for-modeling)`

Cell-aware Stacked LSTMs for Modeling Sentences

7 Sep 2018 · Jihun Choi, Taeuk Kim, Sang-goo Lee ·

We propose a method of stacking multiple long short-term memory (LSTM) layers for modeling sentences. In contrast to the conventional stacked LSTMs where only hidden states are fed as input to the next layer, the suggested architecture accepts both hidden and memory cell states of the preceding layer and fuses information from the left and the lower context using the soft gating mechanism of LSTMs. Thus the architecture modulates the amount of information to be delivered not only in horizontal recurrence but also in vertical connections, from which useful features extracted from lower layers are effectively conveyed to upper layers. We dub this architecture Cell-aware Stacked LSTM (CAS-LSTM) and show from experiments that our models bring significant performance gain over the standard LSTMs on benchmark datasets for natural language inference, paraphrase detection, sentiment classification, and machine translation. We also conduct extensive qualitative analysis to understand the internal behavior of the suggested approach.

PDF Abstract

Code

Add Remove Mark official

No code implementations yet. Submit your code now

Tasks

Add Remove

Machine Translation

Natural Language Inference

Paraphrase Identification

Sentiment Analysis

Sentiment Classification

Translation

Datasets

SST

MultiNLI SST-2

SNLI SST-5

Quora

Quora Question Pairs

Results from the Paper

Edit

Ranked #10 on Sentiment Analysis on SST-5 Fine-grained classification

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Paraphrase Identification	Quora Question Pairs	Bi-CAS-LSTM	Accuracy	88.6	# 14	Compare
Natural Language Inference	SNLI	300D 2-layer Bi-CAS-LSTM	% Test Accuracy	87	# 47	Compare
Sentiment Analysis	SST-2 Binary classification	Bi-CAS-LSTM	Accuracy	91.3	# 54	Compare
Sentiment Analysis	SST-5 Fine-grained classification	Bi-CAS-LSTM	Accuracy	53.6	# 10	Compare

Methods

Add Remove

LSTM • Sigmoid Activation • Tanh Activation

Edit Social Preview

Cell-aware Stacked LSTMs for Modeling Sentences

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove