Star-Transformer

Although the Transformer has achieved great success on many NLP tasks, its heavy structure with fully-connected attention leads to a dependence on large training data. In this paper, we present Star-Transformer, a lightweight alternative obtained by careful sparsification. To reduce model complexity, we replace the fully-connected structure with a star-shaped topology, in which every two non-adjacent nodes are connected through a shared relay node. The complexity is thus reduced from quadratic to linear, while preserving the capacity to capture both local composition and long-range dependencies. Experiments on four tasks (22 datasets) show that Star-Transformer achieves significant improvements over the standard Transformer on modestly sized datasets.
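The core idea is easy to sketch: each satellite (token) node attends only to its ring neighbours, its own embedding, and the shared relay node, while the relay node attends to all satellites, so one update cycle uses O(n) attention links rather than O(n^2). Below is a minimal, single-head sketch of such an update cycle in PyTorch. It is not the authors' implementation; the function names (`attention`, `star_update`), the circular neighbourhood handling, and the initialisation are illustrative assumptions, and the paper additionally uses multi-head attention and layer normalisation.

```python
# Minimal sketch of one Star-Transformer-style update cycle (illustrative only).
import torch
import torch.nn.functional as F


def attention(query, keys, values):
    # Single-head scaled dot-product attention for one query vector.
    # query: (d,), keys/values: (m, d) -> weighted sum over the m candidates.
    scores = keys @ query / keys.shape[-1] ** 0.5   # (m,)
    weights = F.softmax(scores, dim=-1)             # (m,)
    return weights @ values                         # (d,)


def star_update(h, s, e):
    """One update of satellite states h (n, d) and relay state s (d,)
    given the token embeddings e (n, d)."""
    n, _ = h.shape
    new_h = torch.empty_like(h)
    for i in range(n):
        # Each satellite sees only a constant-size context: its ring
        # neighbours, itself, its token embedding, and the relay node,
        # so the whole pass costs O(n) instead of O(n^2).
        context = torch.stack([
            h[(i - 1) % n], h[i], h[(i + 1) % n],  # local ring (circular here)
            e[i],                                  # token embedding
            s,                                     # shared relay node
        ])
        new_h[i] = attention(h[i], context, context)
    # The relay attends to every satellite (and itself), so any two
    # non-adjacent tokens are connected by a two-hop path through it.
    relay_context = torch.cat([new_h, s.unsqueeze(0)], dim=0)
    new_s = attention(s, relay_context, relay_context)
    return new_h, new_s


# Usage: n tokens with d-dimensional embeddings; run a few cycles.
n, d = 6, 16
e = torch.randn(n, d)
h, s = e.clone(), e.mean(dim=0)   # assumed initialisation for the sketch
for _ in range(3):
    h, s = star_update(h, s, e)
```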

NAACL 2019
Task | Dataset | Model | Metric | Value | Global Rank
Natural Language Inference | SNLI | Star-Transformer (no cross sentence attention) | Test Accuracy (%) | 86.0 | #62
Sentiment Analysis | SST-5 (fine-grained classification) | Star-Transformer | Accuracy | 53.0 | #13
