TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Sentiment Analysis	MR	GRU-RNN-WORD2VEC	Accuracy	78.26	# 11
Sentiment Analysis	SST-5 Fine-grained classification	GRU-RNN-WORD2VEC	Accuracy	45.02	# 25
Subjectivity Analysis	SUBJ	GRU-RNN-GLOVE	Accuracy	91.85	# 15
Text Classification	TREC-6	GRU-RNN-GLOVE	Error	7.0	# 13

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/all-but-the-top-simple-and-effective/sentiment-analysis-on-mr)](https://paperswithcode.com/sota/sentiment-analysis-on-mr?p=all-but-the-top-simple-and-effective)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/all-but-the-top-simple-and-effective/text-classification-on-trec-6)](https://paperswithcode.com/sota/text-classification-on-trec-6?p=all-but-the-top-simple-and-effective)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/all-but-the-top-simple-and-effective/subjectivity-analysis-on-subj)](https://paperswithcode.com/sota/subjectivity-analysis-on-subj?p=all-but-the-top-simple-and-effective)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/all-but-the-top-simple-and-effective/sentiment-analysis-on-sst-5-fine-grained)](https://paperswithcode.com/sota/sentiment-analysis-on-sst-5-fine-grained?p=all-but-the-top-simple-and-effective)`

All-but-the-Top: Simple and Effective Postprocessing for Word Representations

ICLR 2018 · Jiaqi Mu, Suma Bhat, Pramod Viswanath ·

Real-valued word representations have transformed NLP applications; popular examples are word2vec and GloVe, recognized for their ability to capture linguistic regularities. In this paper, we demonstrate a {\em very simple}, and yet counter-intuitive, postprocessing technique -- eliminate the common mean vector and a few top dominating directions from the word vectors -- that renders off-the-shelf representations {\em even stronger}. The postprocessing is empirically validated on a variety of lexical-level intrinsic tasks (word similarity, concept categorization, word analogy) and sentence-level tasks (semantic textural similarity and { text classification}) on multiple datasets and with a variety of representation methods and hyperparameter choices in multiple languages; in each case, the processed representations are consistently better than the original ones.

PDF Abstract ICLR 2018 PDF ICLR 2018 Abstract

Code

Add Remove Mark official

lgalke/vec4ir

226

nlpAThits/WOMBAT

s1998/All-but-the-top

woctezuma/steam-descriptions

↳ Quickstart in

Colab

Tasks

Add Remove

General Classification

Sentence

Sentiment Analysis

Subjectivity Analysis

Text Classification

Word Similarity

Datasets

SST

IMDb Movie Reviews

SICK SST-5

MR SUBJ

Results from the Paper

Edit

Ranked #11 on Sentiment Analysis on MR

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Sentiment Analysis	MR	GRU-RNN-WORD2VEC	Accuracy	78.26	# 11	Compare
Sentiment Analysis	SST-5 Fine-grained classification	GRU-RNN-WORD2VEC	Accuracy	45.02	# 25	Compare
Subjectivity Analysis	SUBJ	GRU-RNN-GLOVE	Accuracy	91.85	# 15	Compare
Text Classification	TREC-6	GRU-RNN-GLOVE	Error	7.0	# 13	Compare

Methods

Add Remove

GloVe

Edit Social Preview

All-but-the-Top: Simple and Effective Postprocessing for Word Representations

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove