TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Machine Translation	IWSLT2015 German-English	Transformer with FRAGE	BLEU score	33.97	# 3
Language Modelling	Penn Treebank (Word Level)	FRAGE + AWD-LSTM-MoS + dynamic eval	Validation perplexity	47.38	# 5
Language Modelling	Penn Treebank (Word Level)	FRAGE + AWD-LSTM-MoS + dynamic eval	Test perplexity	46.54	# 7
Language Modelling	Penn Treebank (Word Level)	FRAGE + AWD-LSTM-MoS + dynamic eval	Params	22M	# 23
Language Modelling	WikiText-2	FRAGE + AWD-LSTM-MoS + dynamic eval	Validation perplexity	40.85	# 5
Language Modelling	WikiText-2	FRAGE + AWD-LSTM-MoS + dynamic eval	Test perplexity	39.14	# 13
Language Modelling	WikiText-2	FRAGE + AWD-LSTM-MoS + dynamic eval	Number of params	35M	# 12
Machine Translation	WMT2014 English-German	Transformer Big with FRAGE	BLEU score	29.11	# 32
Machine Translation	WMT2014 English-German	Transformer Big with FRAGE	Hardware Burden	None	# 1
Machine Translation	WMT2014 English-German	Transformer Big with FRAGE	Operations per network pass	None	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/frage-frequency-agnostic-word-representation/machine-translation-on-iwslt2015-german)](https://paperswithcode.com/sota/machine-translation-on-iwslt2015-german?p=frage-frequency-agnostic-word-representation)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/frage-frequency-agnostic-word-representation/language-modelling-on-penn-treebank-word)](https://paperswithcode.com/sota/language-modelling-on-penn-treebank-word?p=frage-frequency-agnostic-word-representation)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/frage-frequency-agnostic-word-representation/language-modelling-on-wikitext-2)](https://paperswithcode.com/sota/language-modelling-on-wikitext-2?p=frage-frequency-agnostic-word-representation)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/frage-frequency-agnostic-word-representation/machine-translation-on-wmt2014-english-german)](https://paperswithcode.com/sota/machine-translation-on-wmt2014-english-german?p=frage-frequency-agnostic-word-representation)`

FRAGE: Frequency-Agnostic Word Representation

NeurIPS 2018 · Chengyue Gong, Di He, Xu Tan, Tao Qin, Li-Wei Wang, Tie-Yan Liu ·

Continuous word representation (aka word embedding) is a basic building block in many neural network-based models used in natural language processing tasks. Although it is widely accepted that words with similar semantics should be close to each other in the embedding space, we find that word embeddings learned in several tasks are biased towards word frequency: the embeddings of high-frequency and low-frequency words lie in different subregions of the embedding space, and the embedding of a rare word and a popular word can be far from each other even if they are semantically similar. This makes learned word embeddings ineffective, especially for rare words, and consequently limits the performance of these neural network models. In this paper, we develop a neat, simple yet effective way to learn \emph{FRequency-AGnostic word Embedding} (FRAGE) using adversarial training. We conducted comprehensive studies on ten datasets across four natural language processing tasks, including word similarity, language modeling, machine translation and text classification. Results show that with FRAGE, we achieve higher performance than the baselines in all tasks.

PDF Abstract NeurIPS 2018 PDF NeurIPS 2018 Abstract

Code

Add Remove Mark official

ChengyueGongR/FrequencyAgnostic official

118

JakubStefko/w2vf

Tasks

Add Remove

Language Modelling

Machine Translation

text-classification

Text Classification

Translation

Word Embeddings

Word Similarity

Datasets

IMDb Movie Reviews

Penn Treebank

WikiText-2

WMT 2014

Results from the Paper

Edit

Ranked #3 on Machine Translation on IWSLT2015 German-English

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Machine Translation	IWSLT2015 German-English	Transformer with FRAGE	BLEU score	33.97	# 3	Compare
Language Modelling	Penn Treebank (Word Level)	FRAGE + AWD-LSTM-MoS + dynamic eval	Validation perplexity	47.38	# 5	Compare
			Test perplexity	46.54	# 7	Compare
			Params	22M	# 23	Compare
Language Modelling	WikiText-2	FRAGE + AWD-LSTM-MoS + dynamic eval	Validation perplexity	40.85	# 5	Compare
			Test perplexity	39.14	# 13	Compare
			Number of params	35M	# 12	Compare
Machine Translation	WMT2014 English-German	Transformer Big with FRAGE	BLEU score	29.11	# 32	Compare
			Hardware Burden	None	# 1	Compare
			Operations per network pass	None	# 1	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

FRAGE: Frequency-Agnostic Word Representation

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove