TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Word Alignment	en-es	Adv - Refine - CSLS	P@1	81.7	# 2
Word Alignment	en-fr	Adv - Refine - CSLS	P@1	82.3	# 2
Word Alignment	es-en	Adv - Refine - CSLS	P@1	83.3	# 2
Word Alignment	fr-en	Adv - Refine - CSLS	P@1	82.1	# 2
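
The P@1 column in the table above is precision at 1: the percentage of source words whose top-ranked candidate translation appears among the reference translations. A minimal sketch of that computation follows. The function name, the dictionary layout, and the toy English-Spanish entries are assumptions made for illustration; they are not taken from the paper or from its evaluation code.

```python
def precision_at_1(predictions, gold):
    """Percentage of source words whose top-ranked candidate is a reference translation.

    predictions: source word -> ranked list of candidate translations (best first)
    gold:        source word -> set of acceptable reference translations
    """
    if not gold:
        raise ValueError("gold dictionary is empty")
    hits = 0
    for src, refs in gold.items():
        ranked = predictions.get(src, [])
        # Only the first candidate counts for P@1.
        if ranked and ranked[0] in refs:
            hits += 1
    return 100.0 * hits / len(gold)


# Toy data (hypothetical): "dog" is wrong at rank 1, the other three are right.
gold = {
    "cat": {"gato"},
    "dog": {"perro"},
    "car": {"coche", "carro"},
    "house": {"casa"},
}
predictions = {
    "cat": ["gato", "gata"],
    "dog": ["perra", "perro"],
    "car": ["carro", "coche"],
    "house": ["casa"],
}
score = precision_at_1(predictions, gold)
print(score)  # 75.0
```

A source word with several reference translations counts as correct if the top candidate matches any one of them, which is why "car" scores a hit here.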

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/word-translation-without-parallel-data/word-alignment-on-en-es)](https://paperswithcode.com/sota/word-alignment-on-en-es?p=word-translation-without-parallel-data)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/word-translation-without-parallel-data/word-alignment-on-en-fr)](https://paperswithcode.com/sota/word-alignment-on-en-fr?p=word-translation-without-parallel-data)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/word-translation-without-parallel-data/word-alignment-on-es-en)](https://paperswithcode.com/sota/word-alignment-on-es-en?p=word-translation-without-parallel-data)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/word-translation-without-parallel-data/word-alignment-on-fr-en)](https://paperswithcode.com/sota/word-alignment-on-fr-en?p=word-translation-without-parallel-data)`

Word Translation Without Parallel Data

ICLR 2018 · Alexis Conneau, Guillaume Lample, Marc'Aurelio Ranzato, Ludovic Denoyer, Hervé Jégou

State-of-the-art methods for learning cross-lingual word embeddings have relied on bilingual dictionaries or parallel corpora. Recent studies showed that the need for parallel data supervision can be alleviated with character-level information. While these methods showed encouraging results, they are not on par with their supervised counterparts and are limited to pairs of languages sharing a common alphabet. In this work, we show that we can build a bilingual dictionary between two languages without using any parallel corpora, by aligning monolingual word embedding spaces in an unsupervised way. Without using any character information, our model even outperforms existing supervised methods on cross-lingual tasks for some language pairs. Our experiments demonstrate that our method works very well also for distant language pairs, like English-Russian or English-Chinese. We finally describe experiments on the English-Esperanto low-resource language pair, on which there only exists a limited amount of parallel data, to show the potential impact of our method in fully unsupervised machine translation. Our code, embeddings and dictionaries are publicly available.
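
The abstract does not spell out the alignment steps, but the model label "Adv - Refine - CSLS" in the results table points to an adversarial initialization, a refinement step, and CSLS retrieval. The sketch below illustrates only the last two ideas on synthetic data, under stated assumptions: the adversarial step is omitted and replaced by a small known seed dictionary (so this stand-in is supervised, unlike the paper's method), the refinement is shown as a single orthogonal Procrustes solve, and `k = 10` is an illustrative neighborhood size. All function and variable names are hypothetical, and this is not the facebookresearch/MUSE code.

```python
import numpy as np


def normalize(m):
    """Scale each row to unit length so dot products are cosine similarities."""
    return m / np.linalg.norm(m, axis=1, keepdims=True)


def procrustes(src, tgt):
    """Orthogonal W minimizing ||src @ W.T - tgt||_F over paired rows."""
    u, _, vt = np.linalg.svd(tgt.T @ src)
    return u @ vt


def csls_scores(mapped_src, tgt, k=10):
    """Cross-domain similarity local scaling: 2*cos minus both neighborhood means."""
    sim = normalize(mapped_src) @ normalize(tgt).T  # shape (n_src, n_tgt)
    # Mean cosine of each mapped source word to its k nearest target words.
    r_src = np.sort(sim, axis=1)[:, -k:].mean(axis=1, keepdims=True)
    # Mean cosine of each target word to its k nearest mapped source words.
    r_tgt = np.sort(sim, axis=0)[-k:, :].mean(axis=0, keepdims=True)
    return 2 * sim - r_src - r_tgt


# Synthetic setup (assumption): the target space is a rotated, slightly noisy
# copy of the source space, and row i of src translates to row i of tgt.
rng = np.random.default_rng(0)
n, d = 200, 20
src = rng.standard_normal((n, d))
true_rotation, _ = np.linalg.qr(rng.standard_normal((d, d)))
tgt = src @ true_rotation.T + 0.01 * rng.standard_normal((n, d))

# Seed dictionary stand-in: the first 50 pairs are treated as known.
W = procrustes(src[:50], tgt[:50])

# Retrieve a translation for every source word with CSLS and score P@1.
predicted = csls_scores(src @ W.T, tgt).argmax(axis=1)
p_at_1 = 100.0 * np.mean(predicted == np.arange(n))
print(f"P@1 on synthetic data: {p_at_1:.1f}")
```

Constraining `W` to be orthogonal keeps distances and angles within the source space unchanged after mapping. Subtracting the two neighborhood means penalizes "hub" vectors that sit close to many words at once, which plain nearest-neighbor retrieval tends to over-select.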

Code

facebookresearch/MUSE official

3,165

baidu-research/HNN

yunsukim86/wbw-lm

YovaKem/generalized-procrustes-MUSE

beinborn/SemanticDrift

See all 19 implementations

Tasks

Cross-Lingual Word Embeddings

Machine Translation

Translation

Unsupervised Machine Translation

Word Alignment

Word Embeddings

Word Translation

Results from the Paper

Ranked #2 on Word Alignment on en-es

