About

Word Alignment is the task of finding the correspondence between source and target words in a pair of sentences that are translations of each other.

Source: Neural Network-based Word Alignment through Score Aggregation

Benchmarks

TREND DATASET BEST METHOD PAPER TITLE PAPER CODE COMPARE

Datasets

Greatest papers with code

Jointly Learning to Align and Translate with Transformer Models

IJCNLP 2019 pytorch/fairseq

The state of the art in machine translation (MT) is governed by neural approaches, which typically provide superior translation accuracy over statistical approaches.

MACHINE TRANSLATION WORD ALIGNMENT

Word Translation Without Parallel Data

ICLR 2018 facebookresearch/MUSE

We finally describe experiments on the English-Esperanto low-resource language pair, on which there only exists a limited amount of parallel data, to show the potential impact of our method in fully unsupervised machine translation.

UNSUPERVISED MACHINE TRANSLATION WORD ALIGNMENT WORD EMBEDDINGS

Guided Alignment Training for Topic-Aware Neural Machine Translation

6 Jul 2016OpenNMT/OpenNMT-tf

In this paper, we propose an effective way for biasing the attention mechanism of a sequence-to-sequence neural machine translation (NMT) model towards the well-studied statistical word alignment models.

DOMAIN ADAPTATION MACHINE TRANSLATION WORD ALIGNMENT

Addressing the Rare Word Problem in Neural Machine Translation

IJCNLP 2015 atpaino/deep-text-corrector

Our experiments on the WMT14 English to French translation task show that this method provides a substantial improvement of up to 2. 8 BLEU points over an equivalent NMT system that does not use this technique.

MACHINE TRANSLATION WORD ALIGNMENT

Bilingual Lexicon Induction through Unsupervised Machine Translation

ACL 2019 artetxem/monoses

A recent research line has obtained strong results on bilingual lexicon induction by aligning independently trained word embeddings in two languages and using the resulting cross-lingual embeddings to induce word translation pairs through nearest neighbor or related retrieval methods.

LANGUAGE MODELLING UNSUPERVISED MACHINE TRANSLATION WORD ALIGNMENT WORD EMBEDDINGS

SimAlign: High Quality Word Alignments without Parallel Training Data using Static and Contextualized Embeddings

18 Apr 2020cisnlp/simalign

We find that alignments created from embeddings are superior for four and comparable for two language pairs compared to those produced by traditional statistical aligners, even with abundant parallel data; e. g., contextualized embeddings achieve a word alignment F1 for English-German that is 5 percentage points higher than eflomal, a high-quality statistical aligner, trained on 100k parallel sentences.

MACHINE TRANSLATION MULTILINGUAL WORD EMBEDDINGS WORD ALIGNMENT

Word Alignment by Fine-tuning Embeddings on Parallel Corpora

20 Jan 2021neulab/awesome-align

In addition, we demonstrate that we are able to train multilingual word aligners that can obtain robust performance on different language pairs.

CROSS-LINGUAL TRANSFER WORD ALIGNMENT WORD EMBEDDINGS