Word Alignment
84 papers with code • 7 benchmarks • 4 datasets
Word Alignment is the task of finding the correspondence between source and target words in a pair of sentences that are translations of each other.
Source: Neural Network-based Word Alignment through Score Aggregation
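As a minimal illustration of the task (not any specific paper's method), word alignment can be sketched as finding, for each source word, the most similar target word under some word-embedding space. The vectors below are hand-made stand-ins; in practice one would use contextual multilingual embeddings (e.g. from a multilingual encoder):

```python
import numpy as np

# Toy example: aligning "the house" (EN) with "das Haus" (DE).
# These 2-d vectors are illustrative stand-ins for real embeddings.
src_words = ["the", "house"]
tgt_words = ["das", "Haus"]
src_vecs = np.array([[1.0, 0.1], [0.1, 1.0]])
tgt_vecs = np.array([[0.9, 0.2], [0.2, 0.9]])

def cosine_sim_matrix(a, b):
    """Pairwise cosine similarity between row vectors of a and b."""
    a = a / np.linalg.norm(a, axis=1, keepdims=True)
    b = b / np.linalg.norm(b, axis=1, keepdims=True)
    return a @ b.T

sim = cosine_sim_matrix(src_vecs, tgt_vecs)
# Greedy alignment: link each source word to its most similar target word.
alignment = [(i, int(np.argmax(sim[i]))) for i in range(len(src_words))]
for i, j in alignment:
    print(src_words[i], "->", tgt_words[j])
# -> the -> das
# -> house -> Haus
```

Real aligners replace the greedy argmax with bidirectional agreement, thresholding, or optimal-transport matching over the similarity matrix.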
Latest papers
Unbalanced Optimal Transport for Unbalanced Word Alignment
Monolingual word alignment is crucial to model semantic interactions between sentences.
Do GPTs Produce Less Literal Translations?
For the task of Machine Translation (MT), multiple works have investigated few-shot prompting mechanisms to elicit better translations from LLMs.
Towards Unsupervised Recognition of Token-level Semantic Differences in Related Documents
Automatically highlighting words that cause semantic differences between two documents could be useful for a wide range of applications.
AlignAtt: Using Attention-based Audio-Translation Alignments as a Guide for Simultaneous Speech Translation
Attention is the core mechanism of today's most widely used architectures for natural language processing, and it has been analyzed from many perspectives, including its effectiveness for machine translation-related tasks.
Low-resource Bilingual Dialect Lexicon Induction with Large Language Models
Bilingual word lexicons are crucial tools for multilingual natural language understanding and machine translation tasks, as they facilitate the mapping of words in one language to their synonyms in another language.
Meeting the Needs of Low-Resource Languages: The Value of Automatic Alignments via Pretrained Models
However, the languages most in need of automatic alignment are low-resource and, thus, not typically included in the pretraining data.
Multilingual Sentence Transformer as A Multilingual Word Aligner
In this paper, we investigate whether the multilingual sentence Transformer LaBSE is a strong multilingual word aligner.
Noisy Parallel Data Alignment
An ongoing challenge in natural language processing is that its major advances disproportionately favor resource-rich languages, leaving many under-resourced languages behind.
Learning To Generate Language-Supervised and Open-Vocabulary Scene Graph Using Pre-Trained Visual-Semantic Space
Specifically, cheap scene graph supervision data can be obtained by parsing image language descriptions into semantic graphs.
Frustratingly Easy Label Projection for Cross-lingual Transfer
Translating training data into many languages has emerged as a practical solution for improving cross-lingual transfer.