Word Alignment

84 papers with code • 7 benchmarks • 4 datasets

Word Alignment is the task of finding the correspondence between source and target words in a pair of sentences that are translations of each other.

Source: Neural Network-based Word Alignment through Score Aggregation

Benchmarks

Add a Result

These leaderboards are used to track progress in Word Alignment

Dataset	Best Model	Compare
en-es	Barycenter Alignment	See all
es-en	Barycenter Alignment	See all
en-fr	Barycenter Alignment	See all
fr-en	Barycenter Alignment	See all
en-it	Barycenter Alignment	See all
MUSE en-pt	Barycenter Alignment	See all
MUSE en-de	Barycenter Alignment	See all

Datasets

Latest papers with no code

Most implemented Social Latest No code

Image-Text Co-Decomposition for Text-Supervised Semantic Segmentation

no code yet • 5 Apr 2024

This paper addresses text-supervised semantic segmentation, aiming to learn a model capable of segmenting arbitrary visual concepts within images by using only image-text pairs without dense annotations.

Paper
Add Code

Enhancing Cross-lingual Sentence Embedding for Low-resource Languages with Word Alignment

no code yet • 3 Apr 2024

The field of cross-lingual sentence embeddings has recently experienced significant advancements, but research concerning low-resource languages has lagged due to the scarcity of parallel corpora.

Paper
Add Code

DELTA: Pre-train a Discriminative Encoder for Legal Case Retrieval via Structural Word Alignment

no code yet • 27 Mar 2024

Most of the existing works focus on improving the representation ability for the contextualized embedding of the [CLS] token and calculate relevance using textual semantic similarity.

Paper
Add Code

Self-Augmented In-Context Learning for Unsupervised Word Translation

no code yet • 15 Feb 2024

Recent work has shown that, while large language models (LLMs) demonstrate strong word translation or bilingual lexicon induction (BLI) capabilities in few-shot setups, they still cannot match the performance of 'traditional' mapping-based approaches in the unsupervised scenario where no seed translation pairs are available, especially for lower-resource languages.

Paper
Add Code

Unsupervised Graph Attention Autoencoder for Attributed Networks using K-means Loss

no code yet • 21 Nov 2023

The model employs k-means as an objective function and utilizes a multi-head Graph Attention Auto-Encoder for decoding the representations.

Paper
Add Code

Terminology-Aware Translation with Constrained Decoding and Large Language Model Prompting

no code yet • 9 Oct 2023

Terminology correctness is important in the downstream application of machine translation, and a prevalent way to ensure this is to inject terminology constraints into a translation system.

Paper
Add Code

Gloss Alignment Using Word Embeddings

no code yet • 8 Aug 2023

As a result, research has turned to TV broadcast content as a source of large-scale training data, consisting of both the sign language interpreter and the associated audio subtitle.

Paper
Add Code

Does mBERT understand Romansh? Evaluating word embeddings using word alignment

no code yet • 14 Jun 2023

We test similarity-based word alignment models (SimAlign and awesome-align) in combination with word embeddings from mBERT and XLM-R on parallel sentences in German and Romansh.

Paper
Add Code

Contextualized Word Vector-based Methods for Discovering Semantic Differences with No Training nor Word Alignment

no code yet • 19 May 2023

In this paper, we propose methods for discovering semantic differences in words appearing in two corpora based on the norms of contextualized word vectors.

Paper
Add Code

Investigating the Role of Attribute Context in Vision-Language Models for Object Recognition and Detection

no code yet • 17 Mar 2023

Methods are mostly evaluated in terms of how well object class names are learned, but captions also contain rich attribute context that should be considered when learning object alignment.

Paper
Add Code

Word Alignment

Benchmarks Add a Result

Datasets

Latest papers with no code

Content

Benchmarks

Add a Result