Bilingual Lexicon Induction

34 papers with code • 0 benchmarks • 0 datasets

Translate words from one language to another.

Semi-Supervised Learning for Bilingual Lexicon Induction

gguinet/semisupervised-alignment 10 Feb 2024

It was recently shown that such a lexicon can be inferred without any parallel data by aligning word embeddings trained on monolingual data.

4

ProMap: Effective Bilingual Lexicon Induction via Language Model Prompting

4mekki4/promap 28 Oct 2023

We also demonstrate the effectiveness of ProMap in re-ranking results from other BLI methods, such as those based on aligned static word embeddings.

0

On Bilingual Lexicon Induction with Large Language Models

cambridgeltl/prompt4bli 21 Oct 2023

Bilingual Lexicon Induction (BLI) is a core task in multilingual NLP that still, to a large extent, relies on calculating cross-lingual word representations.

8

When your Cousin has the Right Connections: Unsupervised Bilingual Lexicon Induction for Related Data-Imbalanced Languages

niyatibafna/bli-for-indic-languages 23 May 2023

Most existing approaches for unsupervised bilingual lexicon induction (BLI) depend on good quality static or contextual embeddings requiring large monolingual corpora for both languages.

1

Low-resource Bilingual Dialect Lexicon Induction with Large Language Models

mainlp/dialect-BLI 19 Apr 2023

Bilingual word lexicons are crucial tools for multilingual natural language understanding and machine translation tasks, as they facilitate the mapping of words in one language to their synonyms in another language.

3

Improving Bilingual Lexicon Induction with Cross-Encoder Reranking

cambridgeltl/BLICEr 30 Oct 2022

This crucial step is done via 1) creating a word similarity dataset, comprising positive word pairs (i.e., true translations) and hard negative pairs induced from the original CLWE space, and then 2) fine-tuning an mPLM (e.g., mBERT or XLM-R) in a cross-encoder manner to predict the similarity scores.

13
30 Oct 2022
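
The first step described in this entry (positive pairs plus hard negatives drawn from the cross-lingual embedding space) can be sketched as follows. This is a minimal illustration, not the BLICEr implementation: the function name `build_similarity_pairs`, the 0/1 labels, the `num_negatives` parameter, and the two-dimensional toy vectors are all assumptions made for the example.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def build_similarity_pairs(src_emb, tgt_emb, seed_dict, num_negatives=2):
    """Return (source word, target word, label) triples.

    Label 1 marks a true translation from the seed dictionary; label 0 marks
    a hard negative, i.e. a wrong target word that lies close to the source
    word in the shared embedding space. (Hypothetical labelling scheme.)
    """
    examples = []
    for src_word, gold in seed_dict.items():
        examples.append((src_word, gold, 1))
        # Rank all non-gold target words by similarity to the source word.
        ranked = sorted(
            (w for w in tgt_emb if w != gold),
            key=lambda w: cosine(src_emb[src_word], tgt_emb[w]),
            reverse=True,
        )
        for negative in ranked[:num_negatives]:
            examples.append((src_word, negative, 0))
    return examples

# Toy vectors, assumed to live in an already aligned cross-lingual space.
src_emb = {"dog": (1.0, 0.1)}
tgt_emb = {
    "Hund": (0.9, 0.2),
    "Welpe": (0.8, 0.4),
    "Katze": (0.5, 0.8),
    "Haus": (-0.7, 0.6),
}
pairs = build_similarity_pairs(src_emb, tgt_emb, {"dog": "Hund"})
print(pairs)
```

In this toy space the negatives chosen for "dog" are the two closest wrong candidates, which is what makes them harder to reject than a randomly sampled word. The resulting triples would then serve as training data for the cross-encoder in step 2.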

RAPO: An Adaptive Ranking Paradigm for Bilingual Lexicon Induction

jlfj345wf/rapo 18 Oct 2022

Bilingual lexicon induction induces the word translations by aligning independently trained word embeddings in two languages.

3
18 Oct 2022
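
Several entries on this page describe inducing translations by aligning independently trained embeddings. The sketch below shows that idea in its simplest form, under strong assumptions: the embeddings are two-dimensional toy vectors, the mapping is restricted to a rotation fitted on a small seed dictionary (a two-dimensional special case of orthogonal Procrustes), and retrieval is plain cosine nearest neighbour. It is not the method of any paper listed here, and all names and data are hypothetical.

```python
import math

def rotate(vec, angle):
    """Rotate a 2-D vector counter-clockwise by `angle` radians."""
    x, y = vec
    c, s = math.cos(angle), math.sin(angle)
    return (c * x - s * y, s * x + c * y)

def fit_rotation(pairs):
    """Least-squares rotation angle mapping each source vector onto its target."""
    dot = sum(sx * tx + sy * ty for (sx, sy), (tx, ty) in pairs)
    cross = sum(sx * ty - sy * tx for (sx, sy), (tx, ty) in pairs)
    return math.atan2(cross, dot)

def cosine(u, v):
    """Cosine similarity between two 2-D vectors."""
    return (u[0] * v[0] + u[1] * v[1]) / (math.hypot(*u) * math.hypot(*v))

def induce_lexicon(src_emb, tgt_emb, seed):
    """Fit the map on seed pairs, then translate every source word."""
    angle = fit_rotation([(src_emb[s], tgt_emb[t]) for s, t in seed])
    lexicon = {}
    for word, vec in src_emb.items():
        mapped = rotate(vec, angle)
        # Nearest target word by cosine similarity in the mapped space.
        lexicon[word] = max(tgt_emb, key=lambda t: cosine(mapped, tgt_emb[t]))
    return lexicon

# Toy setup: the target space is the source space rotated by 40 degrees,
# i.e. the two spaces are perfectly isomorphic (real spaces are not).
src_emb = {"dog": (1.0, 0.2), "cat": (0.8, 0.6),
           "house": (-0.5, 1.0), "tree": (-0.9, -0.3)}
gold = {"dog": "Hund", "cat": "Katze", "house": "Haus", "tree": "Baum"}
tgt_emb = {gold[w]: rotate(v, math.radians(40)) for w, v in src_emb.items()}

seed = [("dog", "Hund"), ("cat", "Katze")]
lexicon = induce_lexicon(src_emb, tgt_emb, seed)
print(lexicon)
```

Only "dog" and "cat" are used to fit the map, yet "house" and "tree" are also translated correctly because the toy spaces differ by an exact rotation. With real embeddings the spaces are only approximately isomorphic, so retrieval quality degrades accordingly.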

IsoVec: Controlling the Relative Isomorphism of Word Embedding Spaces

kellymarchisio/isovec 11 Oct 2022

The ability to extract high-quality translation dictionaries from monolingual word embedding spaces depends critically on the geometric similarity of the spaces -- their degree of "isomorphism."

7

Improving Word Translation via Two-Stage Contrastive Learning

cambridgeltl/contrastivebli ACL 2022

At Stage C1, we propose to refine standard cross-lingual linear maps between static word embeddings (WEs) via a contrastive learning objective; we also show how to integrate it into the self-learning procedure for even more refined cross-lingual maps.

32
15 Mar 2022
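
To make the idea of a contrastive objective over a cross-lingual linear map concrete, here is a minimal sketch. The listing only says "a contrastive learning objective"; the InfoNCE-style form, the cosine scoring, the `temperature` value, and the toy vectors below are assumptions chosen for illustration and are not confirmed to match the paper's loss.

```python
import math

def apply_map(W, x):
    """Multiply matrix W (list of rows) by vector x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def info_nce(W, src_vec, pos_vec, neg_vecs, temperature=0.1):
    """InfoNCE-style loss for one source word (assumed form).

    The mapped source vector should score higher against its true
    translation (pos_vec) than against the negatives (neg_vecs).
    """
    mapped = apply_map(W, src_vec)
    logits = [cosine(mapped, pos_vec) / temperature]
    logits += [cosine(mapped, n) / temperature for n in neg_vecs]
    # Numerically stable log-sum-exp over all candidates.
    top = max(logits)
    log_denominator = top + math.log(sum(math.exp(l - top) for l in logits))
    return log_denominator - logits[0]

# Toy case: the true translation lies at a 90-degree rotation of the source.
src_vec = (1.0, 0.0)
pos_vec = (0.0, 1.0)
neg_vecs = [(1.0, 0.0), (-1.0, 0.0)]

identity = [[1.0, 0.0], [0.0, 1.0]]    # poor map: leaves the source unrotated
rotation = [[0.0, -1.0], [1.0, 0.0]]   # good map: rotates by 90 degrees

loss_bad = info_nce(identity, src_vec, pos_vec, neg_vecs)
loss_good = info_nce(rotation, src_vec, pos_vec, neg_vecs)
print(loss_bad, loss_good)
```

The loss is large for the identity map, which places the source word on top of a negative, and close to zero for the rotation, which places it on its translation. Minimising such a loss over `W` is one way a linear map could be refined beyond a plain least-squares fit.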

Improving Word Translation via Two-Stage Contrastive Learning

cambridgeltl/contrastivebli ACL ARR November 2021

At Stage C1, we propose to refine standard cross-lingual linear maps between static word embeddings (WEs) via a contrastive learning objective; we also show how to integrate it into the self-learning procedure for even more refined cross-lingual maps.

32
16 Nov 2021