Bilingual Lexicon Induction

34 papers with code • 0 benchmarks • 0 datasets

Translate words from one language to another.

Prix-LM: Pretraining for Multilingual Knowledge Base Construction

luka-group/prix-lm ACL 2022

To achieve this, it is crucial to represent multilingual knowledge in a shared/unified space.

6
16 Oct 2021

Combining Static Word Embeddings and Contextual Representations for Bilingual Lexicon Induction

zjpbinary/CSCBLI Findings (ACL) 2021

Bilingual Lexicon Induction (BLI) aims to map words in one language to their translations in another, and is typically through learning linear projections to align monolingual word representation spaces.

14
06 Jun 2021

Evaluating Word Embeddings with Categorical Modularity

enscma2/categorical-modularity Findings (ACL) 2021

We find moderate to strong positive correlations between categorical modularity and performance on the monolingual tasks of sentiment analysis and word similarity calculation and on the cross-lingual task of bilingual lexicon induction both to and from English.

3
02 Jun 2021

Cross-Lingual Word Embedding Refinement by $\ell_1$ Norm Optimisation

Pzoom522/L1-Refinement NAACL 2021

It is therefore recommended that this strategy be adopted as a standard for CLWE methods.

16
01 Jun 2021

Cross-Lingual Word Embedding Refinement by $\ell_{1}$ Norm Optimisation

Pzoom522/L1-Refinement 11 Apr 2021

It is therefore recommended that this strategy be adopted as a standard for CLWE methods.

16
11 Apr 2021

Improving the Lexical Ability of Pretrained Language Models for Unsupervised Neural Machine Translation

alexandra-chron/lexical_xlm_relm NAACL 2021

Successful methods for unsupervised neural machine translation (UNMT) employ crosslingual pretraining via self-supervision, often in the form of a masked language modeling or a sequence generation task, which requires the model to align the lexical- and high-level representations of the two languages.

17
18 Mar 2021

Learning Contextualised Cross-lingual Word Embeddings and Alignments for Extremely Low-Resource Languages Using Parallel Corpora

twadada/multilingual-nlm EMNLP (MRL) 2021

We propose a new approach for learning contextualised cross-lingual word embeddings based on a small parallel corpus (e. g. a few hundred sentence pairs).

29
27 Oct 2020

Semi-Supervised Bilingual Lexicon Induction with Two-way Interaction

BestActionNow/SemiSupBLI EMNLP 2020

In this paper, we propose a new semi-supervised BLI framework to encourage the interaction between the supervised signal and unsupervised alignment.

0
14 Oct 2020

Refinement of Unsupervised Cross-Lingual Word Embeddings

artetxem/vecmap 21 Feb 2020

In this paper, we propose a self-supervised method to refine the alignment of unsupervised bilingual word embeddings.

640
21 Feb 2020