Search Results for author: Kelly Marchisio

Found 13 papers, 4 papers with code

An Alignment-Based Approach to Semi-Supervised Bilingual Lexicon Induction with Small Parallel Corpora

1 code implementation MTSummit 2021 Kelly Marchisio, Philipp Koehn, Conghao Xiong

Aimed at generating a seed lexicon for use in downstream natural language tasks and unsupervised methods for bilingual lexicon induction have received much attention in the academic literature recently.

Bilingual Lexicon Induction Translation

Embedding-Enhanced GIZA++: Improving Low-Resource Word Alignment Using Embeddings

no code implementations AMTA 2022 Kelly Marchisio, Conghao Xiong, Philipp Koehn

A popular natural language processing task decades ago, word alignment has been dominated until recently by GIZA++, a statistical method based on the 30-year-old IBM models.

Machine Translation Translation +1

Learning a Formality-Aware Japanese Sentence Representation

no code implementations17 Jan 2023 Henry Li Xinyuan, Ray Lee, Jerry Chen, Kelly Marchisio

On the other hand, downstream tasks such as translation would benefit from working with a sentence representation that preserves formality in addition to semantics, so as to generate sentences with the appropriate level of social formality -- the difference between speaking to a friend versus speaking with a supervisor.

Sentence

Mini-Model Adaptation: Efficiently Extending Pretrained Models to New Languages via Aligned Shallow Training

no code implementations20 Dec 2022 Kelly Marchisio, Patrick Lewis, Yihong Chen, Mikel Artetxe

Prior work shows that it is possible to expand pretrained Masked Language Models (MLMs) to new languages by learning a new set of embeddings, while keeping the transformer body frozen.

Cross-Lingual Transfer

Bilingual Lexicon Induction for Low-Resource Languages using Graph Matching via Optimal Transport

no code implementations25 Oct 2022 Kelly Marchisio, Ali Saad-Eldin, Kevin Duh, Carey Priebe, Philipp Koehn

Bilingual lexicons form a critical component of various natural language processing applications, including unsupervised and semisupervised machine translation and crosslingual information retrieval.

Bilingual Lexicon Induction Graph Matching +3

IsoVec: Controlling the Relative Isomorphism of Word Embedding Spaces

1 code implementation11 Oct 2022 Kelly Marchisio, Neha Verma, Kevin Duh, Philipp Koehn

The ability to extract high-quality translation dictionaries from monolingual word embedding spaces depends critically on the geometric similarity of the spaces -- their degree of "isomorphism."

Bilingual Lexicon Induction Translation

When Does Unsupervised Machine Translation Work?

no code implementations WMT (EMNLP) 2020 Kelly Marchisio, Kevin Duh, Philipp Koehn

We additionally find that unsupervised MT performance declines when source and target languages use different scripts, and observe very poor performance on authentic low-resource language pairs.

Translation Unsupervised Machine Translation

Johns Hopkins University Submission for WMT News Translation Task

no code implementations WS 2019 Kelly Marchisio, Yash Kumar Lal, Philipp Koehn

We describe the work of Johns Hopkins University for the shared task of news translation organized by the Fourth Conference on Machine Translation (2019).

Machine Translation Translation

Cannot find the paper you are looking for? You can Submit a new open access paper.