Search Results for author: Esther Ploeger

Found 11 papers, 4 papers with code

A Hybrid Rule-Based and Neural Coreference Resolution System with an Evaluation on Dutch Literature

1 code implementation CRAC (ACL) 2021 Andreas van Cranenburgh, Esther Ploeger, Frank van den Berg, Remi Thüss

We introduce a modular, hybrid coreference resolution system that extends a rule-based baseline with three neural classifiers for the subtasks mention detection, mention attributes (gender, animacy, number), and pronoun resolution.

coreference-resolution Feature Engineering

Multi-perspective Alignment for Increasing Naturalness in Neural Machine Translation

no code implementations11 Dec 2024 Huiyuan Lai, Esther Ploeger, Rik van Noord, Antonio Toral

Neural machine translation (NMT) systems amplify lexical biases present in their training data, leading to artificially impoverished language in output translations.

Diversity Machine Translation +2

How Good is Your Wikipedia?

no code implementations8 Nov 2024 Kushal Tatariya, Artur Kulmizev, Wessel Poelman, Esther Ploeger, Marcel Bollmann, Johannes Bjerva, Jiaming Luo, Heather Lent, Miryam de Lhoneux

Wikipedia's perceived high quality and broad language coverage have established it as a fundamental resource in multilingual NLP.

Multilingual NLP

Towards Tailored Recovery of Lexical Diversity in Literary Machine Translation

no code implementations30 Aug 2024 Esther Ploeger, Huiyuan Lai, Rik van Noord, Antonio Toral

Thus, rather than aiming for the rigid increase of lexical diversity, we reframe the task as recovering what is lost in the machine translation process.

Diversity Machine Translation +1

A Principled Framework for Evaluating on Typologically Diverse Languages

1 code implementation6 Jul 2024 Esther Ploeger, Wessel Poelman, Andreas Holck Høeg-Petersen, Anders Schlichtkrull, Miryam de Lhoneux, Johannes Bjerva

We compare sampling methods with a range of metrics and find that our systematic methods consistently retrieve more typologically diverse language selections than previous methods in NLP.

What is "Typological Diversity" in NLP?

2 code implementations6 Feb 2024 Esther Ploeger, Wessel Poelman, Miryam de Lhoneux, Johannes Bjerva

We recommend future work to include an operationalization of 'typological diversity' that empirically justifies the diversity of language samples.

Diversity Multilingual NLP

Multilingual Gradient Word-Order Typology from Universal Dependencies

no code implementations2 Feb 2024 Emi Baylor, Esther Ploeger, Johannes Bjerva

While information from the field of linguistic typology has the potential to improve performance on NLP tasks, reliable typological data is a prerequisite.

CreoleVal: Multilingual Multitask Benchmarks for Creoles

1 code implementation30 Oct 2023 Heather Lent, Kushal Tatariya, Raj Dabre, Yiyi Chen, Marcell Fekete, Esther Ploeger, Li Zhou, Ruth-Ann Armstrong, Abee Eijansantos, Catriona Malau, Hans Erik Heje, Ernests Lavrinovics, Diptesh Kanojia, Paul Belony, Marcel Bollmann, Loïc Grobol, Miryam de Lhoneux, Daniel Hershcovich, Michel DeGraff, Anders Søgaard, Johannes Bjerva

Creoles represent an under-explored and marginalized group of languages, with few available resources for NLP research. While the genealogical ties between Creoles and a number of highly-resourced languages imply a significant potential for transfer learning, this potential is hampered due to this lack of annotated data.

Machine Translation Reading Comprehension +2

The Past, Present, and Future of Typological Databases in NLP

no code implementations20 Oct 2023 Emi Baylor, Esther Ploeger, Johannes Bjerva

We propose that such a view of typology has significant potential in the future, including in language modeling in low-resource scenarios.

Language Modeling Language Modelling

Cannot find the paper you are looking for? You can Submit a new open access paper.