no code implementations • ACL (SIGMORPHON) 2021 • E. Margaret Perkoff, Josh Daniels, Alexis Palmer
This paper presents two different systems for unsupervised clustering of morphological paradigms, in the context of the SIGMORPHON 2021 Shared Task 2.
no code implementations • LILT 2016 • Ana Marasović, Mengfei Zhou, Alexis Palmer, Anette Frank
Modal verbs have different interpretations depending on their context.
no code implementations • NAACL (AmericasNLP) 2021 • Manuel Mager, Arturo Oncevay, Abteen Ebrahimi, John Ortega, Annette Rios, Angela Fan, Ximena Gutierrez-Vasques, Luis Chiruzzo, Gustavo Giménez-Lugo, Ricardo Ramos, Ivan Vladimir Meza Ruiz, Rolando Coto-Solano, Alexis Palmer, Elisabeth Mager-Hois, Vishrav Chaudhary, Graham Neubig, Ngoc Thang Vu, Katharina Kann
This paper presents the results of the 2021 Shared Task on Open Machine Translation for Indigenous Languages of the Americas.
no code implementations • COLING 2022 • Daniel Chen, Alexis Palmer
For the task of classifying verbs in context as dynamic or stative, current models approach human performance, but only for particular data sets.
no code implementations • FieldMatters (COLING) 2022 • Katharina Kann, Abteen Ebrahimi, Kristine Stenzel, Alexis Palmer
This translation task is challenging for multiple reasons: (1) the data is out-of-domain with respect to the MT system’s training data, (2) much of the data is conversational, (3) existing translations include non-standard and uncommon expressions, often reflecting properties of the documented language, and (4) the data includes borrowings from other regional languages.
no code implementations • 1 Dec 2024 • Ali Marashian, Enora Rice, Luke Gessler, Alexis Palmer, Katharina von der Wense
Many of the world's languages have insufficient data to train high-performing general neural machine translation (NMT) models, let alone domain-specific models, and often the only available parallel data are small amounts of religious texts.
Domain Adaptation Low Resource Neural Machine Translation +3
no code implementations • 1 Oct 2024 • Bhargav Shandilya, Alexis Palmer
In this paper, we propose a retrieval augmented generation (RAG) framework backed by a large language model (LLM) to correct the output of a smaller model for the linguistic task of morphological glossing.
no code implementations • 27 Jun 2024 • Michael Ginn, Mans Hulden, Alexis Palmer
We explore whether LLMs can be effective at the task of interlinear glossing with in-context learning, without any traditional training.
no code implementations • 21 Mar 2024 • Enora Rice, Ali Marashian, Luke Gessler, Alexis Palmer, Katharina von der Wense
Canonical morphological segmentation is the process of analyzing words into the standard (aka underlying) forms of their constituent morphemes.
no code implementations • 11 Mar 2024 • Michael Ginn, Lindia Tjuatja, Taiqi He, Enora Rice, Graham Neubig, Alexis Palmer, Lori Levin
We compile the largest existing corpus of IGT data from a variety of sources, covering over 450k examples across 1. 8k languages, to enable research on crosslingual transfer and IGT generation.
no code implementations • 5 Nov 2023 • Michael Ginn, Alexis Palmer
Generalization is of particular importance in resource-constrained settings, where the available training data may represent only a small fraction of the distribution of possible texts.
1 code implementation • 29 Aug 2023 • Michael Ginn, Alexis Palmer
Morpheme glossing is a critical task in automated language documentation and can benefit other downstream applications greatly.
no code implementations • 28 Oct 2022 • Marie Grace, Xajavion "Jay" Seabrum, Dananjay Srinivas, Alexis Palmer
The automatic detection of offensive language is a pressing societal need.
no code implementations • 18 Aug 2022 • Annemarie Friedrich, Nianwen Xue, Alexis Palmer
This includes whether a situation is described as a state or as an event, whether the situation is finished or ongoing, and whether it is viewed as a whole or with a focus on a particular phase.
1 code implementation • ACL 2022 • Abteen Ebrahimi, Manuel Mager, Arturo Oncevay, Vishrav Chaudhary, Luis Chiruzzo, Angela Fan, John Ortega, Ricardo Ramos, Annette Rios, Ivan Meza-Ruiz, Gustavo A. Giménez-Lugo, Elisabeth Mager, Graham Neubig, Alexis Palmer, Rolando Coto-Solano, Ngoc Thang Vu, Katharina Kann
Continued pretraining offers improvements, with an average accuracy of 44. 05%.
no code implementations • SEMEVAL 2020 • Jared Fromknecht, Alexis Palmer
This paper outlines our approach to Tasks A {\&} B for the English Language track of SemEval-2020 Task 12: OffensEval 2: Multilingual Offensive Language Identification in Social Media.
no code implementations • SEMEVAL 2020 • Maia Petee, Alexis Palmer
Our system for the PropEval task explores the ability of semantic features to detect and label propagandistic rhetorical techniques in English news articles.
1 code implementation • Findings of the Association for Computational Linguistics 2020 • Md Mosharaf Hossain, Antonios Anastasopoulos, Eduardo Blanco, Alexis Palmer
As machine translation (MT) systems progress at a rapid pace, questions of their adequacy linger.
no code implementations • ACL 2020 • Md Mosharaf Hossain, Kathleen Hamilton, Alexis Palmer, Eduardo Blanco
The focus of a negation is the set of tokens intended to be negated, and a key component for revealing affirmative alternatives to negated utterances.
no code implementations • LREC 2020 • Dhivya Chinnappa, Alexis Palmer, Eduardo Blanco
Specifically, to complete the full TOP task for a given article, a system must do the following: a) identify possessors; b) anchor possessors to times/events; c) identify temporal relations between each temporal anchor and the possession relation it corresponds to; d) assign certainty scores to each possessor and each temporal relation; and e) assemble individual possession events into a global possession timeline.
no code implementations • LREC 2020 • Graham Neubig, Shruti Rijhwani, Alexis Palmer, Jordan MacKenzie, Hilaria Cruz, Xinjian Li, Matthew Lee, Aditi Chaudhary, Luke Gessler, Steven Abney, Shirley Anugrah Hayati, Antonios Anastasopoulos, Olga Zamaraeva, Emily Prud'hommeaux, Jennette Child, Sara Child, Rebecca Knowles, Sarah Moeller, Jeffrey Micher, Yiyuan Li, Sydney Zink, Mengzhou Xia, Roshan S Sharma, Patrick Littell
Despite recent advances in natural language processing and other language technology, the application of such technology to language documentation and conservation has been limited.
no code implementations • WS 2019 • Brad Aiken, Jared Kelly, Alexis Palmer, Suleyman Olcay Polat, Taraka Rama, Rodney Nielsen
While our system results are dramatically below the average system submitted for the shared task evaluation campaign, our method is (we suspect) unique in its minimal reliance on labeled training data.
no code implementations • SEMEVAL 2019 • Zahra Sarabi, Erin Killian, Eduardo Blanco, Alexis Palmer
Negation often conveys implicit positive meaning.
no code implementations • NAACL 2018 • Alakan Vempala, a, Eduardo Blanco, Alexis Palmer
This paper presents models to predict event durations.
no code implementations • WS 2017 • Thomas Haider, Alexis Palmer
The feature sets are used for supervised text genre classification, on which our models achieve high accuracy.
no code implementations • WS 2017 • Alexis Palmer, Melissa Robinson, Kristy K. Phillips
This paper focuses on a particular type of abusive language, targeting expressions in which typically neutral adjectives take on pejorative meaning when used as nouns - compare {`}gay people{'} to {`}the gays{'}.
no code implementations • SEMEVAL 2017 • Maria Becker, Michael Staniek, Vivi Nastase, Alexis Palmer, Anette Frank
Detecting aspectual properties of clauses in the form of situation entity types has been shown to depend on a combination of syntactic-semantic and contextual features.
no code implementations • LREC 2014 • Andrea Horbach, Alexis Palmer, Magdalena Wolska
n this paper we investigate the potential of answer clustering for semi-automatic scoring of short answer questions for German as a foreign language.
no code implementations • LREC 2014 • Annemarie Friedrich, Marina Valeeva, Alexis Palmer
We present LQVSumm, a corpus of about 2000 automatically created extractive multi-document summaries from the TAC 2011 shared task on Guided Summarization, which we annotated with several types of linguistic quality violations.