Search Results for author: Alexis Palmer

Found 46 papers, 5 papers with code

Orthographic vs. Semantic Representations for Unsupervised Morphological Paradigm Clustering

no code implementations ACL (SIGMORPHON) 2021 E. Margaret Perkoff, Josh Daniels, Alexis Palmer

This paper presents two different systems for unsupervised clustering of morphological paradigms, in the context of the SIGMORPHON 2021 Shared Task 2.

Clustering LEMMA +1

Contrast Sets for Stativity of English Verbs in Context

no code implementations COLING 2022 Daniel Chen, Alexis Palmer

For the task of classifying verbs in context as dynamic or stative, current models approach human performance, but only for particular data sets.

Machine Translation Between High-resource Languages in a Language Documentation Setting

no code implementations FieldMatters (COLING) 2022 Katharina Kann, Abteen Ebrahimi, Kristine Stenzel, Alexis Palmer

This translation task is challenging for multiple reasons: (1) the data is out-of-domain with respect to the MT system’s training data, (2) much of the data is conversational, (3) existing translations include non-standard and uncommon expressions, often reflecting properties of the documented language, and (4) the data includes borrowings from other regional languages.

Machine Translation Translation +1

From Priest to Doctor: Domain Adaptaion for Low-Resource Neural Machine Translation

no code implementations1 Dec 2024 Ali Marashian, Enora Rice, Luke Gessler, Alexis Palmer, Katharina von der Wense

Many of the world's languages have insufficient data to train high-performing general neural machine translation (NMT) models, let alone domain-specific models, and often the only available parallel data are small amounts of religious texts.

Domain Adaptation Low Resource Neural Machine Translation +3

Boosting the Capabilities of Compact Models in Low-Data Contexts with Large Language Models and Retrieval-Augmented Generation

no code implementations1 Oct 2024 Bhargav Shandilya, Alexis Palmer

In this paper, we propose a retrieval augmented generation (RAG) framework backed by a large language model (LLM) to correct the output of a smaller model for the linguistic task of morphological glossing.

Descriptive Inductive Bias +4

Can we teach language models to gloss endangered languages?

no code implementations27 Jun 2024 Michael Ginn, Mans Hulden, Alexis Palmer

We explore whether LLMs can be effective at the task of interlinear glossing with in-context learning, without any traditional training.

Descriptive In-Context Learning

TAMS: Translation-Assisted Morphological Segmentation

no code implementations21 Mar 2024 Enora Rice, Ali Marashian, Luke Gessler, Alexis Palmer, Katharina von der Wense

Canonical morphological segmentation is the process of analyzing words into the standard (aka underlying) forms of their constituent morphemes.

Segmentation Translation

GlossLM: A Massively Multilingual Corpus and Pretrained Model for Interlinear Glossed Text

no code implementations11 Mar 2024 Michael Ginn, Lindia Tjuatja, Taiqi He, Enora Rice, Graham Neubig, Alexis Palmer, Lori Levin

We compile the largest existing corpus of IGT data from a variety of sources, covering over 450k examples across 1. 8k languages, to enable research on crosslingual transfer and IGT generation.

Robust Generalization Strategies for Morpheme Glossing in an Endangered Language Documentation Context

no code implementations5 Nov 2023 Michael Ginn, Alexis Palmer

Generalization is of particular importance in resource-constrained settings, where the available training data may represent only a small fraction of the distribution of possible texts.

Denoising

Taxonomic Loss for Morphological Glossing of Low-Resource Languages

1 code implementation29 Aug 2023 Michael Ginn, Alexis Palmer

Morpheme glossing is a critical task in automated language documentation and can benefit other downstream applications greatly.

A Kind Introduction to Lexical and Grammatical Aspect, with a Survey of Computational Approaches

no code implementations18 Aug 2022 Annemarie Friedrich, Nianwen Xue, Alexis Palmer

This includes whether a situation is described as a state or as an event, whether the situation is finished or ongoing, and whether it is viewed as a whole or with a focus on a particular phase.

UNT Linguistics at SemEval-2020 Task 12: Linear SVC with Pre-trained Word Embeddings as Document Vectors and Targeted Linguistic Features

no code implementations SEMEVAL 2020 Jared Fromknecht, Alexis Palmer

This paper outlines our approach to Tasks A {\&} B for the English Language track of SemEval-2020 Task 12: OffensEval 2: Multilingual Offensive Language Identification in Social Media.

Language Identification Word Embeddings

UNTLing at SemEval-2020 Task 11: Detection of Propaganda Techniques in English News Articles

no code implementations SEMEVAL 2020 Maia Petee, Alexis Palmer

Our system for the PropEval task explores the ability of semantic features to detect and label propagandistic rhetorical techniques in English news articles.

Predicting the Focus of Negation: Model and Error Analysis

no code implementations ACL 2020 Md Mosharaf Hossain, Kathleen Hamilton, Alexis Palmer, Eduardo Blanco

The focus of a negation is the set of tokens intended to be negated, and a key component for revealing affirmative alternatives to negated utterances.

Negation

WikiPossessions: Possession Timeline Generation as an Evaluation Benchmark for Machine Reading Comprehension of Long Texts

no code implementations LREC 2020 Dhivya Chinnappa, Alexis Palmer, Eduardo Blanco

Specifically, to complete the full TOP task for a given article, a system must do the following: a) identify possessors; b) anchor possessors to times/events; c) identify temporal relations between each temporal anchor and the possession relation it corresponds to; d) assign certainty scores to each possessor and each temporal relation; and e) assemble individual possession events into a global possession timeline.

Machine Reading Comprehension Relation +1

Sigmorphon 2019 Task 2 system description paper: Morphological analysis in context for many languages, with supervision from only a few

no code implementations WS 2019 Brad Aiken, Jared Kelly, Alexis Palmer, Suleyman Olcay Polat, Taraka Rama, Rodney Nielsen

While our system results are dramatically below the average system submitted for the shared task evaluation campaign, our method is (we suspect) unique in its minimal reliance on labeled training data.

Lemmatization Morphological Analysis +5

Illegal is not a Noun: Linguistic Form for Detection of Pejorative Nominalizations

no code implementations WS 2017 Alexis Palmer, Melissa Robinson, Kristy K. Phillips

This paper focuses on a particular type of abusive language, targeting expressions in which typically neutral adjectives take on pejorative meaning when used as nouns - compare {`}gay people{'} to {`}the gays{'}.

Abusive Language POS

Finding a Tradeoff between Accuracy and Rater's Workload in Grading Clustered Short Answers

no code implementations LREC 2014 Andrea Horbach, Alexis Palmer, Magdalena Wolska

n this paper we investigate the potential of answer clustering for semi-automatic scoring of short answer questions for German as a foreign language.

Clustering

LQVSumm: A Corpus of Linguistic Quality Violations in Multi-Document Summarization

no code implementations LREC 2014 Annemarie Friedrich, Marina Valeeva, Alexis Palmer

We present LQVSumm, a corpus of about 2000 automatically created extractive multi-document summaries from the TAC 2011 shared task on Guided Summarization, which we annotated with several types of linguistic quality violations.

Document Summarization Multi-Document Summarization +1

Cannot find the paper you are looking for? You can Submit a new open access paper.