no code implementations • ACL (SIGMORPHON) 2021 • Kate McCurdy, Sharon Goldwater, Adam Lopez
This work describes the Edinburgh submission to the SIGMORPHON 2021 Shared Task 2 on unsupervised morphological paradigm clustering.
no code implementations • EMNLP (CMCL) 2020 • Kate McCurdy, Adam Lopez, Sharon Goldwater
Grammatical gender is a consistent and informative cue to the plural class of German nouns.
no code implementations • 10 Jan 2025 • Ruby Ostrow, Adam Lopez
A large body of research has found substantial gender bias in NLP systems.
no code implementations • 8 Nov 2023 • Naomi Saphra, Eve Fleisig, Kyunghyun Cho, Adam Lopez
Many NLP researchers are experiencing an existential crisis triggered by the astonishing success of ChatGPT and other systems based on large language models (LLMs).
1 code implementation • 16 Oct 2023 • Andreas Grivas, Antonio Vergari, Adam Lopez
We then show that they can be prevented in practice by introducing a Discrete Fourier Transform (DFT) output layer, which guarantees that all sparse label combinations with up to $k$ active labels are argmaxable.
no code implementations • 22 May 2023 • Seraphina Goldfarb-Tarrant, Björn Ross, Adam Lopez
We also find racial biases to be much more prevalent than gender biases.
no code implementations • 19 May 2023 • Seraphina Goldfarb-Tarrant, Adam Lopez, Roi Blanco, Diego Marcheggiani
To remedy this, we build a counterfactual evaluation corpus for gender and racial/migrant bias in four languages.
1 code implementation • ACL 2022 • Andreas Grivas, Nikolay Bogoychev, Adam Lopez
Classifiers in natural language processing (NLP) often have a large number of output classes.
no code implementations • SEMEVAL 2021 • J. A. Meaney, Steven Wilson, Luis Chiruzzo, Adam Lopez, Walid Magdy
Our subtasks were binary humor detection, prediction of humor and offense ratings, and a novel controversy task: to predict if the variance in the humor ratings was higher than a specific threshold.
no code implementations • ACL 2021 • Seraphina Goldfarb-Tarrant, Rebecca Marchant, Ricardo Muñoz Sanchez, Mugdha Pandya, Adam Lopez
We urge researchers working on debiasing to focus on extrinsic measures of bias, and to make using these measures more feasible via creation of new challenge sets and annotated test data.
no code implementations • Findings of the Association for Computational Linguistics 2020 • Naomi Saphra, Adam Lopez
To explore the inductive biases that cause these compositional representations to arise during training, we conduct simple experiments on synthetic data.
no code implementations • 21 Oct 2020 • Aibek Makazhanov, Sharon Goldwater, Adam Lopez
We present LemMED, a character-level encoder-decoder for contextual morphological analysis (combined lemmatization and tagging).
no code implementations • 6 Oct 2020 • Naomi Saphra, Adam Lopez
To explore the inductive biases that cause these compositional representations to arise during training, we conduct simple experiments on synthetic data.
no code implementations • ACL 2020 • Kate McCurdy, Sharon Goldwater, Adam Lopez
Encoder-decoder models do generalize the most frequently produced plural class, but do not show human-like variability or 'regular' extension of these other plural markers.
no code implementations • 27 Apr 2020 • Naomi Saphra, Adam Lopez
Recent work in NLP shows that LSTM language models capture compositional structure in language data.
1 code implementation • 12 Nov 2019 • Yekun Chai, Naomi Saphra, Adam Lopez
Diverse word representations have surged in most state-of-the-art natural language processing (NLP) applications.
no code implementations • IJCNLP 2019 • Federico Fancellu, Sorcha Gilroy, Adam Lopez, Mirella Lapata
Semantic parses are directed acyclic graphs (DAGs), so semantic parsing should be modeled as graph prediction.
Ranked #5 on
DRS Parsing
on PMB-2.2.0
no code implementations • IJCNLP 2019 • Clara Vania, Yova Kementchedjhieva, Anders Søgaard, Adam Lopez
Parsers are available for only a handful of the world's languages, since they require lots of training data.
no code implementations • 29 Aug 2019 • Sameer Bansal, Herman Kamper, Adam Lopez, Sharon Goldwater
Given a large amount of unannotated speech in a low-resource language, can we classify the speech utterances by topic?
no code implementations • ICML Workshop Deep_Phenomen 2019 • Naomi Saphra, Adam Lopez
Concerns about interpretability, computational resources, and principled inductive priors have motivated efforts to engineer sparse neural models for NLP tasks.
no code implementations • 28 May 2019 • Naomi Saphra, Adam Lopez
LSTM-based language models exhibit compositionality in their representations, but how this behavior emerges over the course of training has not been explored.
no code implementations • NAACL 2019 • Naomi Saphra, Adam Lopez
Research has shown that neural models implicitly encode linguistic features, but there has been no research showing \emph{how} these encodings arise as the models are trained.
no code implementations • WS 2018 • Yova Kementchedjhieva, Adam Lopez
Character language models have access to surface morphological patterns, but it is not clear whether or \textit{how} they learn abstract morphological regularities.
no code implementations • WS 2018 • Clara Vania, Adam Lopez
Neural dependency parsing models that compose word representations from characters can presumably exploit morphosyntax when making attachment decisions.
no code implementations • WS 2018 • Naomi Saphra, Adam Lopez
A glut of recent research shows that language models capture linguistic structure.
no code implementations • NAACL 2019 • Ieva Vasiljeva, Sorcha Gilroy, Adam Lopez
Semantic representations in the form of directed acyclic graphs (DAGs) have been introduced in recent years, and to model them, we need probabilistic models of DAGs.
no code implementations • 4 Oct 2018 • Federico Fancellu, Adam Lopez, Bonnie Webber
Negation scope has been annotated in several English and Chinese corpora, and highly accurate models for this task in these languages have been learned from these annotations.
1 code implementation • NAACL 2019 • Sameer Bansal, Herman Kamper, Karen Livescu, Adam Lopez, Sharon Goldwater
Finally, we show that the approach improves performance on a true low-resource task: pre-training on a combination of English ASR and French ASR improves Mboshi-French ST, where only 4 hours of data are available, from 3. 5 to 7. 1
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+3
no code implementations • 31 Aug 2018 • Yova Kementchedjhieva, Adam Lopez
Character language models have access to surface morphological patterns, but it is not clear whether or how they learn abstract morphological regularities.
no code implementations • EMNLP 2018 • Clara Vania, Andreas Grivas, Adam Lopez
When parsing morphologically-rich languages with neural models, it is beneficial to model input at the character level, and it has been claimed that this is because character-level models learn morphology.
no code implementations • WS 2018 • Arabella Sinclair, Adam Lopez, C. G. Lucas, Dragan Gasevic
We find that lexical priming in learner-tutor dialogues differs from that in conversational and task-based dialogues, and we find evidence that alignment increases with ability and with word complexity.
1 code implementation • NAACL 2018 • Ida Szubert, Adam Lopez, Nathan Schneider
Abstract Meaning Representation (AMR) annotations are often assumed to closely mirror dependency syntax, but AMR explicitly does not require this, and the assumption has never been tested.
no code implementations • 24 Mar 2018 • Sameer Bansal, Herman Kamper, Karen Livescu, Adam Lopez, Sharon Goldwater
We explore models trained on between 20 and 160 hours of data, and find that although models trained on less data have considerably lower BLEU scores, they can still predict words with relatively high precision and recall---around 50% for a model trained on 50 hours of data, versus around 60% for the full 160 hour model.
no code implementations • CL 2018 • David Chiang, Frank Drewes, Daniel Gildea, Adam Lopez, Giorgio Satta
Graphs have a variety of uses in natural language processing, particularly as representations of linguistic meaning.
no code implementations • WS 2017 • Antonios Anastasopoulos, Sameer Bansal, David Chiang, Sharon Goldwater, Adam Lopez
Vast amounts of speech data collected for language documentation and research remain untranscribed and unsearchable, but often a small amount of speech may have text translations available.
no code implementations • ACL 2017 • Jianpeng Cheng, Adam Lopez, Mirella Lapata
Generative models defining joint distributions over parse trees and sentences are useful for parsing and language modeling, but impose restrictions on the scope of features and are often outperformed by discriminative models.
no code implementations • SEMEVAL 2017 • Sorcha Gilroy, Adam Lopez, Sebastian Maneth
Recently, several datasets have become available which represent natural language phenomena as graphs.
no code implementations • CONLL 2017 • Clara Vania, Xingxing Zhang, Adam Lopez
This paper presents our submissions for the CoNLL 2017 UD Shared Task.
no code implementations • ACL 2017 • Clara Vania, Adam Lopez
Words can be represented by composing the representations of subword units such as word segments, characters, and/or character n-grams.
no code implementations • EACL 2017 • Federico Fancellu, Adam Lopez, Bonnie Webber, Hangfeng He
Several corpora have been annotated with negation scope{---}the set of words whose meaning is negated by a cue like the word {``}not{''}{---}leading to the development of classifiers that detect negation scope with high accuracy.
no code implementations • WS 2017 • Federico Fancellu, Siva Reddy, Adam Lopez, Bonnie Webber
Many language technology applications would benefit from the ability to represent negation and its scope on top of widely-used linguistic resources.
no code implementations • EACL 2017 • Sameer Bansal, Herman Kamper, Adam Lopez, Sharon Goldwater
We explore the problem of translating speech to text in low-resource scenarios where neither automatic speech recognition (ASR) nor machine translation (MT) are available, but we have training data in the form of audio paired with text translations.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+4
1 code implementation • 10 Feb 2017 • Federico Fancellu, Siva Reddy, Adam Lopez, Bonnie Webber
Many language technology applications would benefit from the ability to represent negation and its scope on top of widely-used linguistic resources.
no code implementations • 21 Sep 2016 • Sameer Bansal, Herman Kamper, Sharon Goldwater, Adam Lopez
Recent work on unsupervised term discovery (UTD) aims to identify and cluster repeated word-like units from audio alone.
1 code implementation • WS 2016 • Naomi Saphra, Adam Lopez
Existing corpora for intrinsic evaluation are not targeted towards tasks in informal domains such as Twitter or news comment forums.
no code implementations • TACL 2015 • Hua He, Jimmy Lin, Adam Lopez
We believe that GPU-based extraction of hierarchical grammars is an attractive proposition, particularly for MT applications that demand high throughput.
no code implementations • TACL 2013 • Adam Lopez, Matt Post, Chris Callison-Burch, Jonathan Weese, Juri Ganitkevitch, Narges Ahmidi, Olivia Buzek, Leah Hanson, Beenish Jamil, Matthias Lee, Ya-Ting Lin, Henry Pao, Fatima Rivera, Leili Shahriyari, Debu Sinha, Adam Teichert, Stephen Wampler, Michael Weinberger, Daguang Xu, Lin Yang, Shang Zhao
Machine translation (MT) draws from several different disciplines, making it a complex subject to teach.