Search Results for author: Dirk Hovy

Found 110 papers, 31 papers with code

We Need to Consider Disagreement in Evaluation

no code implementations • ACL (BPPF) 2021 • Valerio Basile, Michael Fell, Tommaso Fornaciari, Dirk Hovy, Silviu Paun, Barbara Plank, Massimo Poesio, Alexandra Uma

Instead, we suggest that we need to better capture the sources of disagreement to improve today’s evaluation practice.


Guiding the Release of Safer E2E Conversational AI through Value Sensitive Design

no code implementations • SIGDIAL (ACL) 2022 • A. Stevie Bergman, Gavin Abercrombie, Shannon Spruit, Dirk Hovy, Emily Dinan, Y-Lan Boureau, Verena Rieser

Over the last several years, end-to-end neural conversational agents have vastly improved their ability to carry unrestricted, open-domain conversations with humans.


Hard and Soft Evaluation of NLP models with BOOtSTrap SAmpling - BooStSa

no code implementations • ACL 2022 • Tommaso Fornaciari, Alexandra Uma, Massimo Poesio, Dirk Hovy

Natural Language Processing (NLP)’s applied nature makes it necessary to select the most effective and robust models.

Experimental Design


A Report on the VarDial Evaluation Campaign 2020

no code implementations • VarDial (COLING) 2020 • Mihaela Gaman, Dirk Hovy, Radu Tudor Ionescu, Heidi Jauhiainen, Tommi Jauhiainen, Krister Lindén, Nikola Ljubešić, Niko Partanen, Christoph Purschke, Yves Scherrer, Marcos Zampieri

This paper presents the results of the VarDial Evaluation Campaign 2020 organized as part of the seventh workshop on Natural Language Processing (NLP) for Similar Languages, Varieties and Dialects (VarDial), co-located with COLING 2020.

Dialect Identification


Measuring Harmful Sentence Completion in Language Models for LGBTQIA+ Individuals

1 code implementation • LTEDI (ACL) 2022 • Debora Nozza, Federico Bianchi, Anne Lauscher, Dirk Hovy

Current language technology is ubiquitous and directly influences individuals’ lives worldwide.

Sentence Sentence Completion


“We will Reduce Taxes” - Identifying Election Pledges with Language Models

no code implementations • Findings (ACL) 2021 • Tommaso Fornaciari, Dirk Hovy, Elin Naurin, Julia Runeson, Robert Thomson, Pankaj Adhikari


SafetyKit: First Aid for Measuring Safety in Open-domain Conversational Systems

no code implementations • ACL 2022 • Emily Dinan, Gavin Abercrombie, A. Bergman, Shannon Spruit, Dirk Hovy, Y-Lan Boureau, Verena Rieser

We then empirically assess the extent to which current tools can measure these effects and current systems display them.

Position


On the Gap between Adoption and Understanding in NLP

no code implementations • Findings (ACL) 2021 • Federico Bianchi, Dirk Hovy


Benchmarking Post-Hoc Interpretability Approaches for Transformer-based Misogyny Detection

1 code implementation • nlppower (ACL) 2022 • Giuseppe Attanasio, Debora Nozza, Eliana Pastor, Dirk Hovy

In this paper, we provide the first benchmark study of interpretability approaches for hate speech detection.

Benchmarking Hate Speech Detection


XLM-EMO: Multilingual Emotion Prediction in Social Media Text

1 code implementation • WASSA (ACL) 2022 • Federico Bianchi, Debora Nozza, Dirk Hovy

Detecting emotion in text allows social and computational scientists to study how people behave and react to online events.


MilaNLP @ WASSA: Does BERT Feel Sad When You Cry?

no code implementations • EACL (WASSA) 2021 • Tommaso Fornaciari, Federico Bianchi, Debora Nozza, Dirk Hovy

The paper describes the MilaNLP team’s submission (Bocconi University, Milan) in the WASSA 2021 Shared Task on Empathy Detection and Emotion Classification.

Emotion Classification Multi-Task Learning


FEEL-IT: Emotion and Sentiment Classification for the Italian Language

1 code implementation • EACL (WASSA) 2021 • Federico Bianchi, Debora Nozza, Dirk Hovy

While sentiment analysis is a popular task to understand people’s reactions online, we often need more nuanced information: is the post negative because the user is angry or sad?

Classification Sentiment Analysis +1


Universal Joy: A Data Set and Results for Classifying Emotions Across Languages

no code implementations • EACL (WASSA) 2021 • Sotiris Lamprinidis, Federico Bianchi, Daniel Hardt, Dirk Hovy

While emotions are universal aspects of human psychology, they are expressed differently across different languages and cultures.

Emotion Classification Zero-Shot Learning


Pipelines for Social Bias Testing of Large Language Models

no code implementations • BigScience (ACL) 2022 • Debora Nozza, Federico Bianchi, Dirk Hovy

We hope to open a discussion on the best methodologies to handle social bias testing in language models.


Conversations as a Source for Teaching Scientific Concepts at Different Education Levels

no code implementations • 16 Apr 2024 • Donya Rooein, Dirk Hovy

While language models hold great promise for educational applications, there are substantial challenges in training them to engage in meaningful and effective conversational teaching, especially when considering the diverse needs of various audiences.


SafetyPrompts: a Systematic Review of Open Datasets for Evaluating and Improving Large Language Model Safety

2 code implementations • 8 Apr 2024 • Paul Röttger, Fabio Pernisi, Bertie Vidgen, Dirk Hovy

Researchers and practitioners have met these concerns by introducing an abundance of new datasets for evaluating and improving LLM safety.

Language Modelling Large Language Model



DADIT: A Dataset for Demographic Classification of Italian Twitter Users and a Comparison of Prediction Methods

no code implementations • 8 Mar 2024 • Lorenzo Lupo, Paul Bose, Mahyar Habibi, Dirk Hovy, Carlo Schwarz

DADIT enables us to train and compare the performance of various state-of-the-art models for the prediction of the gender and age of social media users.


Classist Tools: Social Class Correlates with Performance in NLP

no code implementations • 7 Mar 2024 • Amanda Cercas Curry, Giuseppe Attanasio, Zeerak Talat, Dirk Hovy

We argue for the inclusion of socioeconomic class in future language technologies.

Automatic Speech Recognition Language Modelling +2


Impoverished Language Technology: The Lack of (Social) Class in NLP

no code implementations • 6 Mar 2024 • Amanda Cercas Curry, Zeerak Talat, Dirk Hovy

Since Labov's (1964) foundational work on the social stratification of language, linguistics has dedicated concerted efforts towards understanding the relationships between socio-demographic factors and language production and perception.


Angry Men, Sad Women: Large Language Models Reflect Gendered Stereotypes in Emotion Attribution

no code implementations • 5 Mar 2024 • Flor Miriam Plaza-del-Arco, Amanda Cercas Curry, Alba Curry, Gavin Abercrombie, Dirk Hovy

We then analyze the emotions generated by the models in relation to the gender-event pairs.

Attribute Emotion Recognition


Emotion Analysis in NLP: Trends, Gaps and Roadmap for Future Directions

1 code implementation • 2 Mar 2024 • Flor Miriam Plaza-del-Arco, Alba Curry, Amanda Cercas Curry, Dirk Hovy

We then discuss four lacunae: (1) the absence of demographic and cultural aspects does not account for the variation in how emotions are perceived, but instead assumes they are universally experienced in the same manner; (2) the poor fit of emotion categories from the two main emotion theories to the task; (3) the lack of standardized EA terminology hinders gap identification, comparison, and future goals; and (4) the absence of interdisciplinary research isolates EA from insights in other fields.

Emotion Recognition


Multilingual Speech Models for Automatic Speech Recognition Exhibit Gender Performance Gaps

1 code implementation • 28 Feb 2024 • Giuseppe Attanasio, Beatrice Savoldi, Dennis Fucci, Dirk Hovy

However, the advantaged group varies between languages.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1


Political Compass or Spinning Arrow? Towards More Meaningful Evaluations for Values and Opinions in Large Language Models

1 code implementation • 26 Feb 2024 • Paul Röttger, Valentin Hofmann, Valentina Pyatkin, Musashi Hinck, Hannah Rose Kirk, Hinrich Schütze, Dirk Hovy

Motivated by this discrepancy, we challenge the prevailing constrained evaluation paradigm for values and opinions in LLMs and explore more realistic unconstrained evaluations.

Multiple-choice


"My Answer is C": First-Token Probabilities Do Not Match Text Answers in Instruction-Tuned Language Models

1 code implementation • 22 Feb 2024 • Xinpeng Wang, Bolei Ma, Chengzhi Hu, Leon Weber-Genzel, Paul Röttger, Frauke Kreuter, Dirk Hovy, Barbara Plank

The open-ended nature of language generation makes the evaluation of autoregressive large language models (LLMs) challenging.

Multiple-choice Text Generation


Comparing Pre-trained Human Language Models: Is it Better with Human Context as Groups, Individual Traits, or Both?

no code implementations • 23 Jan 2024 • Nikita Soni, Niranjan Balasubramanian, H. Andrew Schwartz, Dirk Hovy

We compare pre-training models with human context via 1) group attributes, 2) individual users, and 3) a combined approach on 5 user- and document-level tasks.

Age Estimation Language Modelling


Know Your Audience: Do LLMs Adapt to Different Age and Education Levels?

no code implementations • 4 Dec 2023 • Donya Rooein, Amanda Cercas Curry, Dirk Hovy

We find large variations in the readability of the answers by different LLMs.


How to Use Large Language Models for Text Coding: The Case of Fatherhood Roles in Public Policy Documents

1 code implementation • 20 Nov 2023 • Lorenzo Lupo, Oscar Magnusson, Dirk Hovy, Elin Naurin, Lena Wängnerud

Recent advances in large language models (LLMs) like GPT-3 and GPT-4 have opened up new opportunities for text analysis in political science.


Explaining Speech Classification Models via Word-Level Audio Segments and Paralinguistic Features

no code implementations • 14 Sep 2023 • Eliana Pastor, Alkis Koudounas, Giuseppe Attanasio, Dirk Hovy, Elena Baralis

Existing work focuses on a few spoken language understanding (SLU) tasks, and explanations are difficult to interpret for most users.

counterfactual Spoken Language Understanding


XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models

1 code implementation • 2 Aug 2023 • Paul Röttger, Hannah Rose Kirk, Bertie Vidgen, Giuseppe Attanasio, Federico Bianchi, Dirk Hovy

In this paper, we introduce a new test suite called XSTest to identify such eXaggerated Safety behaviours in a systematic way.

Language Modelling


Wisdom of Instruction-Tuned Language Model Crowds. Exploring Model Label Variation

no code implementations • 24 Jul 2023 • Flor Miriam Plaza-del-Arco, Debora Nozza, Dirk Hovy

Recent studies emphasize the importance of considering human label variation in data annotation.

Few-Shot Learning Hate Speech Detection +5


The Ecological Fallacy in Annotation: Modelling Human Label Variation goes beyond Sociodemographics

1 code implementation • 20 Jun 2023 • Matthias Orlikowski, Paul Röttger, Philipp Cimiano, Dirk Hovy

To account for sociodemographics in models of individual annotator behaviour, we introduce group-specific layers to multi-annotator models.


What about em? How Commercial Machine Translation Fails to Handle (Neo-)Pronouns

no code implementations • 25 May 2023 • Anne Lauscher, Debora Nozza, Archie Crowley, Ehm Miltersen, Dirk Hovy

As 3rd-person pronoun usage shifts to include novel forms, e.g., neopronouns, we need more research on identity-inclusive NLP.

Machine Translation Translation


Missing Information, Unresponsive Authors, Experimental Flaws: The Impossibility of Assessing the Reproducibility of Previous Human Evaluations in NLP

no code implementations • 2 May 2023 • Anya Belz, Craig Thomson, Ehud Reiter, Gavin Abercrombie, Jose M. Alonso-Moral, Mohammad Arvan, Anouck Braggaar, Mark Cieliebak, Elizabeth Clark, Kees Van Deemter, Tanvi Dinkar, Ondřej Dušek, Steffen Eger, Qixiang Fang, Mingqi Gao, Albert Gatt, Dimitra Gkatzia, Javier González-Corbelle, Dirk Hovy, Manuela Hürlimann, Takumi Ito, John D. Kelleher, Filip Klubicka, Emiel Krahmer, Huiyuan Lai, Chris van der Lee, Yiru Li, Saad Mahamood, Margot Mieskes, Emiel van Miltenburg, Pablo Mosteiro, Malvina Nissim, Natalie Parde, Ondřej Plátek, Verena Rieser, Jie Ruan, Joel Tetreault, Antonio Toral, Xiaojun Wan, Leo Wanner, Lewis Watson, Diyi Yang

We report our efforts in identifying a set of previous human evaluations in NLP that would be suitable for a coordinated study examining what makes human evaluations in NLP more/less reproducible.


Leveraging Social Interactions to Detect Misinformation on Social Media

no code implementations • 6 Apr 2023 • Tommaso Fornaciari, Luca Luceri, Emilio Ferrara, Dirk Hovy

By keeping track of the sequence of interactions over time, we improve over previous state-of-the-art models.

Misinformation


Consistency is Key: Disentangling Label Variation in Natural Language Processing with Intra-Annotator Agreement

no code implementations • 25 Jan 2023 • Gavin Abercrombie, Verena Rieser, Dirk Hovy

We commonly use agreement measures to assess the utility of judgements made by human annotators in Natural Language Processing (NLP) tasks.


Beyond Digital "Echo Chambers": The Role of Viewpoint Diversity in Political Discussion

1 code implementation • 18 Dec 2022 • Rishav Hada, Amir Ebrahimi Fard, Sarah Shugars, Federico Bianchi, Patricia Rossini, Dirk Hovy, Rebekah Tromble, Nava Tintarev

We find that the diversity scores for both Fragmentation and Representation are lower for immigration than for DST.

Recommendation Systems


Bridging Fairness and Environmental Sustainability in Natural Language Processing

no code implementations • 8 Nov 2022 • Marius Hessenthaler, Emma Strubell, Dirk Hovy, Anne Lauscher

Fairness and environmental impact are important research directions for the sustainable development of artificial intelligence.

Dimensionality Reduction Fairness +4


SocioProbe: What, When, and Where Language Models Learn about Sociodemographics

1 code implementation • 8 Nov 2022 • Anne Lauscher, Federico Bianchi, Samuel Bowman, Dirk Hovy

Our results show that PLMs do encode these sociodemographics, and that this knowledge is sometimes spread across the layers of some of the tested PLMs.


"It's Not Just Hate": A Multi-Dimensional Perspective on Detecting Harmful Speech Online

no code implementations • 28 Oct 2022 • Federico Bianchi, Stefanie Anja Hills, Patricia Rossini, Dirk Hovy, Rebekah Tromble, Nava Tintarev

Well-annotated data is a prerequisite for good Natural Language Processing models.


ProSiT! Latent Variable Discovery with PROgressive SImilarity Thresholds

1 code implementation • 26 Oct 2022 • Tommaso Fornaciari, Dirk Hovy, Federico Bianchi

The most common ways to explore latent document dimensions are topic models and clustering methods.

Clustering Topic Models


Data-Efficient Strategies for Expanding Hate Speech Detection into Under-Resourced Languages

1 code implementation • 20 Oct 2022 • Paul Röttger, Debora Nozza, Federico Bianchi, Dirk Hovy

More data is needed, but annotating hateful content is expensive, time-consuming and potentially harmful to annotators.

Hate Speech Detection


The State of Profanity Obfuscation in Natural Language Processing

1 code implementation • 14 Oct 2022 • Debora Nozza, Dirk Hovy

Work on hate speech has made the consideration of rude and harmful examples in scientific publications inevitable.


Is It Worth the (Environmental) Cost? Limited Evidence for Temporal Adaptation via Continuous Training

no code implementations • 13 Oct 2022 • Giuseppe Attanasio, Debora Nozza, Federico Bianchi, Dirk Hovy

Consequently, we should continuously update our models with new data to expose them to new events and facts.


Can Demographic Factors Improve Text Classification? Revisiting Demographic Adaptation in the Age of Transformers

1 code implementation • 13 Oct 2022 • Chia-Chien Hung, Anne Lauscher, Dirk Hovy, Simone Paolo Ponzetto, Goran Glavaš

Previous work showed that incorporating demographic factors can consistently improve performance for various NLP tasks with traditional NLP models.

Language Modelling Multi-Task Learning +2


On the Limitations of Sociodemographic Adaptation with Transformers

1 code implementation • 1 Aug 2022 • Chia-Chien Hung, Anne Lauscher, Dirk Hovy, Simone Paolo Ponzetto, Goran Glavaš

We adapt the language representations for the sociodemographic dimensions of gender and age, using continuous language modeling and dynamic multi-task learning for adaptation, where we couple language modeling with the prediction of a sociodemographic class.

Language Modelling Multi-Task Learning


Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists

1 code implementation • Findings (ACL) 2022 • Giuseppe Attanasio, Debora Nozza, Dirk Hovy, Elena Baralis

EAR also reveals overfitting terms, i.e., terms most likely to induce bias, to help identify their effect on the model, task, and predictions.

Bias Detection Fairness +1


Welcome to the Modern World of Pronouns: Identity-Inclusive Natural Language Processing beyond Gender

no code implementations • COLING 2022 • Anne Lauscher, Archie Crowley, Dirk Hovy

Based on our observations and ethical considerations, we define a series of desiderata for modeling pronouns in language technology.


Twitter-Demographer: A Flow-based Tool to Enrich Twitter Data

1 code implementation • 26 Jan 2022 • Federico Bianchi, Vincenzo Cutrona, Dirk Hovy

Twitter data have become essential to Natural Language Processing (NLP) and social science research, driving various scientific discoveries in recent years.


Top-Down Influence? Predicting CEO Personality and Risk Impact from Speech Transcripts

no code implementations • 19 Jan 2022 • Kilian Theil, Dirk Hovy, Heiner Stuckenschmidt

How much does a CEO's personality impact the performance of their company?

Management


Two Contrasting Data Annotation Paradigms for Subjective NLP Tasks

1 code implementation • NAACL 2022 • Paul Röttger, Bertie Vidgen, Dirk Hovy, Janet B. Pierrehumbert

To address this issue, we propose two contrasting paradigms for data annotation.

Descriptive valid +1


Language Invariant Properties in Natural Language Processing

1 code implementation • nlppower (ACL) 2022 • Federico Bianchi, Debora Nozza, Dirk Hovy

We introduce language invariant properties, i.e., properties that should not change when we transform text, and show how they can be used to quantitatively evaluate the robustness of transformation algorithms.

Paraphrase Generation Translation


Anticipating Safety Issues in E2E Conversational AI: Framework and Tooling

no code implementations • 7 Jul 2021 • Emily Dinan, Gavin Abercrombie, A. Stevie Bergman, Shannon Spruit, Dirk Hovy, Y-Lan Boureau, Verena Rieser

Over the last several years, end-to-end neural conversational agents have vastly improved in their ability to carry a chit-chat conversation with humans.


Beyond Black & White: Leveraging Annotator Disagreement via Soft-Label Multi-Task Learning

no code implementations • NAACL 2021 • Tommaso Fornaciari, Alexandra Uma, Silviu Paun, Barbara Plank, Dirk Hovy, Massimo Poesio

Supervised learning assumes that a ground truth label exists.

Multi-Task Learning


HONEST: Measuring Hurtful Sentence Completion in Language Models

1 code implementation • NAACL 2021 • Debora Nozza, Federico Bianchi, Dirk Hovy

Our results show that 4.3% of the time, language models complete a sentence with a hurtful word.

Ranked #1 on Hurtful Sentence Completion on HONEST

Hate Speech Detection Hurtful Sentence Completion +3


The Importance of Modeling Social Factors of Language: Theory and Practice

no code implementations • NAACL 2021 • Dirk Hovy, Diyi Yang

We show that current NLP systems systematically break down when faced with interpreting the social factors of language.


BERTective: Language Models and Contextual Information for Deception Detection

no code implementations • EACL 2021 • Tommaso Fornaciari, Federico Bianchi, Massimo Poesio, Dirk Hovy

In most cases, however, the target texts' preceding context is not considered.

Deception Detection


Helpful or Hierarchical? Predicting the Communicative Strategies of Chat Participants, and their Impact on Success

no code implementations • Findings of the Association for Computational Linguistics 2020 • Farzana Rashid, Tommaso Fornaciari, Dirk Hovy, Eduardo Blanco, Fernando Vega-Redondo

When interacting with each other, we motivate, advise, inform, show love or power towards our peers.


"You Sound Just Like Your Father" Commercial Machine Translation Systems Include Stylistic Biases

no code implementations • ACL 2020 • Dirk Hovy, Federico Bianchi, Tommaso Fornaciari

The main goal of machine translation has been to convey the correct content.

Machine Translation Translation


Integrating Ethics into the NLP Curriculum

no code implementations • ACL 2020 • Emily M. Bender, Dirk Hovy, Alexandra Schofield

To raise awareness among future NLP practitioners and prevent inertia in the field, we need to place ethics in the curriculum for all NLP students, not as an elective, but as a core part of their education.

Ethics


Cross-lingual Contextualized Topic Models with Zero-shot Learning

2 code implementations • EACL 2021 • Federico Bianchi, Silvia Terragni, Dirk Hovy, Debora Nozza, Elisabetta Fersini

They all cover the same content, but the linguistic differences make it impossible to use traditional, bag-of-word-based topic models.

Topic Models Transfer Learning +2



Pre-training is a Hot Topic: Contextualized Document Embeddings Improve Topic Coherence

3 code implementations • ACL 2021 • Federico Bianchi, Silvia Terragni, Dirk Hovy

Topic models extract groups of words from documents, whose interpretation as a topic hopefully allows for a better understanding of the data.

Sentence Embeddings Topic Models +1



What the [MASK]? Making Sense of Language-Specific BERT Models

no code implementations • 5 Mar 2020 • Debora Nozza, Federico Bianchi, Dirk Hovy

Driven by the potential of BERT models, the NLP community has started to investigate and generate an abundant number of BERT models that are trained on a particular language, and tested on a specific data domain and task.

Language Modelling


Predictive Biases in Natural Language Processing Models: A Conceptual Framework and Overview

no code implementations • ACL 2020 • Deven Shah, H. Andrew Schwartz, Dirk Hovy

In this paper, we propose a unifying conceptualization: the predictive bias framework for NLP.

Selection bias


Hey Siri. Ok Google. Alexa: A topic modeling of user reviews for smart speakers

no code implementations • WS 2019 • Hanh Nguyen, Dirk Hovy

User reviews provide a significant source of information for companies to understand their market and audience.

Topic Models


Identifying Linguistic Areas for Geolocation

no code implementations • WS 2019 • Tommaso Fornaciari, Dirk Hovy

We create three sets of labels at different levels of granularity, and compare performance of a state-of-the-art geolocation model trained and tested with P2C labels to one with regular k-d tree labels.

Clustering


Geolocation with Attention-Based Multitask Learning Models

no code implementations • WS 2019 • Tommaso Fornaciari, Dirk Hovy

Geolocation, predicting the location of a post based on text and other information, has a huge potential for several social media applications.

Multi-class Classification regression


Dense Node Representation for Geolocation

no code implementations • WS 2019 • Tommaso Fornaciari, Dirk Hovy

Prior research has shown that geolocation can be substantially improved by including user network information.


Women's Syntactic Resilience and Men's Grammatical Luck: Gender-Bias in Part-of-Speech Tagging and Dependency Parsing

no code implementations • ACL 2019 • Aparna Garimella, Carmen Banea, Dirk Hovy, Rada Mihalcea

Several linguistic studies have shown the prevalence of various lexical and grammatical patterns in texts authored by a person of a particular gender, but models for part-of-speech tagging and dependency parsing have still not adapted to account for these differences.

Dependency Parsing Part-Of-Speech Tagging


Increasing In-Class Similarity by Retrofitting Embeddings with Demographic Information

1 code implementation • EMNLP 2018 • Dirk Hovy, Tommaso Fornaciari

We use homophily cues to retrofit text-based author representations with non-linguistic information, and introduce a trade-off parameter.

Attribute General Classification +3


Capturing Regional Variation with Distributed Place Representations and Geographic Retrofitting

no code implementations • EMNLP 2018 • Dirk Hovy, Christoph Purschke

Dialects are one of the main drivers of language variation, a major challenge for natural language processing tools.

Clustering Dimensionality Reduction +2


Predicting News Headline Popularity with Syntactic and Semantic Knowledge Using Multi-Task Learning

no code implementations • EMNLP 2018 • Sotiris Lamprinidis, Daniel Hardt, Dirk Hovy

However, we also find that performance is very similar to that of a simple Logistic Regression model over character n-grams.

Multi-Task Learning Part-Of-Speech Tagging +2


The Social and the Neural Network: How to Make Natural Language Processing about People again

no code implementations • WS 2018 • Dirk Hovy

Over the years, natural language processing has increasingly focused on tasks that can be solved by statistical models, but ignored the social aspects of language.


Comparing Bayesian Models of Annotation

no code implementations • TACL 2018 • Silviu Paun, Bob Carpenter, Jon Chamberlain, Dirk Hovy, Udo Kruschwitz, Massimo Poesio

We evaluate these models along four aspects: comparison to gold labels, predictive accuracy for new annotations, annotator characterization, and item difficulty, using four datasets with varying degrees of noise in the form of random (spammy) annotators.

Model Selection


Multi-Task Learning for Mental Health using Social Media Text

no code implementations • 10 Dec 2017 • Adrian Benton, Margaret Mitchell, Dirk Hovy

We introduce initial groundwork for estimating suicide risk and mental health in a deep learning framework.

Gender Prediction Multi-Task Learning


Huntsville, hospitals, and hockey teams: Names can reveal your location

no code implementations • WS 2017 • Bahar Salehi, Dirk Hovy, Eduard Hovy, Anders Søgaard

Geolocation is the task of identifying a social media user's primary location, and in natural language processing, there is a growing literature on the extent to which automated analysis of social media posts can help.

Knowledge Base Population Recommendation Systems +1


End-to-End Information Extraction without Token-Level Supervision

1 code implementation • WS 2017 • Rasmus Berg Palm, Dirk Hovy, Florian Laws, Ole Winther

End-to-end (E2E) models, which take raw text as input and produce the desired output directly, need not depend on token-level labels.


Multitask Learning for Mental Health Conditions with Limited Social Media Data

no code implementations • EACL 2017 • Adrian Benton, Margaret Mitchell, Dirk Hovy

Gender Prediction Multi-Task Learning


Putting Sarcasm Detection into Context: The Effects of Class Imbalance and Manual Labelling on Supervised Machine Classification of Twitter Conversations

no code implementations • ACL 2016 • Gavin Abercrombie, Dirk Hovy

Anomaly Detection General Classification +2


The Enemy in Your Own Camp: How Well Can We Detect Statistically-Generated Fake Reviews -- An Adversarial Study

no code implementations • ACL 2016 • Dirk Hovy


The Social Impact of Natural Language Processing

no code implementations • ACL 2016 • Dirk Hovy, Shannon L. Spruit


SemEval-2016 Task 10: Detecting Minimal Semantic Units and their Meanings (DiMSUM)

no code implementations • SEMEVAL 2016 • Nathan Schneider, Dirk Hovy, Anders Johannsen, Marine Carpuat

Part-Of-Speech Tagging Word Sense Disambiguation


Hateful Symbols or Hateful People? Predictive Features for Hate Speech Detection on Twitter

1 code implementation • NAACL 2016 • Zeerak Waseem, Dirk Hovy

Hate Speech Detection



Learning a POS tagger for AAVE-like language

no code implementations • NAACL 2016 • Anna Jørgensen, Dirk Hovy, Anders Søgaard

Domain Adaptation POS +1


Exploring Language Variation Across Europe - A Web-based Tool for Computational Sociolinguistics

no code implementations • LREC 2016 • Dirk Hovy, Anders Johannsen

Language varies not only between countries, but also along regional and socio-demographic lines.


The Rating Game: Sentiment Rating Reproducibility from Text

no code implementations • EMNLP 2015 • Lasse Borgholt, Peter Simonsen, Dirk Hovy

Sentiment Analysis


Personality Traits on Twitter - or - How to Get 1,500 Personality Tests in a Week

no code implementations • WS 2015 • Barbara Plank, Dirk Hovy


Challenges of studying and processing dialects in social media

no code implementations • WS 2015 • Anna Jørgensen, Dirk Hovy, Anders Søgaard


If all you have is a bit of the Bible: Learning POS taggers for truly low-resource languages

no code implementations • IJCNLP 2015 • Željko Agić, Dirk Hovy, Anders Søgaard

POS


Demographic Factors Improve Classification Performance

no code implementations • IJCNLP 2015 • Dirk Hovy

Domain Adaptation General Classification +1


Cross-lingual syntactic variation over age and gender

no code implementations • CONLL 2015 • Anders Johannsen, Dirk Hovy, Anders Søgaard


Tagging Performance Correlates with Author Age

no code implementations • IJCNLP 2015 • Dirk Hovy, Anders Søgaard

Domain Adaptation


Mining for unambiguous instances to adapt part-of-speech taggers to new domains

no code implementations • HLT 2015 • Anders Søgaard, Dirk Hovy, Barbara Plank, Héctor Martínez Alonso

POS TAG


More or less supervised supersense tagging of Twitter

no code implementations • SEMEVAL 2014 • Anders Johannsen, Dirk Hovy, Héctor Martínez Alonso, Barbara Plank, Anders Søgaard

Domain Adaptation Named Entity Recognition (NER) +2


Copenhagen-Malmö: Tree Approximations of Semantic Parsing Problems

no code implementations • SEMEVAL 2014 • Natalie Schluter, Anders Søgaard, Jakob Elming, Dirk Hovy, Barbara Plank, Héctor Martínez Alonso, Anders Johannsen, Sigrid Klerke

Semantic Parsing Semantic Role Labeling


Adapting taggers to Twitter with not-so-distant supervision

1 code implementation • COLING 2014 • Barbara Plank, Dirk Hovy, Ryan McDonald, Anders Søgaard


Selection Bias, Label Bias, and Bias in Ground Truth

no code implementations • COLING 2014 • Anders Søgaard, Barbara Plank, Dirk Hovy

Dependency Parsing Domain Adaptation +1


How Well can We Learn Interpretable Entity Types from Text?

no code implementations • ACL 2014 • Dirk Hovy

Question Answering Relation Extraction


Robust Cross-Domain Sentiment Analysis for Low-Resource Languages

no code implementations • WS 2014 • Jakob Elming, Barbara Plank, Dirk Hovy

Domain Adaptation Sentiment Analysis


Linguistically debatable or just plain wrong?

no code implementations • ACL 2014 • Barbara Plank, Dirk Hovy, Anders Søgaard

Part-Of-Speech Tagging


Experiments with crowdsourced re-annotation of a POS tagging data set

no code implementations • ACL 2014 • Dirk Hovy, Barbara Plank, Anders Søgaard

Document Classification Named Entity Recognition (NER) +4


What's in a p-value in NLP?

no code implementations • WS 2014 • Anders Søgaard, Anders Johannsen, Barbara Plank, Dirk Hovy, Héctor Martínez Alonso


When POS data sets don't add up: Combatting sample bias

no code implementations • LREC 2014 • Dirk Hovy, Barbara Plank, Anders Søgaard

We present a systematic study of several Twitter POS data sets, the problems of label and data bias, discuss their effects on model performance, and show how to overcome them to learn models that perform well on various test sets, achieving relative error reduction of up to 21%.

POS TAG


Augmenting English Adjective Senses with Supersenses

1 code implementation • LREC 2014 • Yulia Tsvetkov, Nathan Schneider, Dirk Hovy, Archna Bhatia, Manaal Faruqui, Chris Dyer

We develop a supersense taxonomy for adjectives, based on that of GermaNet, and apply it to English adjectives in WordNet using human annotation and supervised classification.

Classification General Classification


Crowdsourcing and annotating NER for Twitter #drift

no code implementations • LREC 2014 • Hege Fromreide, Dirk Hovy, Anders Søgaard

We present two new NER datasets for Twitter: a manually annotated set of 1,467 tweets (kappa=0.942) and a set of 2,975 expert-corrected, crowdsourced NER annotated tweets from the dataset described in Finin et al. (2010).

NER