no code implementations • SemEval (NAACL) 2022 • Carla Perez-Almendros, Luis Espinosa-Anke, Steven Schockaert
This paper presents an overview of Task 4 at SemEval-2022, which was focused on detecting Patronizing and Condescending Language (PCL) towards vulnerable communities.
no code implementations • SemEval (NAACL) 2022 • Joanne Boisson, Jose Camacho-Collados, Luis Espinosa-Anke
This paper describes the experiments ran for SemEval-2022 Task 2, subtask A, zero-shot and one-shot settings for idiomaticity detection.
1 code implementation • COLING 2022 • Israa Alghanmi, Luis Espinosa-Anke, Steven Schockaert
Interpreting patient case descriptions has emerged as a challenging problem for biomedical NLP, where the aim is typically to predict diagnoses, to recommended treatments, or to answer questions about cases more generally.
no code implementations • 10 Aug 2024 • Hsuvas Borkakoty, Luis Espinosa-Anke
Content moderation in online platforms is crucial for ensuring activity therein adheres to existing policies, especially as these platforms grow.
1 code implementation • 9 Jul 2024 • Zara Siddique, Liam D. Turner, Luis Espinosa-Anke
Large language models (LLMs) have been shown to propagate and amplify harmful stereotypes, particularly those that disproportionately affect marginalised communities.
no code implementations • 27 Jun 2024 • Hsuvas Borkakoty, Luis Espinosa-Anke
We introduce CHEW, a novel dataset of changing events in Wikipedia expressed in naturally occurring text.
1 code implementation • 3 May 2024 • Hsuvas Borkakoty, Luis Espinosa-Anke
Hoaxes are a recognised form of disinformation created deliberately, with potential serious implications in the credibility of reference knowledge resources such as Wikipedia.
no code implementations • 1 Nov 2023 • Joanne Boisson, Luis Espinosa-Anke, Jose Camacho-Collados
Metaphor identification aims at understanding whether a given expression is used figuratively in context.
no code implementations • 23 Oct 2023 • Dimosthenis Antypas, Asahi Ushio, Francesco Barbieri, Leonardo Neves, Kiamehr Rezaee, Luis Espinosa-Anke, Jiaxin Pei, Jose Camacho-Collados
Despite its relevance, the maturity of NLP for social media pales in comparison with general-purpose models, metrics and benchmarks.
2 code implementations • 26 Sep 2023 • Shahul ES, Jithin James, Luis Espinosa-Anke, Steven Schockaert
We introduce RAGAs (Retrieval Augmented Generation Assessment), a framework for reference-free evaluation of Retrieval Augmented Generation (RAG) pipelines.
no code implementations • 7 Aug 2023 • Hsuvas Borkakoty, Luis Espinosa-Anke
A fundamental challenge in the current NLP context, dominated by language models, comes from the inflexibility of current architectures to 'learn' new information.
1 code implementation • 6 Aug 2023 • Fatemah Almeman, Hadi Sheikhi, Luis Espinosa-Anke
Definitions are a fundamental building block in lexicography, linguistics and computational semantics.
1 code implementation • COLING 2022 • Amit Gajbhiye, Luis Espinosa-Anke, Steven Schockaert
Grasping the commonsense properties of everyday concepts is an important prerequisite to language understanding.
1 code implementation • 29 Jun 2022 • Jose Camacho-Collados, Kiamehr Rezaee, Talayeh Riahi, Asahi Ushio, Daniel Loureiro, Dimosthenis Antypas, Joanne Boisson, Luis Espinosa-Anke, Fangyu Liu, Eugenio Martínez-Cámara, Gonzalo Medina, Thomas Buhrmann, Leonardo Neves, Francesco Barbieri
In this paper we present TweetNLP, an integrated platform for Natural Language Processing (NLP) in social media.
no code implementations • *SEM (NAACL) 2022 • Luis Espinosa-Anke, Alexander Shvets, Alireza Mohammadshahi, James Henderson, Leo Wanner
Recognizing and categorizing lexical collocations in context is useful for language learning, dictionary compilation and downstream NLP.
1 code implementation • 6 Aug 2021 • David Tuxworth, Dimosthenis Antypas, Luis Espinosa-Anke, Jose Camacho-Collados, Alun Preece, David Rogers
In particular, the analysis in centered on Twitter and disinformation for three European languages: English, French and Spanish.
1 code implementation • Findings (ACL) 2021 • Israa Alghanmi, Luis Espinosa-Anke, Steven Schockaert
Pre-trained language models such as ClinicalBERT have achieved impressive results on tasks such as medical Natural Language Inference.
1 code implementation • ACL 2021 • Asahi Ushio, Luis Espinosa-Anke, Steven Schockaert, Jose Camacho-Collados
Analogies play a central role in human commonsense reasoning.
no code implementations • 4 Dec 2020 • Na Li, Zied Bouraoui, Jose Camacho Collados, Luis Espinosa-Anke, Qing Gu, Steven Schockaert
While the success of pre-trained language models has largely eliminated the need for high-quality static word vectors in many NLP applications, such vectors continue to play an important role in tasks where words need to be modelled in the absence of linguistic context.
no code implementations • SEMEVAL 2020 • Shelan Jeawak, Luis Espinosa-Anke, Steven Schockaert
We describe the system submitted to SemEval-2020 Task 6, Subtask 1.
no code implementations • COLING 2020 • Carla Pérez-Almendros, Luis Espinosa-Anke, Steven Schockaert
In this paper, we introduce a new annotated dataset which is aimed at supporting the development of NLP models to identify and categorize language that is patronizing or condescending towards vulnerable communities (e. g. refugees, homeless people, poor families).
no code implementations • 11 Nov 2020 • Jordi Porta-Zamorano, Luis Espinosa-Anke
We present the results of the CAPITEL-EVAL shared task, held in the context of the IberLEF 2020 competition series.
1 code implementation • SMM4H (COLING) 2020 • David Owen, Jose Camacho Collados, Luis Espinosa-Anke
Depression and anxiety are psychiatric disorders that are observed in many areas of everyday life.
2 code implementations • Findings of the Association for Computational Linguistics 2020 • Francesco Barbieri, Jose Camacho-Collados, Leonardo Neves, Luis Espinosa-Anke
The experimental landscape in natural language processing for social media is too fragmented.
Ranked #3 on Sentiment Analysis on TweetEval
no code implementations • 3 Dec 2019 • Zied Bouraoui, Jose Camacho-Collados, Luis Espinosa-Anke, Steven Schockaert
Unfortunately, meaningful regions can be difficult to estimate, especially since we often have few examples of individuals that belong to a given category.
no code implementations • 16 Oct 2019 • Yerai Doval, Jose Camacho-Collados, Luis Espinosa-Anke, Steven Schockaert
While monolingual word embeddings encode information about words in the context of a particular language, cross-lingual embeddings define a multilingual space where word embeddings from two or more languages are integrated together.
Cross-Lingual Natural Language Inference Cross-Lingual Word Embeddings +3
no code implementations • LREC 2020 • Yerai Doval, Jose Camacho-Collados, Luis Espinosa-Anke, Steven Schockaert
Cross-lingual word embeddings are vector representations of words in different languages where words with similar meaning are represented by similar vectors, regardless of the language.
1 code implementation • ACL 2019 • Jose Camacho-Collados, Luis Espinosa-Anke, Steven Schockaert
While word embeddings have been shown to implicitly encode various forms of attributional knowledge, the extent to which they capture relational information is far more limited.
no code implementations • SEMEVAL 2019 • Carla P{\'e}rez-Almendros, Luis Espinosa-Anke, Steven Schockaert
This paper summarizes our contribution to the Hyperpartisan News Detection task in SemEval 2019.
1 code implementation • 17 May 2019 • Jose Camacho-Collados, Yerai Doval, Eugenio Martínez-Cámara, Luis Espinosa-Anke, Francesco Barbieri, Steven Schockaert
Cross-lingual embeddings represent the meaning of words from different languages in the same vector space.
no code implementations • EMNLP 2018 • Francesco Barbieri, Luis Espinosa-Anke, Jose Camacho-Collados, Steven Schockaert, Horacio Saggion
Human language has evolved towards newer forms of communication such as social media, where emojis (i. e., ideograms bearing a visual meaning) play a key role.
1 code implementation • EMNLP 2018 • Yerai Doval, Jose Camacho-Collados, Luis Espinosa-Anke, Steven Schockaert
Cross-lingual word embeddings are becoming increasingly important in multilingual NLP.
1 code implementation • COLING 2018 • Luis Espinosa-Anke, Steven Schockaert
For example, by examining clusters of relation vectors, we observe that relational similarities can be identified at a more abstract level than with traditional word vector differences.
1 code implementation • 6 Jul 2018 • Sergio Oramas, Luis Espinosa-Anke, Francisco Gómez, Xavier Serra
Today, a massive amount of musical knowledge is stored in written form, with testimonies dated as far back as several centuries ago.
1 code implementation • NAACL 2018 • Jose Camacho-Collados, Luis Espinosa-Anke, Mohammad Taher Pilehvar
Incorporating linguistic, world and common sense knowledge into AI/NLP systems is currently an important research area, with several open problems and challenges.
no code implementations • SEMEVAL 2018 • Francesco Barbieri, Jose Camacho-Collados, Francesco Ronzano, Luis Espinosa-Anke, Miguel Ballesteros, Valerio Basile, Viviana Patti, Horacio Saggion
This paper describes the results of the first Shared Task on Multilingual Emoji Prediction, organized as part of SemEval 2018.
no code implementations • NAACL 2018 • Luis Espinosa-Anke, Steven Schockaert
Automatically identifying definitional knowledge in text corpora (Definition Extraction or DE) is an important task with direct applications in, among others, Automatic Glossary Generation, Taxonomy Learning, Question Answering and Semantic Search.
no code implementations • SEMEVAL 2018 • Jose Camacho-Collados, Claudio Delli Bovi, Luis Espinosa-Anke, Sergio Oramas, Tommaso Pasini, Enrico Santus, Vered Shwartz, Roberto Navigli, Horacio Saggion
This paper describes the SemEval 2018 Shared Task on Hypernym Discovery.
no code implementations • WS 2017 • Francesco Barbieri, Luis Espinosa-Anke, Miguel Ballesteros, Juan Soler-Company, Horacio Saggion
Videogame streaming platforms have become a paramount example of noisy user-generated text.
no code implementations • COLING 2016 • Luis Espinosa-Anke, Jose Camacho-Collados, Sara Rodr{\'\i}guez-Fern{\'a}ndez, Horacio Saggion, Leo Wanner
WordNet is probably the best known lexical resource in Natural Language Processing.
1 code implementation • 8 Jun 2016 • Luis Espinosa-Anke, Roberto Carlini, Horacio Saggion, Francesco Ronzano
We present DefExt, an easy to use semi supervised Definition Extraction Tool.