Search Results for author: Luis Espinosa-Anke

Found 48 papers, 19 papers with code

Self-Supervised Intermediate Fine-Tuning of Biomedical Language Models for Interpreting Patient Case Descriptions

1 code implementation COLING 2022 Israa Alghanmi, Luis Espinosa-Anke, Steven Schockaert

Interpreting patient case descriptions has emerged as a challenging problem for biomedical NLP, where the aim is typically to predict diagnoses, to recommended treatments, or to answer questions about cases more generally.

WiDe-analysis: Enabling One-click Content Moderation Analysis on Wikipedia's Articles for Deletion

no code implementations10 Aug 2024 Hsuvas Borkakoty, Luis Espinosa-Anke

Content moderation in online platforms is crucial for ensuring activity therein adheres to existing policies, especially as these platforms grow.

Sentiment Analysis

Who is better at math, Jenny or Jingzhen? Uncovering Stereotypes in Large Language Models

1 code implementation9 Jul 2024 Zara Siddique, Liam D. Turner, Luis Espinosa-Anke

Large language models (LLMs) have been shown to propagate and amplify harmful stereotypes, particularly those that disproportionately affect marginalised communities.

Math

CHEW: A Dataset of CHanging Events in Wikipedia

no code implementations27 Jun 2024 Hsuvas Borkakoty, Luis Espinosa-Anke

We introduce CHEW, a novel dataset of changing events in Wikipedia expressed in naturally occurring text.

Hoaxpedia: A Unified Wikipedia Hoax Articles Dataset

1 code implementation3 May 2024 Hsuvas Borkakoty, Luis Espinosa-Anke

Hoaxes are a recognised form of disinformation created deliberately, with potential serious implications in the credibility of reference knowledge resources such as Wikipedia.

Binary Classification Binary text classification +2

Construction Artifacts in Metaphor Identification Datasets

no code implementations1 Nov 2023 Joanne Boisson, Luis Espinosa-Anke, Jose Camacho-Collados

Metaphor identification aims at understanding whether a given expression is used figuratively in context.

RAGAS: Automated Evaluation of Retrieval Augmented Generation

2 code implementations26 Sep 2023 Shahul ES, Jithin James, Luis Espinosa-Anke, Steven Schockaert

We introduce RAGAs (Retrieval Augmented Generation Assessment), a framework for reference-free evaluation of Retrieval Augmented Generation (RAG) pipelines.

RAG Retrieval

WIKITIDE: A Wikipedia-Based Timestamped Definition Pairs Dataset

no code implementations7 Aug 2023 Hsuvas Borkakoty, Luis Espinosa-Anke

A fundamental challenge in the current NLP context, dominated by language models, comes from the inflexibility of current architectures to 'learn' new information.

Continual Learning parameter-efficient fine-tuning

3D-EX : A Unified Dataset of Definitions and Dictionary Examples

1 code implementation6 Aug 2023 Fatemah Almeman, Hadi Sheikhi, Luis Espinosa-Anke

Definitions are a fundamental building block in lexicography, linguistics and computational semantics.

Memorization Word Embeddings

Modelling Commonsense Properties using Pre-Trained Bi-Encoders

1 code implementation COLING 2022 Amit Gajbhiye, Luis Espinosa-Anke, Steven Schockaert

Grasping the commonsense properties of everyday concepts is an important prerequisite to language understanding.

Hypernym Discovery

Deriving Disinformation Insights from Geolocalized Twitter Callouts

1 code implementation6 Aug 2021 David Tuxworth, Dimosthenis Antypas, Luis Espinosa-Anke, Jose Camacho-Collados, Alun Preece, David Rogers

In particular, the analysis in centered on Twitter and disinformation for three European languages: English, French and Spanish.

Language Modelling Specificity +1

Probing Pre-Trained Language Models for Disease Knowledge

1 code implementation Findings (ACL) 2021 Israa Alghanmi, Luis Espinosa-Anke, Steven Schockaert

Pre-trained language models such as ClinicalBERT have achieved impressive results on tasks such as medical Natural Language Inference.

Binary Classification Natural Language Inference

Modelling General Properties of Nouns by Selectively Averaging Contextualised Embeddings

no code implementations4 Dec 2020 Na Li, Zied Bouraoui, Jose Camacho Collados, Luis Espinosa-Anke, Qing Gu, Steven Schockaert

While the success of pre-trained language models has largely eliminated the need for high-quality static word vectors in many NLP applications, such vectors continue to play an important role in tasks where words need to be modelled in the absence of linguistic context.

Knowledge Base Completion

Don't Patronize Me! An Annotated Dataset with Patronizing and Condescending Language towards Vulnerable Communities

no code implementations COLING 2020 Carla Pérez-Almendros, Luis Espinosa-Anke, Steven Schockaert

In this paper, we introduce a new annotated dataset which is aimed at supporting the development of NLP models to identify and categorize language that is patronizing or condescending towards vulnerable communities (e. g. refugees, homeless people, poor families).

Modelling Semantic Categories using Conceptual Neighborhood

no code implementations3 Dec 2019 Zied Bouraoui, Jose Camacho-Collados, Luis Espinosa-Anke, Steven Schockaert

Unfortunately, meaningful regions can be difficult to estimate, especially since we often have few examples of individuals that belong to a given category.

Meemi: A Simple Method for Post-processing and Integrating Cross-lingual Word Embeddings

no code implementations16 Oct 2019 Yerai Doval, Jose Camacho-Collados, Luis Espinosa-Anke, Steven Schockaert

While monolingual word embeddings encode information about words in the context of a particular language, cross-lingual embeddings define a multilingual space where word embeddings from two or more languages are integrated together.

Cross-Lingual Natural Language Inference Cross-Lingual Word Embeddings +3

On the Robustness of Unsupervised and Semi-supervised Cross-lingual Word Embedding Learning

no code implementations LREC 2020 Yerai Doval, Jose Camacho-Collados, Luis Espinosa-Anke, Steven Schockaert

Cross-lingual word embeddings are vector representations of words in different languages where words with similar meaning are represented by similar vectors, regardless of the language.

Cross-Lingual Word Embeddings Word Embeddings

Relational Word Embeddings

1 code implementation ACL 2019 Jose Camacho-Collados, Luis Espinosa-Anke, Steven Schockaert

While word embeddings have been shown to implicitly encode various forms of attributional knowledge, the extent to which they capture relational information is far more limited.

Word Embeddings

Interpretable Emoji Prediction via Label-Wise Attention LSTMs

no code implementations EMNLP 2018 Francesco Barbieri, Luis Espinosa-Anke, Jose Camacho-Collados, Steven Schockaert, Horacio Saggion

Human language has evolved towards newer forms of communication such as social media, where emojis (i. e., ideograms bearing a visual meaning) play a key role.

Emotion Recognition Information Retrieval +3

SeVeN: Augmenting Word Embeddings with Unsupervised Relation Vectors

1 code implementation COLING 2018 Luis Espinosa-Anke, Steven Schockaert

For example, by examining clusters of relation vectors, we observe that relational similarities can be identified at a more abstract level than with traditional word vector differences.

Relation Text Categorization +2

Natural Language Processing for Music Knowledge Discovery

1 code implementation6 Jul 2018 Sergio Oramas, Luis Espinosa-Anke, Francisco Gómez, Xavier Serra

Today, a massive amount of musical knowledge is stored in written form, with testimonies dated as far back as several centuries ago.

Graph Generation Sentiment Analysis

The Interplay between Lexical Resources and Natural Language Processing

1 code implementation NAACL 2018 Jose Camacho-Collados, Luis Espinosa-Anke, Mohammad Taher Pilehvar

Incorporating linguistic, world and common sense knowledge into AI/NLP systems is currently an important research area, with several open problems and challenges.

Common Sense Reasoning

Syntactically Aware Neural Architectures for Definition Extraction

no code implementations NAACL 2018 Luis Espinosa-Anke, Steven Schockaert

Automatically identifying definitional knowledge in text corpora (Definition Extraction or DE) is an important task with direct applications in, among others, Automatic Glossary Generation, Taxonomy Learning, Question Answering and Semantic Search.

Benchmarking Binary Classification +7

Cannot find the paper you are looking for? You can Submit a new open access paper.