Search Results for author: Jose Camacho-Collados

Found 53 papers, 24 papers with code

Definition Extraction Feature Analysis: From Canonical to Naturally-Occurring Definitions

no code implementations COLING (CogALex) 2020 Mireia Roig Mirapeix, Luis Espinosa Anke, Jose Camacho-Collados

Textual definitions constitute a fundamental source of knowledge when seeking the meaning of words, and they are the cornerstone of lexical resources like glossaries, dictionaries, encyclopedia or thesauri.

Definition Extraction

On the Cross-lingual Transferability of Contextualized Sense Embeddings

no code implementations EMNLP (MRL) 2021 Kiamehr Rezaee, Daniel Loureiro, Jose Camacho-Collados, Mohammad Taher Pilehvar

In this paper we analyze the extent to which contextualized sense embeddings, i. e., sense embeddings that are computed based on contextualized word embeddings, are transferable across languages. To this end, we compiled a unified cross-lingual benchmark for Word Sense Disambiguation.

Word Embeddings Word Sense Disambiguation

Assessing the Limits of the Distributional Hypothesis in Semantic Spaces: Trait-based Relational Knowledge and the Impact of Co-occurrences

1 code implementation *SEM (NAACL) 2022 Mark Anderson, Jose Camacho-Collados

The increase in performance in NLP due to the prevalence of distributional models and deep learning has brought with it a reciprocal decrease in interpretability.

Politics, Sentiment and Virality: A Large-Scale Multilingual Twitter Analysis in Greece, Spain and United Kingdom

1 code implementation1 Feb 2022 Dimosthenis Antypas, Alun Preece, Jose Camacho-Collados

Social media has become extremely influential when it comes to policy making in modern societies especially in the western world (e. g., 48% of Europeans use social media every day or almost every day).

Sentiment Analysis

Guiding Generative Language Models for Data Augmentation in Few-Shot Text Classification

no code implementations17 Nov 2021 Aleksandra Edwards, Asahi Ushio, Jose Camacho-Collados, Hélène de Ribaupierre, Alun Preece

Data augmentation techniques are widely used for enhancing the performance of machine learning models by tackling class imbalance issues and data sparsity.

Active Learning Classification +3

Distilling Relation Embeddings from Pre-trained Language Models

1 code implementation21 Sep 2021 Asahi Ushio, Jose Camacho-Collados, Steven Schockaert

Among others, this makes it possible to distill high-quality word vectors from pre-trained language models.

Knowledge Graphs Language Modelling +1

Deriving Disinformation Insights from Geolocalized Twitter Callouts

1 code implementation6 Aug 2021 David Tuxworth, Dimosthenis Antypas, Luis Espinosa-Anke, Jose Camacho-Collados, Alun Preece, David Rogers

In particular, the analysis in centered on Twitter and disinformation for three European languages: English, French and Spanish.

Language Modelling Word Embeddings

LMMS Reloaded: Transformer-based Sense Embeddings for Disambiguation and Beyond

1 code implementation26 May 2021 Daniel Loureiro, Alípio Mário Jorge, Jose Camacho-Collados

Prior work has shown that these contextual representations can be used to accurately represent large sense inventories as sense embeddings, to the extent that a distance-based solution to Word Sense Disambiguation (WSD) tasks outperforms models trained specifically for the task.

Natural Language Processing Word Sense Disambiguation

XLM-T: Multilingual Language Models in Twitter for Sentiment Analysis and Beyond

1 code implementation25 Apr 2021 Francesco Barbieri, Luis Espinosa Anke, Jose Camacho-Collados

Language models are ubiquitous in current NLP, and their multilingual capacity has recently attracted considerable attention.

Language Modelling Sentiment Analysis

Go Simple and Pre-Train on Domain-Specific Corpora: On the Role of Training Data for Text Classification

no code implementations COLING 2020 Aleksandra Edwards, Jose Camacho-Collados, H{\'e}l{\`e}ne De Ribaupierre, Alun Preece

Pre-trained language models provide the foundations for state-of-the-art performance across a wide range of natural language processing tasks, including text classification.

Classification Language Modelling +3

Understanding the Source of Semantic Regularities in Word Embeddings

no code implementations CONLL 2020 Hsiao-Yu Chiang, Jose Camacho-Collados, Zachary Pardos

In this paper, we investigate the hypothesis that examples of a lexical relation in a corpus are fundamental to a neural word embedding{'}s ability to complete analogies involving the relation.

Word Embeddings

Analysis and Evaluation of Language Models for Word Sense Disambiguation

1 code implementation CL (ACL) 2021 Daniel Loureiro, Kiamehr Rezaee, Mohammad Taher Pilehvar, Jose Camacho-Collados

We also perform an in-depth comparison of the two main language model based WSD strategies, i. e., fine-tuning and feature extraction, finding that the latter approach is more robust with respect to sense bias and it can better exploit limited available training data.

Language Modelling Word Sense Disambiguation

WiC-TSV: An Evaluation Benchmark for Target Sense Verification of Words in Context

1 code implementation EACL 2021 Anna Breit, Artem Revenko, Kiamehr Rezaee, Mohammad Taher Pilehvar, Jose Camacho-Collados

More specifically, we introduce a framework for Target Sense Verification of Words in Context which grounds its uniqueness in the formulation as a binary classification task thus being independent of external sense inventories, and the coverage of various domains.

 Ranked #1 on Entity Linking on WiC-TSV (Task 3 Accuracy: all metric)

Entity Linking Word Sense Disambiguation

Don't Neglect the Obvious: On the Role of Unambiguous Words in Word Sense Disambiguation

1 code implementation EMNLP 2020 Daniel Loureiro, Jose Camacho-Collados

State-of-the-art methods for Word Sense Disambiguation (WSD) combine two different features: the power of pre-trained language models and a propagation method to extend the coverage of such models.

Word Sense Disambiguation

Modelling Semantic Categories using Conceptual Neighborhood

no code implementations3 Dec 2019 Zied Bouraoui, Jose Camacho-Collados, Luis Espinosa-Anke, Steven Schockaert

Unfortunately, meaningful regions can be difficult to estimate, especially since we often have few examples of individuals that belong to a given category.

Natural Language Processing

Inducing Relational Knowledge from BERT

no code implementations28 Nov 2019 Zied Bouraoui, Jose Camacho-Collados, Steven Schockaert

Starting from a few seed instances of a given relation, we first use a large text corpus to find sentences that are likely to express this relation.

Language Modelling Natural Language Processing +1

Meemi: A Simple Method for Post-processing and Integrating Cross-lingual Word Embeddings

1 code implementation16 Oct 2019 Yerai Doval, Jose Camacho-Collados, Luis Espinosa-Anke, Steven Schockaert

While monolingual word embeddings encode information about words in the context of a particular language, cross-lingual embeddings define a multilingual space where word embeddings from two or more languages are integrated together.

Cross-Lingual Natural Language Inference Cross-Lingual Word Embeddings +4

On the Robustness of Unsupervised and Semi-supervised Cross-lingual Word Embedding Learning

no code implementations LREC 2020 Yerai Doval, Jose Camacho-Collados, Luis Espinosa-Anke, Steven Schockaert

Cross-lingual word embeddings are vector representations of words in different languages where words with similar meaning are represented by similar vectors, regardless of the language.

Cross-Lingual Word Embeddings Word Embeddings

Relational Word Embeddings

1 code implementation ACL 2019 Jose Camacho-Collados, Luis Espinosa-Anke, Steven Schockaert

While word embeddings have been shown to implicitly encode various forms of attributional knowledge, the extent to which they capture relational information is far more limited.

Word Embeddings

Interpretable Emoji Prediction via Label-Wise Attention LSTMs

no code implementations EMNLP 2018 Francesco Barbieri, Luis Espinosa-Anke, Jose Camacho-Collados, Steven Schockaert, Horacio Saggion

Human language has evolved towards newer forms of communication such as social media, where emojis (i. e., ideograms bearing a visual meaning) play a key role.

Emotion Recognition Information Retrieval +3

WiC: the Word-in-Context Dataset for Evaluating Context-Sensitive Meaning Representations

1 code implementation NAACL 2019 Mohammad Taher Pilehvar, Jose Camacho-Collados

By design, word embeddings are unable to model the dynamic nature of words' semantics, i. e., the property of words to correspond to potentially different meanings.

Word Embeddings Word Similarity

The Interplay between Lexical Resources and Natural Language Processing

no code implementations NAACL 2018 Jose Camacho-Collados, Luis Espinosa-Anke, Mohammad Taher Pilehvar

Incorporating linguistic, world and common sense knowledge into AI/NLP systems is currently an important research area, with several open problems and challenges.

Common Sense Reasoning Natural Language Processing

How Gender and Skin Tone Modifiers Affect Emoji Semantics in Twitter

1 code implementation SEMEVAL 2018 Francesco Barbieri, Jose Camacho-Collados

Our analyses reveal that some stereotypes related to the skin color and gender seem to be reflected on the use of these modifiers.

Word Embeddings

From Word to Sense Embeddings: A Survey on Vector Representations of Meaning

no code implementations10 May 2018 Jose Camacho-Collados, Mohammad Taher Pilehvar

Over the past years, distributed semantic representations have proved to be effective and flexible keepers of prior knowledge to be integrated into downstream applications.

A Short Survey on Sense-Annotated Corpora

no code implementations LREC 2020 Tommaso Pasini, Jose Camacho-Collados

Large sense-annotated datasets are increasingly necessary for training deep supervised systems in Word Sense Disambiguation.

Word Sense Disambiguation

SemEval-2017 Task 2: Multilingual and Cross-lingual Semantic Word Similarity

no code implementations SEMEVAL 2017 Jose Camacho-Collados, Mohammad Taher Pilehvar, Nigel Collier, Roberto Navigli

This paper introduces a new task on Multilingual and Cross-lingual SemanticThis paper introduces a new task on Multilingual and Cross-lingual Semantic Word Similarity which measures the semantic similarity of word pairs within and across five languages: English, Farsi, German, Italian and Spanish.

Information Retrieval Machine Translation +7

On the Role of Text Preprocessing in Neural Network Architectures: An Evaluation Study on Text Categorization and Sentiment Analysis

3 code implementations WS 2018 Jose Camacho-Collados, Mohammad Taher Pilehvar

In this paper we investigate the impact of simple text preprocessing decisions (particularly tokenizing, lemmatizing, lowercasing and multiword grouping) on the performance of a standard neural text classifier.

Natural Language Processing Sentiment Analysis +2

EuroSense: Automatic Harvesting of Multilingual Sense Annotations from Parallel Text

no code implementations ACL 2017 Claudio Delli Bovi, Jose Camacho-Collados, Aless Raganato, ro, Roberto Navigli

Parallel corpora are widely used in a variety of Natural Language Processing tasks, from Machine Translation to cross-lingual Word Sense Disambiguation, where parallel sentences can be exploited to automatically generate high-quality sense annotations on a large scale.

Entity Linking Machine Translation +3

BabelDomains: Large-Scale Domain Labeling of Lexical Resources

no code implementations EACL 2017 Jose Camacho-Collados, Roberto Navigli

In this paper we present BabelDomains, a unified resource which provides lexical items with information about domains of knowledge.

Domain Adaptation Hypernym Discovery +3

Why we have switched from building full-fledged taxonomies to simply detecting hypernymy relations

no code implementations12 Mar 2017 Jose Camacho-Collados

The study of taxonomies and hypernymy relations has been extensive on the Natural Language Processing (NLP) literature.

Natural Language Processing

Semantic Indexing of Multilingual Corpora and its Application on the History Domain

no code implementations WS 2016 Aless Raganato, ro, Jose Camacho-Collados, Antonio Raganato, Yunseo Joung

The increasing amount of multilingual text collections available in different domains makes its automatic processing essential for the development of a given field.


Cannot find the paper you are looking for? You can Submit a new open access paper.