Search Results for author: Paolo Rosso

Found 87 papers, 21 papers with code

Do Dependency Relations Help in the Task of Stance Detection?

no code implementations insights (ACL) 2022 Alessandra Teresa Cignarella, Cristina Bosco, Paolo Rosso

Furthermore, we study the phenomenon of stance with respect to six different targets – one per language, and two different for Italian – employing a variety of machine learning algorithms that primarily exploit morphological and syntactic knowledge as features, represented throughout the format of Universal Dependencies.

Stance Detection

UPV at the Arabic Hate Speech 2022 Shared Task: Offensive Language and Hate Speech Detection using Transformers and Ensemble Models

1 code implementation OSACT (LREC) 2022 Angel Felipe Magnossão de Paula, Paolo Rosso, Imene Bensalem, Wajdi Zaghouani

This paper describes our participation in the shared task Fine-Grained Hate Speech Detection on Arabic Twitter at the 5th Workshop on Open-Source Arabic Corpora and Processing Tools (OSACT).

Hate Speech Detection

Unsupervised Embeddings with Graph Auto-Encoders for Multi-domain and Multilingual Hate Speech Detection

no code implementations LREC 2022 Gretel Liz De la Peña Sarracén, Paolo Rosso

Hate speech detection is a prominent and challenging task, since hate messages are often expressed in subtle ways and with characteristics that may vary depending on the author.

Hate Speech Detection

SemEval-2022 Task 5: Multimedia Automatic Misogyny Identification

no code implementations SemEval (NAACL) 2022 Elisabetta Fersini, Francesca Gasparini, Giulia Rizzi, Aurora Saibene, Berta Chulvi, Paolo Rosso, Alyssa Lees, Jeffrey Sorensen

The paper describes the SemEval-2022 Task 5: Multimedia Automatic Misogyny Identification (MAMI), which explores the detection of misogynous memes on the web by taking advantage of available texts and images.

Arabic WordNet: New Content and New Applications

no code implementations GWC 2016 Yasser Regragui, Lahsen Abouenour, Fettoum Krieche, Karim Bouzoubaa, Paolo Rosso

We also present how this content helps in the implementation of new Arabic NLP applications, especially for Question Answering (QA) systems.

Question Answering

Multi-Aspect Transfer Learning for Detecting Low Resource Mental Disorders on Social Media

1 code implementation LREC 2022 Ana Sabina Uban, Berta Chulvi, Paolo Rosso

We propose that transfer learning with linguistic features can be useful for approaching both the technical problem of improving mental disorder detection in the context of data scarcity, and the clinical problem of understanding the overlapping symptoms between certain disorders.

Transfer Learning

Understanding Patterns of Anorexia Manifestations in Social Media Data with Deep Learning

no code implementations NAACL (CLPsych) 2021 Ana Sabina Uban, Berta Chulvi, Paolo Rosso

Eating disorders are a growing problem especially among young people, yet they have been under-studied in computational research compared to other mental health disorders such as depression.

RoCode: A Dataset for Measuring Code Intelligence from Problem Definitions in Romanian

1 code implementation20 Feb 2024 Adrian Cosma, Bogdan Iordache, Paolo Rosso

Recently, large language models (LLMs) have become increasingly powerful and have become capable of solving a plethora of tasks through proper instructions in natural language.

Code Generation

Toxic language detection: a systematic review of Arabic datasets

1 code implementation12 Dec 2023 Imene Bensalem, Paolo Rosso, Hanane Zitouni

The detection of toxic language in the Arabic language has emerged as an active area of research in recent years, and reviewing the existing datasets employed for training the developed solutions has become a pressing need.

Overview of AuTexTification at IberLEF 2023: Detection and Attribution of Machine-Generated Text in Multiple Domains

1 code implementation20 Sep 2023 Areg Mikael Sarvazyan, José Ángel González, Marc Franco-Salvador, Francisco Rangel, Berta Chulvi, Paolo Rosso

This paper presents the overview of the AuTexTification shared task as part of the IberLEF 2023 Workshop in Iberian Languages Evaluation Forum, within the framework of the SEPLN 2023 conference.

Attribute Language Modelling +2

Mitigating Negative Transfer with Task Awareness for Sexism, Hate Speech, and Toxic Language Detection

1 code implementation7 Jul 2023 Angel Felipe Magnossão de Paula, Paolo Rosso, Damiano Spina

Therefore another solution, based on the sharing of information between tasks, has been developed: Multi-Task Learning (MTL).

Multi-Task Learning

Transformers and Ensemble methods: A solution for Hate Speech Detection in Arabic languages

1 code implementation17 Mar 2023 Angel Felipe Magnossão de Paula, Imene Bensalem, Paolo Rosso, Wajdi Zaghouani

This paper describes our participation in the shared task of hate speech detection, which is one of the subtasks of the CERIST NLP Challenge 2022.

Hate Speech Detection

Multilingual Detection of Check-Worthy Claims using World Languages and Adapter Fusion

no code implementations13 Jan 2023 Ipek Baris Schlicht, Lucie Flek, Paolo Rosso

This paper proposes cross-training adapters on a subset of world languages, combined by adapter fusion, to detect claims emerging globally in multiple languages.

It's Just a Matter of Time: Detecting Depression with Time-Enriched Multimodal Transformers

1 code implementation13 Jan 2023 Ana-Maria Bucur, Adrian Cosma, Paolo Rosso, Liviu P. Dinu

In this work, we propose a flexible time-enriched multimodal transformer architecture for detecting depression from social media posts, using pretrained models for extracting image and text embeddings.

Depression Detection

Fake News and Hate Speech: Language in Common

no code implementations5 Dec 2022 Berta Chulvi, Alejandro Toselli, Paolo Rosso

In this paper we raise the research question of whether fake news and hate speech spreaders share common patterns in language.

An End-to-End Set Transformer for User-Level Classification of Depression and Gambling Disorder

no code implementations2 Jul 2022 Ana-Maria Bucur, Adrian Cosma, Liviu P. Dinu, Paolo Rosso

This work proposes a transformer architecture for user-level classification of gambling addiction and depression that is trainable end-to-end.

Sentence

Cryptocurrency Bubble Detection: A New Stock Market Dataset, Financial Task & Hyperbolic Models

1 code implementation NAACL 2022 Ramit Sawhney, Shivam Agarwal, Vivek Mittal, Paolo Rosso, Vikram Nanda, Sudheer Chava

Further, we develop a set of sequence-to-sequence hyperbolic models suited to this multi-span identification task based on the power-law dynamics of cryptocurrencies and user behavior on social media.

Unsupervised Ranking and Aggregation of Label Descriptions for Zero-Shot Classifiers

no code implementations20 Apr 2022 Angelo Basile, Marc Franco-Salvador, Paolo Rosso

Zero-shot text classifiers based on label descriptions embed an input text and a set of labels into the same space: measures such as cosine similarity can then be used to select the most similar label description to the input text as the predicted label.

UPV at TREC Health Misinformation Track 2021 Ranking with SBERT and Quality Estimators

no code implementations11 Dec 2021 Ipek Baris Schlicht, Angel Felipe Magnossão de Paula, Paolo Rosso

Health misinformation on search engines is a significant problem that could negatively affect individuals or public health.

Misinformation

Studying Fake News Spreading, Polarisation Dynamics, and Manipulation by Bots: a Tale of Networks and Language

no code implementations13 Sep 2021 Giancarlo Ruffo, Alfonso Semeraro, Anastasia Giachanou, Paolo Rosso

With the explosive growth of online social media, the ancient problem of information disorders interfering with news diffusion has surfaced with a renewed intensity threatening our democracies, public health, and news outlets' credibility.

Misinformation Rumour Detection

FakeFlow: Fake News Detection by Modeling the Flow of Affective Information

1 code implementation EACL 2021 Bilal Ghanem, Simone Paolo Ponzetto, Paolo Rosso, Francisco Rangel

To capture this, we propose in this paper to model the flow of affective information in fake news articles using a neural architecture.

Fake News Detection

Analysis and tuning of hierarchical topic models based on Renyi entropy approach

no code implementations19 Jan 2021 Sergei Koltcov, Vera Ignatenko, Maxim Terpilovskii, Paolo Rosso

In this paper, we propose a Renyi entropy-based approach for a partial solution to the above problem.

Topic Models

Multilingual Irony Detection with Dependency Syntax and Neural Models

1 code implementation COLING 2020 Alessandra Teresa Cignarella, Valerio Basile, Manuela Sanguinetti, Cristina Bosco, Paolo Rosso, Farah Benamara

This paper presents an in-depth investigation of the effectiveness of dependency-based syntactic features on the irony detection task in a multilingual perspective (English, Spanish, French and Italian).

Word Embeddings

LIMSI_UPV at SemEval-2020 Task 9: Recurrent Convolutional Neural Network for Code-mixed Sentiment Analysis

1 code implementation30 Aug 2020 Somnath Banerjee, Sahar Ghannay, Sophie Rosset, Anne Vilnat, Paolo Rosso

This paper describes the participation of LIMSI UPV team in SemEval-2020 Task 9: Sentiment Analysis for Code-Mixed Social Media Text.

Sentiment Analysis

NodeSketch: Highly-Efficient Graph Embeddings via Recursive Sketching

2 code implementations ‏‏‎ ‎ 2020 Dingqi Yang, Paolo Rosso, Bin Li, Philippe Cudre-Mauroux

Embeddings have become a key paradigm to learn graph represen-tations and facilitate downstream graph analysis tasks.

Profiling Bots, Fake News Spreaders and Haters

no code implementations LREC 2020 Paolo Rosso

In 2021 we specifically aim at addressing the challenging problem of profiling haters in social media in order to monitor abusive language and prevent cases of social exclusion in order to combat, for instance, racism, xenophobia and misogyny.

Abusive Language Marketing

Marking Irony Activators in a Universal Dependencies Treebank: The Case of an Italian Twitter Corpus

no code implementations LREC 2020 Aless Cignarella, ra Teresa, Manuela Sanguinetti, Cristina Bosco, Paolo Rosso

In this paper we describe a fine-grained annotation scheme centered on irony, in which we highlight the tokens that are responsible for its activation, (irony activators) and their morpho-syntactic features.

Sentiment Analysis

Irony Detection in a Multilingual Context

1 code implementation6 Feb 2020 Bilal Ghanem, Jihen Karoui, Farah Benamara, Paolo Rosso, Véronique Moriceau

This paper proposes the first multilingual (French, English and Arabic) and multicultural (Indo-European languages vs. less culturally close languages) irony detection system.

FacTweet: Profiling Fake News Twitter Accounts

no code implementations15 Oct 2019 Bilal Ghanem, Simone Paolo Ponzetto, Paolo Rosso

We present an approach to detect fake news in Twitter at the account level using a neural recurrent model and a variety of different semantic and stylistic features.

TexTrolls: Identifying Russian Trolls on Twitter from a Textual Perspective

no code implementations3 Oct 2019 Bilal Ghanem, Davide Buscaldi, Paolo Rosso

Our approach is mainly based on textual features which utilize thematic information, and profiling features to identify the accounts from their way of writing tweets.

An Emotional Analysis of False Information in Social Media and News Articles

no code implementations26 Aug 2019 Bilal Ghanem, Paolo Rosso, Francisco Rangel

Fake news is risky since it has been created to manipulate the readers' opinions and beliefs.

Unmasking Bias in News

no code implementations11 Jun 2019 Javier Sánchez-Junquera, Paolo Rosso, Manuel Montes-y-Gómez, Simone Paolo Ponzetto

We present experiments on detecting hyperpartisanship in news using a 'masking' method that allows us to assess the role of style vs. content for the task at hand.

TUVD team at SemEval-2019 Task 6: Offense Target Identification

no code implementations SEMEVAL 2019 Elena Shushkevich, John Cardiff, Paolo Rosso

This article presents our approach for detecting a target of offensive messages in Twitter, including Individual, Group and Others classes.

regression

SemEval-2019 Task 5: Multilingual Detection of Hate Speech Against Immigrants and Women in Twitter

no code implementations SEMEVAL 2019 Valerio Basile, Cristina Bosco, Elisabetta Fersini, Debora Nozza, Viviana Patti, Francisco Manuel Rangel Pardo, Paolo Rosso, Manuela Sanguinetti

The paper describes the organization of the SemEval 2019 Task 5 about the detection of hate speech against immigrants and women in Spanish and English messages extracted from Twitter.

UH-PRHLT at SemEval-2016 Task 3: Combining Lexical and Semantic-based Features for Community Question Answering

no code implementations SEMEVAL 2016 Marc Franco-Salvador, Sudipta Kar, Thamar Solorio, Paolo Rosso

In this work we describe the system built for the three English subtasks of the SemEval 2016 Task 3 by the Department of Computer Science of the University of Houston (UH) and the Pattern Recognition and Human Language Technology (PRHLT) research center - Universitat Polit`ecnica de Val`encia: UH-PRHLT.

Community Question Answering Knowledge Graphs

Cross-corpus Native Language Identification via Statistical Embedding

no code implementations WS 2018 Francisco Rangel, Paolo Rosso, Julian Brooke, Alex Uitdenbogerd, ra

In this paper, we approach the task of native language identification in a realistic cross-corpus scenario where a model is trained with available data and has to predict the native language from data of a different corpus.

Cross-corpus Native Language Identification

Semantically-informed distance and similarity measures for paraphrase plagiarism identification

no code implementations29 May 2018 Miguel A. Álvarez-Carmona, Marc Franco-Salvador, Esaú Villatoro-Tello, Manuel Montes-y-Gómez, Paolo Rosso, Luis Villaseñor-Pineda

Paraphrase plagiarism identification represents a very complex task given that plagiarized texts are intentionally modified through several rewording techniques.

A Resource-Light Method for Cross-Lingual Semantic Textual Similarity

1 code implementation19 Jan 2018 Goran Glavaš, Marc Franco-Salvador, Simone Paolo Ponzetto, Paolo Rosso

In contrast, we propose an unsupervised and a very resource-light approach for measuring semantic similarity between texts in different languages.

Cross-Lingual Information Retrieval Cross-Lingual Semantic Textual Similarity +9

Learning Multimodal Gender Profile using Neural Networks

no code implementations RANLP 2017 Carlos P{\'e}rez Estruch, Roberto Paredes Palacios, Paolo Rosso

Gender identification in social networks is one of the most popular aspects of user profile learning.

A Low Dimensionality Representation for Language Variety Identification

1 code implementation30 May 2017 Francisco Rangel, Marc Franco-Salvador, Paolo Rosso

We compare our LDR method with common state-of-the-art representations and show an increase in accuracy of ~35%.

Author Profiling at PAN: from Age and Gender Identification to Language Variety Identification (invited talk)

no code implementations WS 2017 Paolo Rosso

Author profiling is the study of how language is shared by people, a problem of growing importance in applications dealing with security, in order to understand who could be behind an anonymous threat message, and marketing, where companies may be interested in knowing the demographics of people that in online reviews liked or disliked their products.

Marketing Sentiment Analysis

Friends and Enemies of Clinton and Trump: Using Context for Detecting Stance in Political Tweets

no code implementations26 Feb 2017 Mirko Lai, Delia Irazú Hernández Farías, Viviana Patti, Paolo Rosso

Stance detection, the task of identifying the speaker's opinion towards a particular target, has attracted the attention of researchers.

Stance Detection

Enrichment of Bilingual Dictionary through News Stream Data

no code implementations LREC 2014 Ajay Dubey, Parth Gupta, Vasudeva Varma, Paolo Rosso

Many time the language pair does not have large bilingual comparable corpora and in such cases the best automatic dictionary is upper bounded by the quality and coverage of such corpora.

Information Retrieval

Squeezing bottlenecks: exploring the limits of autoencoder semantic representation capabilities

no code implementations13 Feb 2014 Parth Gupta, Rafael E. Banchs, Paolo Rosso

We present a comprehensive study on the use of autoencoders for modelling text data, in which (differently from previous studies) we focus our attention on the following issues: i) we explore the suitability of two different models bDA and rsDA for constructing deep autoencoders for text data at the sentence level; ii) we propose and evaluate two novel metrics for better assessing the text-reconstruction capabilities of autoencoders; and iii) we propose an automatic method to find the critical bottleneck dimensionality for text language representations (below which structural information is lost).

Sentence

Evaluating the Similarity Estimator component of the TWIN Personality-based Recommender System

no code implementations LREC 2012 Alex Roshchina, ra, John Cardiff, Paolo Rosso

With the constant increase in the amount of information available in online communities, the task of building an appropriate Recommender System to support the user in her decision making process is becoming more and more challenging.

Collaborative Filtering Decision Making +1

Cannot find the paper you are looking for? You can Submit a new open access paper.