no code implementations • insights (ACL) 2022 • Alessandra Teresa Cignarella, Cristina Bosco, Paolo Rosso
Furthermore, we study the phenomenon of stance with respect to six different targets – one per language, and two different for Italian – employing a variety of machine learning algorithms that primarily exploit morphological and syntactic knowledge as features, represented throughout the format of Universal Dependencies.
no code implementations • RANLP 2021 • Javier Sánchez-Junquera, Paolo Rosso, Manuel Montes-y-Gómez, Simone Paolo Ponzetto
Hyperpartisan news show an extreme manipulation of reality based on an underlying and extreme ideological orientation.
no code implementations • NAACL (CLPsych) 2021 • Ana Sabina Uban, Berta Chulvi, Paolo Rosso
Eating disorders are a growing problem especially among young people, yet they have been under-studied in computational research compared to other mental health disorders such as depression.
1 code implementation • OSACT (LREC) 2022 • Angel Felipe Magnossão de Paula, Paolo Rosso, Imene Bensalem, Wajdi Zaghouani
This paper describes our participation in the shared task Fine-Grained Hate Speech Detection on Arabic Twitter at the 5th Workshop on Open-Source Arabic Corpora and Processing Tools (OSACT).
1 code implementation • LREC 2022 • Ana Sabina Uban, Berta Chulvi, Paolo Rosso
We propose that transfer learning with linguistic features can be useful for approaching both the technical problem of improving mental disorder detection in the context of data scarcity, and the clinical problem of understanding the overlapping symptoms between certain disorders.
no code implementations • LREC 2022 • Gretel Liz De la Peña Sarracén, Paolo Rosso
Hate speech detection is a prominent and challenging task, since hate messages are often expressed in subtle ways and with characteristics that may vary depending on the author.
no code implementations • SemEval (NAACL) 2022 • Elisabetta Fersini, Francesca Gasparini, Giulia Rizzi, Aurora Saibene, Berta Chulvi, Paolo Rosso, Alyssa Lees, Jeffrey Sorensen
The paper describes the SemEval-2022 Task 5: Multimedia Automatic Misogyny Identification (MAMI), which explores the detection of misogynous memes on the web by taking advantage of available texts and images.
no code implementations • GWC 2016 • Yasser Regragui, Lahsen Abouenour, Fettoum Krieche, Karim Bouzoubaa, Paolo Rosso
We also present how this content helps in the implementation of new Arabic NLP applications, especially for Question Answering (QA) systems.
no code implementations • 28 Jan 2025 • Iván Arcos, Paolo Rosso, Ramón Salaverría
In detection models, SVM+TF-IDF achieved the highest F1-Score, excelling with limited data.
no code implementations • 24 Jan 2025 • Ipek Baris Schlicht, Zhixue Zhao, Burcu Sayin, Lucie Flek, Paolo Rosso
Equitable access to reliable health information is vital for public health, but the quality of online health resources varies by language, raising concerns about inconsistencies in Large Language Models (LLMs) for healthcare.
no code implementations • 26 Nov 2024 • Mireia Hernandez Caralt, Ivan Sekulić, Filip Carević, Nghia Khau, Diana Nicoleta Popa, Bruna Guedes, Victor Guimarães, Zeyu Yang, Andre Manso, Meghana Reddy, Paolo Rosso, Roland Mathis
Detecting user frustration in modern-day task-oriented dialog (TOD) systems is imperative for maintaining overall user satisfaction, engagement, and retention.
no code implementations • 15 Jul 2024 • Damir Korenčić, Berta Chulvi, Xavier Bonet Casals, Alejandro Toselli, Mariona Taulé, Paolo Rosso
The current prevalence of conspiracy theories on the internet is a significant issue, tackled by many computational approaches.
1 code implementation • 20 Feb 2024 • Adrian Cosma, Bogdan Iordache, Paolo Rosso
Recently, large language models (LLMs) have become increasingly powerful and have become capable of solving a plethora of tasks through proper instructions in natural language.
1 code implementation • 5 Jan 2024 • David Gimeno-Gómez, Ana-Maria Bucur, Adrian Cosma, Carlos-David Martínez-Hinarejos, Paolo Rosso
Depression, a prominent contributor to global disability, affects a substantial portion of the population.
1 code implementation • 12 Dec 2023 • Imene Bensalem, Paolo Rosso, Hanane Zitouni
The detection of toxic language in the Arabic language has emerged as an active area of research in recent years, and reviewing the existing datasets employed for training the developed solutions has become a pressing need.
no code implementations • 3 Nov 2023 • Gretel Liz De la Peña Sarracén, Paolo Rosso, Robert Litschko, Goran Glavaš, Simone Paolo Ponzetto
In this work, we resort to data augmentation and continual pre-training for domain adaptation to improve cross-lingual abusive language detection.
1 code implementation • 20 Sep 2023 • Areg Mikael Sarvazyan, José Ángel González, Marc Franco-Salvador, Francisco Rangel, Berta Chulvi, Paolo Rosso
This paper presents the overview of the AuTexTification shared task as part of the IberLEF 2023 Workshop in Iberian Languages Evaluation Forum, within the framework of the SEPLN 2023 conference.
1 code implementation • 7 Jul 2023 • Angel Felipe Magnossão de Paula, Paolo Rosso, Damiano Spina
Therefore another solution, based on the sharing of information between tasks, has been developed: Multi-Task Learning (MTL).
1 code implementation • 17 Mar 2023 • Angel Felipe Magnossão de Paula, Imene Bensalem, Paolo Rosso, Wajdi Zaghouani
This paper describes our participation in the shared task of hate speech detection, which is one of the subtasks of the CERIST NLP Challenge 2022.
1 code implementation • 13 Jan 2023 • Ana-Maria Bucur, Adrian Cosma, Paolo Rosso, Liviu P. Dinu
In this work, we propose a flexible time-enriched multimodal transformer architecture for detecting depression from social media posts, using pretrained models for extracting image and text embeddings.
no code implementations • 13 Jan 2023 • Ipek Baris Schlicht, Lucie Flek, Paolo Rosso
This paper proposes cross-training adapters on a subset of world languages, combined by adapter fusion, to detect claims emerging globally in multiple languages.
no code implementations • 5 Dec 2022 • Berta Chulvi, Alejandro Toselli, Paolo Rosso
In this paper we raise the research question of whether fake news and hate speech spreaders share common patterns in language.
no code implementations • 25 Jul 2022 • Maaz Amjad, Grigori Sidorov, Alisa Zhila, Alexander Gelbukh, Paolo Rosso
This paper gives the overview of the first shared task at FIRE 2020 on fake news detection in the Urdu language.
no code implementations • 25 Jul 2022 • Maaz Amjad, Grigori Sidorov, Alisa Zhila, Alexander Gelbukh, Paolo Rosso
This overview paper describes the first shared task on fake news detection in Urdu language.
no code implementations • 2 Jul 2022 • Ana-Maria Bucur, Adrian Cosma, Liviu P. Dinu, Paolo Rosso
This work proposes a transformer architecture for user-level classification of gambling addiction and depression that is trainable end-to-end.
1 code implementation • NAACL 2022 • Ramit Sawhney, Shivam Agarwal, Vivek Mittal, Paolo Rosso, Vikram Nanda, Sudheer Chava
Further, we develop a set of sequence-to-sequence hyperbolic models suited to this multi-span identification task based on the power-law dynamics of cryptocurrencies and user behavior on social media.
1 code implementation • 22 Apr 2022 • Petr Lorenc, Ana-Sabina Uban, Paolo Rosso, Jan Šedivý
Unfortunately, there is a lack of data in the conversational domain.
no code implementations • 20 Apr 2022 • Angelo Basile, Marc Franco-Salvador, Paolo Rosso
Zero-shot text classifiers based on label descriptions embed an input text and a set of labels into the same space: measures such as cosine similarity can then be used to select the most similar label description to the input text as the predicted label.
no code implementations • 11 Dec 2021 • Ipek Baris Schlicht, Angel Felipe Magnossão de Paula, Paolo Rosso
Health misinformation on search engines is a significant problem that could negatively affect individuals or public health.
1 code implementation • 19 Sep 2021 • Ipek Baris Schlicht, Angel Felipe Magnossão de Paula, Paolo Rosso
Identifying check-worthy claims is often the first step of automated fact-checking systems.
no code implementations • 13 Sep 2021 • Giancarlo Ruffo, Alfonso Semeraro, Anastasia Giachanou, Paolo Rosso
With the explosive growth of online social media, the ancient problem of information disorders interfering with news diffusion has surfaced with a renewed intensity threatening our democracies, public health, and news outlets' credibility.
no code implementations • SEMEVAL 2021 • Roberto Labadie, Mariano Jason Rodriguez, Reynier Ortega, Paolo Rosso
The SNN model is used for learning a latent space where instances of humor and non-humor can be distinguished.
1 code implementation • EACL 2021 • Bilal Ghanem, Simone Paolo Ponzetto, Paolo Rosso, Francisco Rangel
To capture this, we propose in this paper to model the flow of affective information in fake news articles using a neural architecture.
no code implementations • 19 Jan 2021 • Sergei Koltcov, Vera Ignatenko, Maxim Terpilovskii, Paolo Rosso
In this paper, we propose a Renyi entropy-based approach for a partial solution to the above problem.
no code implementations • SEMEVAL 2020 • Gretel Liz De la Pe{\~n}a Sarrac{\'e}n, Paolo Rosso
We propose a model based on the BERT architecture for the analysis of texts in English.
no code implementations • SEMEVAL 2020 • Somnath Banerjee, Sahar Ghannay, Sophie Rosset, Anne Vilnat, Paolo Rosso
This paper describes the participation of LIMSI{\_}UPV team in SemEval-2020 Task 9: Sentiment Analysis for Code-Mixed Social Media Text.
no code implementations • SEMEVAL 2020 • Gretel Liz De la Pe{\~n}a Sarrac{\'e}n, Paolo Rosso, Anastasia Giachanou
The BERT model is used to process the textual information and VGG the images.
1 code implementation • COLING 2020 • Alessandra Teresa Cignarella, Valerio Basile, Manuela Sanguinetti, Cristina Bosco, Paolo Rosso, Farah Benamara
This paper presents an in-depth investigation of the effectiveness of dependency-based syntactic features on the irony detection task in a multilingual perspective (English, Spanish, French and Italian).
no code implementations • 31 Aug 2020 • Somnath Banerjee, Sudip Kumar Naskar, Paolo Rosso, Sivaji Bandyopadhyay
Overall, the stacking approach produces the best results for fine-grained classification and achieves 87. 79% of accuracy.
1 code implementation • 30 Aug 2020 • Somnath Banerjee, Sahar Ghannay, Sophie Rosset, Anne Vilnat, Paolo Rosso
This paper describes the participation of LIMSI UPV team in SemEval-2020 Task 9: Sentiment Analysis for Code-Mixed Social Media Text.
1 code implementation • 29 Jul 2020 • Mirko Lai, Viviana Patti, Giancarlo Ruffo, Paolo Rosso
Interest has grown around the classification of stance that users assume within online debates in recent years.
2 code implementations • 2020 • Dingqi Yang, Paolo Rosso, Bin Li, Philippe Cudre-Mauroux
Embeddings have become a key paradigm to learn graph represen-tations and facilitate downstream graph analysis tasks.
no code implementations • LREC 2020 • Paolo Rosso
In 2021 we specifically aim at addressing the challenging problem of profiling haters in social media in order to monitor abusive language and prevent cases of social exclusion in order to combat, for instance, racism, xenophobia and misogyny.
no code implementations • LREC 2020 • Aless Cignarella, ra Teresa, Manuela Sanguinetti, Cristina Bosco, Paolo Rosso
In this paper we describe a fine-grained annotation scheme centered on irony, in which we highlight the tokens that are responsible for its activation, (irony activators) and their morpho-syntactic features.
1 code implementation • 6 Feb 2020 • Bilal Ghanem, Jihen Karoui, Farah Benamara, Paolo Rosso, Véronique Moriceau
This paper proposes the first multilingual (French, English and Arabic) and multicultural (Indo-European languages vs. less culturally close languages) irony detection system.
no code implementations • 15 Oct 2019 • Bilal Ghanem, Simone Paolo Ponzetto, Paolo Rosso
We present an approach to detect fake news in Twitter at the account level using a neural recurrent model and a variety of different semantic and stylistic features.
no code implementations • 3 Oct 2019 • Bilal Ghanem, Davide Buscaldi, Paolo Rosso
Our approach is mainly based on textual features which utilize thematic information, and profiling features to identify the accounts from their way of writing tweets.
no code implementations • 26 Aug 2019 • Bilal Ghanem, Paolo Rosso, Francisco Rangel
Fake news is risky since it has been created to manipulate the readers' opinions and beliefs.
no code implementations • 11 Jun 2019 • Javier Sánchez-Junquera, Paolo Rosso, Manuel Montes-y-Gómez, Simone Paolo Ponzetto
We present experiments on detecting hyperpartisanship in news using a 'masking' method that allows us to assess the role of style vs. content for the task at hand.
no code implementations • SEMEVAL 2019 • Gretel Liz De la Pe{\~n}a, Paolo Rosso
This paper describes the system we developed for SemEval 2019 on Identifying and Categorizing Offensive Language in Social Media (OffensEval - Task 6).
no code implementations • SEMEVAL 2019 • Elena Shushkevich, John Cardiff, Paolo Rosso
This article presents our approach for detecting a target of offensive messages in Twitter, including Individual, Group and Others classes.
no code implementations • SEMEVAL 2019 • Bilal Ghanem, Aless Cignarella, ra Teresa, Cristina Bosco, Paolo Rosso, Francisco Manuel Rangel Pardo
In the present paper we describe the UPV-28-UNITO system{'}s submission to the RumorEval 2019 shared task.
no code implementations • SEMEVAL 2019 • Valerio Basile, Cristina Bosco, Elisabetta Fersini, Debora Nozza, Viviana Patti, Francisco Manuel Rangel Pardo, Paolo Rosso, Manuela Sanguinetti
The paper describes the organization of the SemEval 2019 Task 5 about the detection of hate speech against immigrants and women in Spanish and English messages extracted from Twitter.
no code implementations • WS 2018 • Bilal Ghanem, Paolo Rosso, Francisco Rangel
Furthermore, we have investigated the importance of different lexicons in the detection of the classification labels.
no code implementations • SEMEVAL 2016 • Marc Franco-Salvador, Sudipta Kar, Thamar Solorio, Paolo Rosso
In this work we describe the system built for the three English subtasks of the SemEval 2016 Task 3 by the Department of Computer Science of the University of Houston (UH) and the Pattern Recognition and Human Language Technology (PRHLT) research center - Universitat Polit`ecnica de Val`encia: UH-PRHLT.
no code implementations • SEMEVAL 2018 • Delia Iraz{\'u} Hern{\'a}ndez Far{\'\i}as, Fern S{\'a}nchez-Vega, o, Manuel Montes-y-G{\'o}mez, Paolo Rosso
This paper describes an ensemble approach to the SemEval-2018 Task 3.
no code implementations • SEMEVAL 2018 • Bilal Ghanem, Francisco Rangel, Paolo Rosso
In this paper we describe our participation in the SemEval-2018 task 3 Shared Task on Irony Detection.
no code implementations • SEMEVAL 2018 • Delia Iraz{\'u} Hern{\'a}ndez Far{\'\i}as, Viviana Patti, Paolo Rosso
In this paper we describe the system used by the ValenTO team in the shared task on Irony Detection in English Tweets at SemEval 2018.
no code implementations • WS 2018 • Francisco Rangel, Paolo Rosso, Julian Brooke, Alex Uitdenbogerd, ra
In this paper, we approach the task of native language identification in a realistic cross-corpus scenario where a model is trained with available data and has to predict the native language from data of a different corpus.
no code implementations • 29 May 2018 • Miguel A. Álvarez-Carmona, Marc Franco-Salvador, Esaú Villatoro-Tello, Manuel Montes-y-Gómez, Paolo Rosso, Luis Villaseñor-Pineda
Paraphrase plagiarism identification represents a very complex task given that plagiarized texts are intentionally modified through several rewording techniques.
1 code implementation • 19 Jan 2018 • Goran Glavaš, Marc Franco-Salvador, Simone Paolo Ponzetto, Paolo Rosso
In contrast, we propose an unsupervised and a very resource-light approach for measuring semantic similarity between texts in different languages.
Cross-Lingual Information Retrieval
Cross-Lingual Semantic Textual Similarity
+9
no code implementations • RANLP 2017 • Carlos P{\'e}rez Estruch, Roberto Paredes Palacios, Paolo Rosso
Gender identification in social networks is one of the most popular aspects of user profile learning.
no code implementations • ACL 2017 • Sanja {\v{S}}tajner, Marc Franco-Salvador, Simone Paolo Ponzetto, Paolo Rosso, Heiner Stuckenschmidt
We provide several methods for sentence-alignment of texts with different complexity levels.
1 code implementation • 30 May 2017 • Francisco Rangel, Marc Franco-Salvador, Paolo Rosso
We compare our LDR method with common state-of-the-art representations and show an increase in accuracy of ~35%.
no code implementations • WS 2017 • Paolo Rosso
Author profiling is the study of how language is shared by people, a problem of growing importance in applications dealing with security, in order to understand who could be behind an anonymous threat message, and marketing, where companies may be interested in knowing the demographics of people that in online reviews liked or disliked their products.
no code implementations • EACL 2017 • Rosa M. Gim{\'e}nez-P{\'e}rez, Marc Franco-Salvador, Paolo Rosso
That setting usually implies the use of a domain adaptation method.
no code implementations • EACL 2017 • Prasha Shrestha, Sebastian Sierra, Fabio Gonz{\'a}lez, Manuel Montes, Paolo Rosso, Thamar Solorio
We present a model to perform authorship attribution of tweets using Convolutional Neural Networks (CNNs) over character n-grams.
no code implementations • 26 Feb 2017 • Mirko Lai, Delia Irazú Hernández Farías, Viviana Patti, Paolo Rosso
Stance detection, the task of identifying the speaker's opinion towards a particular target, has attracted the attention of researchers.
no code implementations • LREC 2016 • Mohamed Outahajala, Paolo Rosso
In order to improve this result, we have gathered a set of about 8k words with their POS tags.
no code implementations • LREC 2014 • Ajay Dubey, Parth Gupta, Vasudeva Varma, Paolo Rosso
Many time the language pair does not have large bilingual comparable corpora and in such cases the best automatic dictionary is upper bounded by the quality and coverage of such corpora.
no code implementations • 13 Feb 2014 • Parth Gupta, Rafael E. Banchs, Paolo Rosso
We present a comprehensive study on the use of autoencoders for modelling text data, in which (differently from previous studies) we focus our attention on the following issues: i) we explore the suitability of two different models bDA and rsDA for constructing deep autoencoders for text data at the sentence level; ii) we propose and evaluate two novel metrics for better assessing the text-reconstruction capabilities of autoencoders; and iii) we propose an automatic method to find the critical bottleneck dimensionality for text language representations (below which structural information is lost).
no code implementations • LREC 2012 • Alex Roshchina, ra, John Cardiff, Paolo Rosso
With the constant increase in the amount of information available in online communities, the task of building an appropriate Recommender System to support the user in her decision making process is becoming more and more challenging.