1 code implementation • EMNLP (WNUT) 2021 • Rob van der Goot, Alan Ramponi, Arkaitz Zubiaga, Barbara Plank, Benjamin Muller, Iñaki San Vicente Roncal, Nikola Ljubešić, Özlem Çetinoğlu, Rahmad Mahendra, Talha Çolakoğlu, Timothy Baldwin, Tommaso Caselli, Wladimir Sidorenko
This task is beneficial for downstream analysis, as it provides a way to harmonize (often spontaneous) linguistic variation.
no code implementations • SemEval (NAACL) 2022 • Dina Pisarevskaya, Arkaitz Zubiaga
This paper describes the participation of the team “dina” in the Multilingual News Similarity task at SemEval 2022.
no code implementations • SemEval (NAACL) 2022 • Weihe Zhai, Mingqiang Feng, Arkaitz Zubiaga, Bingquan Liu
As a result, the model can well generalise to soft constrained and other competence-based question answering problem.
no code implementations • 8 Nov 2024 • Amani S. Abumansour, Arkaitz Zubiaga
In addition, we introduce the Similarity-driven Gradual Topic Learning (SGTL) model that synthesizes gradual learning with a similarity-based strategy for the target topic.
1 code implementation • 6 Nov 2024 • Hiu Ting Lau, Arkaitz Zubiaga
Natural Language Generation has been rapidly developing with the advent of large language models (LLMs).
no code implementations • 20 Sep 2024 • Parisa Jamadi Khiabani, Arkaitz Zubiaga
Stance detection is the task of determining the viewpoint expressed in a text towards a given target.
1 code implementation • 26 Jun 2024 • Yufeng Li, Rrubaa Panchendrarajan, Arkaitz Zubiaga
The rapid dissemination of information through social media and the Internet has posed a significant challenge for fact-checking, among others in identifying check-worthy claims that fact-checkers should pay attention to, i. e. filtering claims needing fact-checking from a large pool of sentences.
1 code implementation • 22 Apr 2024 • Bharathi A, Arkaitz Zubiaga
Our experiments demonstrate the effectiveness of model components, not least the translation-augmented data as well as the adversarial learning component, to the improved performance of the model.
no code implementations • 8 Mar 2024 • Parisa Jamadi Khiabani, Arkaitz Zubiaga
Stance detection, as the task of determining the viewpoint of a social media post towards a target as 'favor' or 'against', has been understudied in the challenging yet realistic scenario where there is limited labeled data for a certain target.
no code implementations • 26 Feb 2024 • Peiling Yi, Arkaitz Zubiaga
Swear words are a common proxy to collect datasets with cyberbullying incidents.
no code implementations • 29 Jan 2024 • Xia Zeng, Arkaitz Zubiaga
Claim verification is an essential step in the automated fact-checking pipeline which assesses the veracity of a claim against a piece of evidence.
no code implementations • 22 Jan 2024 • Rrubaa Panchendrarajan, Arkaitz Zubiaga
Focusing on multilingual misinformation, we present a comprehensive survey of existing multilingual claim detection research.
no code implementations • 22 Jan 2024 • Rrubaa Panchendrarajan, Arkaitz Zubiaga
The advancement of machine learning and symbolic approaches have underscored their strengths and weaknesses in Natural Language Processing (NLP).
1 code implementation • 17 Jan 2024 • Aiqi Jiang, Arkaitz Zubiaga
This survey presents a systematic and comprehensive exploration of Cross-Lingual Transfer Learning (CLTL) techniques in offensive language detection in social media.
no code implementations • 7 Oct 2023 • Weihe Zhai, Arkaitz Zubiaga
The fusion of language models (LMs) and knowledge graphs (KGs) is widely used in commonsense question answering, but generating faithful explanations remains challenging.
no code implementations • 28 Feb 2023 • Runcong Zhao, Miguel Arana-Catania, Lixing Zhu, Elena Kochkina, Lin Gui, Arkaitz Zubiaga, Rob Procter, Maria Liakata, Yulan He
In this demo, we introduce a web-based misinformation detection system PANACEA on COVID-19 related claims, which has two modules, fact-checking and rumour detection.
no code implementations • 16 Feb 2023 • XIAOYU GUO, Jing Ma, Arkaitz Zubiaga
Memes have gained popularity as a means to share visual ideas through the Internet and social media by mixing text, images and videos, often for humorous purposes.
1 code implementation • 16 Feb 2023 • XIAOYU GUO, Jing Ma, Arkaitz Zubiaga
This paper describes the participation of our NUAA-QMUL-AIIT team in the Memotion 3 shared task on meme emotion analysis.
no code implementations • 11 Jan 2023 • Parisa Jamadi Khiabani, Arkaitz Zubiaga
To address the cross-target stance detection in social media by leveraging the social nature of the task, we introduce CT-TN, a novel model that aggregates multimodal embeddings derived from both textual and network features of the data.
no code implementations • 20 Dec 2022 • Wenjie Yin, Vibhor Agarwal, Aiqi Jiang, Arkaitz Zubiaga, Nishanth Sastry
During training, the model associates annotators with their label choices given a piece of text; during evaluation, when label information is not available, the model predicts the aggregated label given by the participating annotators by utilising the learnt association.
no code implementations • 16 Dec 2022 • Amani S. Abumansour, Arkaitz Zubiaga
The AraCWA model enables boosting the performance for new topics by incorporating two components for few-shot learning and data augmentation.
1 code implementation • 15 Nov 2022 • Aiqi Jiang, Arkaitz Zubiaga
The goal of sexism detection is to mitigate negative online content targeting certain gender groups of people.
no code implementations • 18 Aug 2022 • Xia Zeng, Arkaitz Zubiaga
To mitigate the impact of the scarcity of labelled data on fact-checking systems, we focus on few-shot claim verification.
no code implementations • 14 Jul 2022 • Peiling Yi, Arkaitz Zubiaga
In this survey paper, we define the Session-based Cyberbullying Detection framework that encapsulates the different steps and challenges of the problem.
no code implementations • 11 May 2022 • Xia Zeng, Arkaitz Zubiaga
In this paper, we introduce SEED, a novel vector-based method to few-shot claim veracity classification that aggregates pairwise semantic differences for claim-evidence pairs.
1 code implementation • 11 May 2022 • Rabab Alkhalifa, Elena Kochkina, Arkaitz Zubiaga
Therefore an ability to predict a model's ability to persist over time can help design models that can be effectively used over a longer period of time.
no code implementations • NAACL 2022 • M. Arana-Catania, Elena Kochkina, Arkaitz Zubiaga, Maria Liakata, Rob Procter, Yulan He
The dataset construction includes work on retrieval techniques and similarity measurements to ensure a unique set of claims.
no code implementations • 3 May 2022 • Wenjie Yin, Arkaitz Zubiaga
While social media offers freedom of self-expression, abusive language carry significant negative social impact.
no code implementations • 1 Apr 2022 • Peiling Yi, Arkaitz Zubiaga
Despite the increasing interest in cyberbullying detection, existing efforts have largely been limited to experiments on a single platform and their generalisability across different social media platforms have received less attention.
no code implementations • 5 Nov 2021 • Amikul Kalra, Arkaitz Zubiaga
These networks are used in conjunction with transfer learning in the form of Bidirectional Encoder Representations from Transformers (BERT) and DistilBERT models, along with data augmentation, to perform binary and multiclass sexism classification on the dataset of tweets and gabs from the sEXism Identification in Social neTworks (EXIST) task in IberLEF 2021.
no code implementations • 1 Nov 2021 • Teodor Tiţa, Arkaitz Zubiaga
Hate speech detection within a cross-lingual setting represents a paramount area of interest for all medium and large-scale online platforms.
no code implementations • 23 Sep 2021 • Xia Zeng, Amani S. Abumansour, Arkaitz Zubiaga
As online false information continues to grow, automated fact-checking has gained an increasing amount of attention in recent years.
no code implementations • 3 Sep 2021 • Dimitris Gkoumas, Bo wang, Adam Tsakalidis, Maria Wolters, Arkaitz Zubiaga, Matthew Purver, Maria Liakata
The corpus consists of spoken conversations, a subset of which are transcribed, as well as typed and written thoughts and associated extra-linguistic information such as pen strokes and keystrokes.
no code implementations • 1 Sep 2021 • Rabab Alkhalifa, Arkaitz Zubiaga
Social media platforms provide a goldmine for mining public opinion on issues of wide societal interest and impact.
1 code implementation • 31 Aug 2021 • Wenjie Yin, Rabab Alkhalifa, Arkaitz Zubiaga
There is however a gap in the longitudinal study of how sentiment evolved in social media over the years.
1 code implementation • 27 Aug 2021 • Rabab Alkhalifa, Elena Kochkina, Arkaitz Zubiaga
We propose a novel approach to mitigate this performance drop, which is based on temporal adaptation of the word embeddings used for training the stance classifier.
no code implementations • 24 Aug 2021 • Peiling Yi, Arkaitz Zubiaga
Teenager detection is an important case of the age detection task in social media, which aims to detect teenage users to protect them from negative influences.
no code implementations • 6 Aug 2021 • Aiqi Jiang, Xiaohan Yang, Yang Liu, Arkaitz Zubiaga
We propose the first Chinese sexism dataset -- Sina Weibo Sexism Review (SWSR) dataset --, as well as a large Chinese lexicon SexHateLex made of abusive and gender-related terms.
no code implementations • 6 Aug 2021 • Aiqi Jiang, Arkaitz Zubiaga
Most hate speech detection research focuses on a single language, generally English, which limits their generalisability to other languages.
no code implementations • 28 Feb 2021 • M. Arana-Catania, F. A. Van Lier, Rob Procter, Nataliya Tkachenko, Yulan He, Arkaitz Zubiaga, Maria Liakata
The development of democratic systems is a crucial task as confirmed by its selection as one of the Millennium Sustainable Development Goals by the United Nations.
no code implementations • 17 Feb 2021 • Wenjie Yin, Arkaitz Zubiaga
Hate speech is one type of harmful online content which directly attacks or promotes hate towards a group or an individual member based on their actual or perceived aspects of identity, such as ethnicity, religion, and sexual orientation.
1 code implementation • 11 Dec 2020 • Arkaitz Zubiaga
Text classification, as the task consisting in assigning categories to textual instances, is a very common task in information science.
no code implementations • COLING 2020 • Leon Derczynski, Arkaitz Zubiaga
Detecting and grounding false and misleading claims on the web has grown to form a substantial sub-field of NLP.
1 code implementation • 23 Nov 2020 • Neeraj Vashistha, Arkaitz Zubiaga, Shanky Sharma
After attaining a competitive performance score, we create a tool which identifies and scores a page with effective metric in near-real time and uses the same as feedback to re-train our model.
no code implementations • 5 Nov 2020 • Rabab Alkhalifa, Adam Tsakalidis, Arkaitz Zubiaga, Maria Liakata
In this paper, we present the results and main findings of our system for the DIACR-ITA 2020 Task.
1 code implementation • SEMEVAL 2020 • XIAOYU GUO, Jing Ma, Arkaitz Zubiaga
This paper describes our contribution to SemEval 2020 Task 8: Memotion Analysis.
no code implementations • 2 Nov 2020 • Rabab Alkhalifa, Arkaitz Zubiaga
This paper presents our submission to the SardiStance 2020 shared task, describing the architecture used for Task A and Task B.
no code implementations • 30 Aug 2020 • Rabab Alkhalifa, Theodore Yoong, Elena Kochkina, Arkaitz Zubiaga, Maria Liakata
The purpose of this task is to determine the check-worthiness of tweets about COVID-19 to identify and prioritise tweets that need fact-checking.
no code implementations • 3 Jun 2020 • Arkaitz Zubiaga
Text classification is one of the most frequent tasks for processing textual data, facilitating among others research from large-scale datasets.
no code implementations • SEMEVAL 2019 • Genevieve Gorrell, Elena Kochkina, Maria Liakata, Ahmet Aker, Arkaitz Zubiaga, Kalina Bontcheva, Leon Derczynski
Rumour verification is characterised by the need to consider evolving conversations and news updates to reach a verdict on a rumour{'}s veracity.
no code implementations • 14 Nov 2018 • Aiqi Jiang, Arkaitz Zubiaga
Online review platforms are a popular way for users to post reviews by expressing their opinions towards a product or service, as well as they are valuable for other users and companies to find out the overall opinions of customers.
no code implementations • 21 Sep 2018 • Lev Konstantinovskiy, Oliver Price, Mevan Babakar, Arkaitz Zubiaga
In an effort to assist factcheckers in the process of factchecking, we tackle the claim detection task, one of the necessary stages prior to determining the veracity of a claim.
no code implementations • 18 Sep 2018 • Genevieve Gorrell, Kalina Bontcheva, Leon Derczynski, Elena Kochkina, Maria Liakata, Arkaitz Zubiaga
This is the proposal for RumourEval-2019, which will run in early 2019 as part of that year's SemEval event.
no code implementations • COLING 2018 • Elena Kochkina, Maria Liakata, Arkaitz Zubiaga
We propose a multi-task learning approach that allows joint training of the main and auxiliary tasks, improving the performance of rumour verification.
no code implementations • 10 Apr 2018 • Arkaitz Zubiaga
Social media is becoming an increasingly important data source for learning about breaking news and for following the latest developments of ongoing news.
no code implementations • 22 Jan 2018 • Arkaitz Zubiaga, Aiqi Jiang
Our dataset represents a realistic scenario with a real distribution of true, commemorative and false stories, which we release for further use as a benchmark in future research.
no code implementations • 6 Dec 2017 • Arkaitz Zubiaga, Elena Kochkina, Maria Liakata, Rob Procter, Michal Lukasik, Kalina Bontcheva, Trevor Cohn, Isabelle Augenstein
We show that sequential classifiers that exploit the use of discourse properties in social media conversations while using only local features, outperform non-sequential classifiers.
no code implementations • IJCNLP 2017 • Bo Wang, Maria Liakata, Adam Tsakalidis, Spiros Georgakopoulos Kolaitis, Symeon Papadopoulos, Lazaros Apostolidis, Arkaitz Zubiaga, Rob Procter, Yiannis Kompatsiaris
We present a system for time sensitive, topic based summarisation of the sentiment around target entities and topics in collections of tweets.
no code implementations • 3 Apr 2017 • Arkaitz Zubiaga, Ahmet Aker, Kalina Bontcheva, Maria Liakata, Rob Procter
Despite the increasing use of social media platforms for information and news gathering, its unmoderated nature often leads to the emergence and spread of rumours, i. e. pieces of information that are unverified at the time of posting.
no code implementations • EACL 2017 • Bo Wang, Maria Liakata, Arkaitz Zubiaga, Rob Procter
Existing target-specific sentiment recognition methods consider only a single target per tweet, and have been shown to miss nearly half of the actual targets mentioned.
no code implementations • 27 Feb 2017 • Arkaitz Zubiaga, Bo wang, Maria Liakata, Rob Procter
Independence movements occur in territories whose citizens have conflicting national identities; users with opposing national identities will then support or oppose the sense of being part of an independent nation that differs from the officially recognised country.
2 code implementations • 24 Oct 2016 • Arkaitz Zubiaga, Maria Liakata, Rob Procter
In this paper we introduce a novel approach to rumour detection that learns from the sequential dynamics of reporting during breaking news in social media to detect rumours in new stories.
no code implementations • COLING 2016 • Arkaitz Zubiaga, Elena Kochkina, Maria Liakata, Rob Procter, Michal Lukasik
Rumour stance classification, the task that determines if each tweet in a collection discussing a rumour is supporting, denying, questioning or simply commenting on the rumour, has been attracting substantial interest.
no code implementations • 7 Sep 2016 • Michal Lukasik, Kalina Bontcheva, Trevor Cohn, Arkaitz Zubiaga, Maria Liakata, Rob Procter
Social media tend to be rife with rumours while new reports are released piecemeal during breaking news.
no code implementations • 14 Jun 2016 • Alberto P. García-Plaza, Víctor Fresno, Raquel Martínez, Arkaitz Zubiaga
The selection of a suitable document representation approach plays a crucial role in the performance of a document clustering task.
no code implementations • LREC 2016 • I{\~n}aki San Vicente, I{\~n}aki Alegr{\'\i}a, Cristina Espa{\~n}a-Bonet, Pablo Gamallo, Hugo Gon{\c{c}}alo Oliveira, Eva Mart{\'\i}nez Garcia, Antonio Toral, Arkaitz Zubiaga, Nora Aranberri
We introduce TweetMT, a parallel corpus of tweets in four language pairs that combine five languages (Spanish from/to Basque, Catalan, Galician and Portuguese), all of which have an official status in the Iberian Peninsula.
1 code implementation • 25 Apr 2016 • Arkaitz Zubiaga, Alex Voss, Rob Procter, Maria Liakata, Bo wang, Adam Tsakalidis
In contrast to much previous work that has focused on location classification of tweets restricted to a specific country, here we undertake the task in a broader context by classifying global tweets at the country level, which is so far unexplored in a real-time scenario.
no code implementations • LREC 2014 • I{\~n}aki Alegria, Nora Aranberri, Pere Comas, V{\'\i}ctor Fresno, Pablo Gamallo, Lluis Padr{\'o}, I{\~n}aki San Vicente, Jordi Turmo, Arkaitz Zubiaga
It was created for Tweet-Norm, a tweet normalization workshop and shared task, and is the result of a joint annotation effort from different research groups.
no code implementations • 6 Mar 2014 • Arkaitz Zubiaga, Damiano Spina, Raquel Martínez, Víctor Fresno
Social media users give rise to social trends as they share about common interests, which can be triggered by different reasons.