Search Results for author: Arkaitz Zubiaga

Found 75 papers, 16 papers with code

Supporting Automated Fact-checking across Topics: Similarity-driven Gradual Topic Learning for Claim Detection

no code implementations8 Nov 2024 Amani S. Abumansour, Arkaitz Zubiaga

In addition, we introduce the Similarity-driven Gradual Topic Learning (SGTL) model that synthesizes gradual learning with a similarity-based strategy for the target topic.

Domain Adaptation Fact Checking

Cross-Target Stance Detection: A Survey of Techniques, Datasets, and Challenges

no code implementations20 Sep 2024 Parisa Jamadi Khiabani, Arkaitz Zubiaga

Stance detection is the task of determining the viewpoint expressed in a text towards a given target.

Stance Detection

FactFinders at CheckThat! 2024: Refining Check-worthy Statement Detection with LLMs through Data Pruning

1 code implementation26 Jun 2024 Yufeng Li, Rrubaa Panchendrarajan, Arkaitz Zubiaga

The rapid dissemination of information through social media and the Internet has posed a significant challenge for fact-checking, among others in identifying check-worthy claims that fact-checkers should pay attention to, i. e. filtering claims needing fact-checking from a large pool of sentences.

Fact Checking Prompt Engineering

Zero-shot Cross-lingual Stance Detection via Adversarial Language Adaptation

1 code implementation22 Apr 2024 Bharathi A, Arkaitz Zubiaga

Our experiments demonstrate the effectiveness of model components, not least the translation-augmented data as well as the adversarial learning component, to the improved performance of the model.

Stance Detection Translation

SocialPET: Socially Informed Pattern Exploiting Training for Few-Shot Stance Detection in Social Media

no code implementations8 Mar 2024 Parisa Jamadi Khiabani, Arkaitz Zubiaga

Stance detection, as the task of determining the viewpoint of a social media post towards a target as 'favor' or 'against', has been understudied in the challenging yet realistic scenario where there is limited labeled data for a certain target.

Few-Shot Stance Detection

MAPLE: Micro Analysis of Pairwise Language Evolution for Few-Shot Claim Verification

no code implementations29 Jan 2024 Xia Zeng, Arkaitz Zubiaga

Claim verification is an essential step in the automated fact-checking pipeline which assesses the veracity of a claim against a piece of evidence.

Claim Verification Fact Checking

Claim Detection for Automated Fact-checking: A Survey on Monolingual, Multilingual and Cross-Lingual Research

no code implementations22 Jan 2024 Rrubaa Panchendrarajan, Arkaitz Zubiaga

Focusing on multilingual misinformation, we present a comprehensive survey of existing multilingual claim detection research.

Fact Checking Misinformation +1

Synergizing Machine Learning & Symbolic Methods: A Survey on Hybrid Approaches to Natural Language Processing

no code implementations22 Jan 2024 Rrubaa Panchendrarajan, Arkaitz Zubiaga

The advancement of machine learning and symbolic approaches have underscored their strengths and weaknesses in Natural Language Processing (NLP).

Natural Language Understanding

Cross-lingual Offensive Language Detection: A Systematic Review of Datasets, Transfer Approaches and Challenges

1 code implementation17 Jan 2024 Aiqi Jiang, Arkaitz Zubiaga

This survey presents a systematic and comprehensive exploration of Cross-Lingual Transfer Learning (CLTL) techniques in offensive language detection in social media.

Cross-Lingual Transfer Survey +1

Towards Faithful Knowledge Graph Explanation Through Deep Alignment in Commonsense Question Answering

no code implementations7 Oct 2023 Weihe Zhai, Arkaitz Zubiaga

The fusion of language models (LMs) and knowledge graphs (KGs) is widely used in commonsense question answering, but generating faithful explanations remains challenging.

Knowledge Graphs Question Answering

PANACEA: An Automated Misinformation Detection System on COVID-19

no code implementations28 Feb 2023 Runcong Zhao, Miguel Arana-Catania, Lixing Zhu, Elena Kochkina, Lin Gui, Arkaitz Zubiaga, Rob Procter, Maria Liakata, Yulan He

In this demo, we introduce a web-based misinformation detection system PANACEA on COVID-19 related claims, which has two modules, fact-checking and rumour detection.

Fact Checking Misinformation +2

Cluster-based Deep Ensemble Learning for Emotion Classification in Internet Memes

no code implementations16 Feb 2023 XIAOYU GUO, Jing Ma, Arkaitz Zubiaga

Memes have gained popularity as a means to share visual ideas through the Internet and social media by mixing text, images and videos, often for humorous purposes.

Clustering Emotion Classification +1

Few-shot Learning for Cross-Target Stance Detection by Aggregating Multimodal Embeddings

no code implementations11 Jan 2023 Parisa Jamadi Khiabani, Arkaitz Zubiaga

To address the cross-target stance detection in social media by leveraging the social nature of the task, we introduce CT-TN, a novel model that aggregates multimodal embeddings derived from both textual and network features of the data.

Few-Shot Learning Stance Detection

AnnoBERT: Effectively Representing Multiple Annotators' Label Choices to Improve Hate Speech Detection

no code implementations20 Dec 2022 Wenjie Yin, Vibhor Agarwal, Aiqi Jiang, Arkaitz Zubiaga, Nishanth Sastry

During training, the model associates annotators with their label choices given a piece of text; during evaluation, when label information is not available, the model predicts the aggregated label given by the participating annotators by utilising the learnt association.

Hate Speech Detection

Check-worthy Claim Detection across Topics for Automated Fact-checking

no code implementations16 Dec 2022 Amani S. Abumansour, Arkaitz Zubiaga

The AraCWA model enables boosting the performance for new topics by incorporating two components for few-shot learning and data augmentation.

Data Augmentation Fact Checking +1

Session-based Cyberbullying Detection in Social Media: A Survey

no code implementations14 Jul 2022 Peiling Yi, Arkaitz Zubiaga

In this survey paper, we define the Session-based Cyberbullying Detection framework that encapsulates the different steps and challenges of the problem.

Survey

Aggregating Pairwise Semantic Differences for Few-Shot Claim Veracity Classification

no code implementations11 May 2022 Xia Zeng, Arkaitz Zubiaga

In this paper, we introduce SEED, a novel vector-based method to few-shot claim veracity classification that aggregates pairwise semantic differences for claim-evidence pairs.

Classification Fact Checking +2

Building for Tomorrow: Assessing the Temporal Persistence of Text Classifiers

1 code implementation11 May 2022 Rabab Alkhalifa, Elena Kochkina, Arkaitz Zubiaga

Therefore an ability to predict a model's ability to persist over time can help design models that can be effectively used over a longer period of time.

text-classification Text Classification

Hidden behind the obvious: misleading keywords and implicitly abusive language on social media

no code implementations3 May 2022 Wenjie Yin, Arkaitz Zubiaga

While social media offers freedom of self-expression, abusive language carry significant negative social impact.

Abusive Language

Cyberbullying detection across social media platforms via platform-aware adversarial encoding

no code implementations1 Apr 2022 Peiling Yi, Arkaitz Zubiaga

Despite the increasing interest in cyberbullying detection, existing efforts have largely been limited to experiments on a single platform and their generalisability across different social media platforms have received less attention.

Sexism Identification in Tweets and Gabs using Deep Neural Networks

no code implementations5 Nov 2021 Amikul Kalra, Arkaitz Zubiaga

These networks are used in conjunction with transfer learning in the form of Bidirectional Encoder Representations from Transformers (BERT) and DistilBERT models, along with data augmentation, to perform binary and multiclass sexism classification on the dataset of tweets and gabs from the sEXism Identification in Social neTworks (EXIST) task in IberLEF 2021.

Classification Data Augmentation +2

Cross-lingual Hate Speech Detection using Transformer Models

no code implementations1 Nov 2021 Teodor Tiţa, Arkaitz Zubiaga

Hate speech detection within a cross-lingual setting represents a paramount area of interest for all medium and large-scale online platforms.

Hate Speech Detection

Automated Fact-Checking: A Survey

no code implementations23 Sep 2021 Xia Zeng, Amani S. Abumansour, Arkaitz Zubiaga

As online false information continues to grow, automated fact-checking has gained an increasing amount of attention in recent years.

Fact Checking Survey

A Longitudinal Multi-modal Dataset for Dementia Monitoring and Diagnosis

no code implementations3 Sep 2021 Dimitris Gkoumas, Bo wang, Adam Tsakalidis, Maria Wolters, Arkaitz Zubiaga, Matthew Purver, Maria Liakata

The corpus consists of spoken conversations, a subset of which are transcribed, as well as typed and written thoughts and associated extra-linguistic information such as pen strokes and keystrokes.

Capturing Stance Dynamics in Social Media: Open Challenges and Research Directions

no code implementations1 Sep 2021 Rabab Alkhalifa, Arkaitz Zubiaga

Social media platforms provide a goldmine for mining public opinion on issues of wide societal interest and impact.

Opinion Mining Stance Detection

Opinions are Made to be Changed: Temporally Adaptive Stance Classification

1 code implementation27 Aug 2021 Rabab Alkhalifa, Elena Kochkina, Arkaitz Zubiaga

We propose a novel approach to mitigate this performance drop, which is based on temporal adaptation of the word embeddings used for training the stance classifier.

Classification Stance Classification +1

Weakly Supervised Cross-platform Teenager Detection with Adversarial BERT

no code implementations24 Aug 2021 Peiling Yi, Arkaitz Zubiaga

Teenager detection is an important case of the age detection task in social media, which aims to detect teenage users to protect them from negative influences.

SWSR: A Chinese Dataset and Lexicon for Online Sexism Detection

no code implementations6 Aug 2021 Aiqi Jiang, Xiaohan Yang, Yang Liu, Arkaitz Zubiaga

We propose the first Chinese sexism dataset -- Sina Weibo Sexism Review (SWSR) dataset --, as well as a large Chinese lexicon SexHateLex made of abusive and gender-related terms.

Abusive Language

Cross-lingual Capsule Network for Hate Speech Detection in Social Media

no code implementations6 Aug 2021 Aiqi Jiang, Arkaitz Zubiaga

Most hate speech detection research focuses on a single language, generally English, which limits their generalisability to other languages.

Hate Speech Detection

Citizen Participation and Machine Learning for a Better Democracy

no code implementations28 Feb 2021 M. Arana-Catania, F. A. Van Lier, Rob Procter, Nataliya Tkachenko, Yulan He, Arkaitz Zubiaga, Maria Liakata

The development of democratic systems is a crucial task as confirmed by its selection as one of the Millennium Sustainable Development Goals by the United Nations.

BIG-bench Machine Learning Decision Making

Towards generalisable hate speech detection: a review on obstacles and solutions

no code implementations17 Feb 2021 Wenjie Yin, Arkaitz Zubiaga

Hate speech is one type of harmful online content which directly attacks or promotes hate towards a group or an individual member based on their actual or perceived aspects of identity, such as ethnicity, religion, and sexual orientation.

Hate Speech Detection

TF-CR: Weighting Embeddings for Text Classification

1 code implementation11 Dec 2020 Arkaitz Zubiaga

Text classification, as the task consisting in assigning categories to textual instances, is a very common task in information science.

General Classification text-classification +2

Detection and Resolution of Rumors and Misinformation with NLP

no code implementations COLING 2020 Leon Derczynski, Arkaitz Zubiaga

Detecting and grounding false and misleading claims on the web has grown to form a substantial sub-field of NLP.

Misinformation

An Online Multilingual Hate speech Recognition System

1 code implementation23 Nov 2020 Neeraj Vashistha, Arkaitz Zubiaga, Shanky Sharma

After attaining a competitive performance score, we create a tool which identifies and scores a page with effective metric in near-real time and uses the same as feedback to re-train our model.

Hate Speech Detection speech-recognition +1

QMUL-SDS @ SardiStance: Leveraging Network Interactions to Boost Performance on Stance Detection using Knowledge Graphs

no code implementations2 Nov 2020 Rabab Alkhalifa, Arkaitz Zubiaga

This paper presents our submission to the SardiStance 2020 shared task, describing the architecture used for Task A and Task B.

Avg Knowledge Graphs +1

Exploiting Class Labels to Boost Performance on Embedding-based Text Classification

no code implementations3 Jun 2020 Arkaitz Zubiaga

Text classification is one of the most frequent tasks for processing textual data, facilitating among others research from large-scale datasets.

General Classification text-classification +2

Leveraging Aspect Phrase Embeddings for Cross-Domain Review Rating Prediction

no code implementations14 Nov 2018 Aiqi Jiang, Arkaitz Zubiaga

Online review platforms are a popular way for users to post reviews by expressing their opinions towards a product or service, as well as they are valuable for other users and companies to find out the overall opinions of customers.

Towards Automated Factchecking: Developing an Annotation Schema and Benchmark for Consistent Automated Claim Detection

no code implementations21 Sep 2018 Lev Konstantinovskiy, Oliver Price, Mevan Babakar, Arkaitz Zubiaga

In an effort to assist factcheckers in the process of factchecking, we tackle the claim detection task, one of the necessary stages prior to determining the veracity of a claim.

Sentence

All-in-one: Multi-task Learning for Rumour Verification

no code implementations COLING 2018 Elena Kochkina, Maria Liakata, Arkaitz Zubiaga

We propose a multi-task learning approach that allows joint training of the main and auxiliary tasks, improving the performance of rumour verification.

General Classification Multi-Task Learning +2

Mining Social Media for Newsgathering: A Review

no code implementations10 Apr 2018 Arkaitz Zubiaga

Social media is becoming an increasingly important data source for learning about breaking news and for following the latest developments of ongoing news.

Early Detection of Social Media Hoaxes at Scale

no code implementations22 Jan 2018 Arkaitz Zubiaga, Aiqi Jiang

Our dataset represents a realistic scenario with a real distribution of true, commemorative and false stories, which we release for further use as a benchmark in future research.

Veracity Classification Word Embeddings

Discourse-Aware Rumour Stance Classification in Social Media Using Sequential Classifiers

no code implementations6 Dec 2017 Arkaitz Zubiaga, Elena Kochkina, Maria Liakata, Rob Procter, Michal Lukasik, Kalina Bontcheva, Trevor Cohn, Isabelle Augenstein

We show that sequential classifiers that exploit the use of discourse properties in social media conversations while using only local features, outperform non-sequential classifiers.

General Classification Stance Classification

Detection and Resolution of Rumours in Social Media: A Survey

no code implementations3 Apr 2017 Arkaitz Zubiaga, Ahmet Aker, Kalina Bontcheva, Maria Liakata, Rob Procter

Despite the increasing use of social media platforms for information and news gathering, its unmoderated nature often leads to the emergence and spread of rumours, i. e. pieces of information that are unverified at the time of posting.

Classification General Classification +4

TDParse: Multi-target-specific sentiment recognition on Twitter

no code implementations EACL 2017 Bo Wang, Maria Liakata, Arkaitz Zubiaga, Rob Procter

Existing target-specific sentiment recognition methods consider only a single target per tweet, and have been shown to miss nearly half of the actual targets mentioned.

Dependency Parsing Opinion Mining +1

Political Homophily in Independence Movements: Analysing and Classifying Social Media Users by National Identity

no code implementations27 Feb 2017 Arkaitz Zubiaga, Bo wang, Maria Liakata, Rob Procter

Independence movements occur in territories whose citizens have conflicting national identities; users with opposing national identities will then support or oppose the sense of being part of an independent nation that differs from the officially recognised country.

General Classification

Learning Reporting Dynamics during Breaking News for Rumour Detection in Social Media

2 code implementations24 Oct 2016 Arkaitz Zubiaga, Maria Liakata, Rob Procter

In this paper we introduce a novel approach to rumour detection that learns from the sequential dynamics of reporting during breaking news in social media to detect rumours in new stories.

Rumour Detection

Stance Classification in Rumours as a Sequential Task Exploiting the Tree Structure of Social Media Conversations

no code implementations COLING 2016 Arkaitz Zubiaga, Elena Kochkina, Maria Liakata, Rob Procter, Michal Lukasik

Rumour stance classification, the task that determines if each tweet in a collection discussing a rumour is supporting, denying, questioning or simply commenting on the rumour, has been attracting substantial interest.

General Classification Rumour Detection +1

Using Fuzzy Logic to Leverage HTML Markup for Web Page Representation

no code implementations14 Jun 2016 Alberto P. García-Plaza, Víctor Fresno, Raquel Martínez, Arkaitz Zubiaga

The selection of a suitable document representation approach plays a crucial role in the performance of a document clustering task.

Clustering

TweetMT: A Parallel Microblog Corpus

no code implementations LREC 2016 I{\~n}aki San Vicente, I{\~n}aki Alegr{\'\i}a, Cristina Espa{\~n}a-Bonet, Pablo Gamallo, Hugo Gon{\c{c}}alo Oliveira, Eva Mart{\'\i}nez Garcia, Antonio Toral, Arkaitz Zubiaga, Nora Aranberri

We introduce TweetMT, a parallel corpus of tweets in four language pairs that combine five languages (Spanish from/to Basque, Catalan, Galician and Portuguese), all of which have an official status in the Iberian Peninsula.

Machine Translation Translation

Towards Real-Time, Country-Level Location Classification of Worldwide Tweets

1 code implementation25 Apr 2016 Arkaitz Zubiaga, Alex Voss, Rob Procter, Maria Liakata, Bo wang, Adam Tsakalidis

In contrast to much previous work that has focused on location classification of tweets restricted to a specific country, here we undertake the task in a broader context by classifying global tweets at the country level, which is so far unexplored in a real-time scenario.

Classification General Classification

Real-Time Classification of Twitter Trends

no code implementations6 Mar 2014 Arkaitz Zubiaga, Damiano Spina, Raquel Martínez, Víctor Fresno

Social media users give rise to social trends as they share about common interests, which can be triggered by different reasons.

Classification General Classification +1

Cannot find the paper you are looking for? You can Submit a new open access paper.