1 code implementation • RANLP 2021 • Preslav Nakov, Firoj Alam, Shaden Shaar, Giovanni Da San Martino, Yifan Zhang
With the emergence of the COVID-19 pandemic, the political and the medical aspects of disinformation merged as the problem got elevated to a whole new level to become the first global infodemic.
no code implementations • EACL (BSNLP) 2021 • Jakub Piskorski, Bogdan Babych, Zara Kancheva, Olga Kanishcheva, Maria Lebedeva, Michał Marcińczuk, Preslav Nakov, Petya Osenova, Lidia Pivovarova, Senja Pollak, Pavel Přibáň, Ivaylo Radev, Marko Robnik-Sikonja, Vasyl Starko, Josef Steinberger, Roman Yangarber
Seven teams covered all six languages, and five teams participated in the cross-lingual entity linking task.
no code implementations • CONSTRAINT (ACL) 2022 • Shivam Sharma, Tharun Suresh, Atharva Kulkarni, Himanshi Mathur, Preslav Nakov, Md. Shad Akhtar, Tanmoy Chakraborty
We present the findings of the shared task at the CONSTRAINT 2022 Workshop: Hero, Villain, and Victim: Dissecting harmful memes for Semantic role labeling of entities.
no code implementations • 26 Nov 2024 • Harsh Singh, Rocktim Jyoti Das, Mingfei Han, Preslav Nakov, Ivan Laptev
Large Language Models (LLMs) have demonstrated remarkable planning abilities across various domains, including robotics manipulation and navigation.
no code implementations • 22 Oct 2024 • Yasser Ashraf, Yuxia Wang, Bin Gu, Preslav Nakov, Timothy Baldwin
The growing use of large language models (LLMs) has raised concerns regarding their safety.
no code implementations • 21 Oct 2024 • Manan Suri, Puneet Mathur, Franck Dernoncourt, Rajiv Jain, Vlad I Morariu, Ramit Sawhney, Preslav Nakov, Dinesh Manocha
Document structure editing involves manipulating localized textual, visual, and layout components in document images based on the user's requests.
1 code implementation • 17 Oct 2024 • Zhuohan Xie, Rui Xing, Yuxia Wang, Jiahui Geng, Hasan Iqbal, Dhruv Sahnan, Iryna Gurevych, Preslav Nakov
The typical approach to fact-checking these atomic claims involves retrieving a fixed number of pieces of evidence, followed by a verification step.
1 code implementation • 2 Oct 2024 • Haonan Li, Xudong Han, Hao Wang, Yuxia Wang, Minghan Wang, Rui Xing, Yilin Geng, Zenan Zhai, Preslav Nakov, Timothy Baldwin
We introduce Loki, an open-source tool designed to address the growing problem of misinformation.
no code implementations • 27 Sep 2024 • Saptarshi Sengupta, Wenpeng Yin, Preslav Nakov, Shreya Ghosh, Suhang Wang
In this paper, we investigate Extractive Question Answering (EQA) with Large Language Models (LLMs) under domain drift, i. e., can LLMs generalize well to closed-domains that require specific knowledge such as medicine and law in a zero-shot fashion without additional in-domain training?
no code implementations • 26 Sep 2024 • Guokan Shang, Hadi Abdine, Yousef Khoubrane, Amr Mohamed, Yassine Abbahaddou, Sofiane Ennadir, Imane Momayiz, Xuguang Ren, Eric Moulines, Preslav Nakov, Michalis Vazirgiannis, Eric Xing
We introduce Atlas-Chat, the first-ever collection of LLMs specifically developed for dialectal Arabic.
no code implementations • 19 Sep 2024 • Jiateng Liu, Lin Ai, Zizhou Liu, Payam Karisani, Zheng Hui, May Fung, Preslav Nakov, Julia Hirschberg, Heng Ji
Propaganda plays a critical role in shaping public opinion and fueling disinformation.
1 code implementation • 18 Sep 2024 • Xinyuan Lu, Liangming Pan, Yubo Ma, Preslav Nakov, Min-Yen Kan
Current Large Language Models (LLMs) exhibit limited ability to understand table structures and to apply precise numerical reasoning, which is crucial for tasks such as table question answering (TQA) and table-based fact verification (TFV).
1 code implementation • 31 Aug 2024 • Angel Beshirov, Milena Dobreva, Dimitar Dimitrov, Momchil Hardalov, Ivan Koychev, Preslav Nakov
We further develop a method for automatically generating synthetic data in this orthography, as well as in the subsequent Ivanchev orthography, by leveraging vast amounts of contemporary literature Bulgarian texts.
Optical Character Recognition Optical Character Recognition (OCR)
1 code implementation • 23 Aug 2024 • Max Glockner, Yufang Hou, Preslav Nakov, Iryna Gurevych
Health-related misinformation claims often falsely cite a credible biomedical publication as evidence, which superficially appears to support the false claim.
no code implementations • 20 Aug 2024 • Artem Vazhentsev, Ekaterina Fadeeva, Rui Xing, Alexander Panchenko, Preslav Nakov, Timothy Baldwin, Maxim Panov, Artem Shelmanov
Uncertainty quantification (UQ) is a perspective approach to detecting Large Language Model (LLM) hallucinations and low quality output.
1 code implementation • 8 Aug 2024 • Mervat Abassy, Kareem Elozeiri, Alexander Aziz, Minh Ngoc Ta, Raj Vardhan Tomar, Bimarsha Adhikari, Saad El Dine Ahmed, Yuxia Wang, Osama Mohammed Afzal, Zhuohan Xie, Jonibek Mansurov, Ekaterina Artemova, Vladislav Mikhailov, Rui Xing, Jiahui Geng, Hasan Iqbal, Zain Muhammad Mujahid, Tarek Mahmoud, Akim Tsvigun, Alham Fikri Aji, Artem Shelmanov, Nizar Habash, Iryna Gurevych, Preslav Nakov
Category (iii) aims to detect attempts to obfuscate the fact that a text was machine-generated, while category (iv) looks for cases where the LLM was used to polish a human-written text, which is typically acceptable in academic writing, but not in education.
2 code implementations • 6 Aug 2024 • Hasan Iqbal, Yuxia Wang, Minghan Wang, Georgi Georgiev, Jiahui Geng, Iryna Gurevych, Preslav Nakov
The increased use of large language models (LLMs) across a variety of real-world applications calls for automatic tools to check the factual accuracy of their outputs, as LLMs often hallucinate.
no code implementations • 13 Jul 2024 • Gurpreet Gosal, Yishi Xu, Gokul Ramakrishnan, Rituraj Joshi, Avraham Sheinin, Zhiming, Chen, Biswajit Mishra, Natalia Vassilieva, Joel Hestness, Neha Sengupta, Sunil Kumar Sahu, Bokang Jia, Onkar Pandit, Satheesh Katipomu, Samta Kamboj, Samujjwal Ghosh, Rahul Pal, Parvez Mullah, Soundar Doraiswamy, Mohamed El Karim Chami, Preslav Nakov
By continually pre-training on a mix of Arabic and English corpora, the model retains its proficiency in English while acquiring capabilities in Arabic.
1 code implementation • 28 Jun 2024 • Sukmin Yun, Haokun Lin, Rusiru Thushara, Mohammad Qazim Bhat, Yongxin Wang, Zutao Jiang, Mingkai Deng, Jinhong Wang, Tianhua Tao, Junbo Li, Haonan Li, Preslav Nakov, Timothy Baldwin, Zhengzhong Liu, Eric P. Xing, Xiaodan Liang, Zhiqiang Shen
To address this problem, we propose $\texttt{Web2Code}$, a benchmark consisting of a new large-scale webpage-to-code dataset for instruction tuning and an evaluation framework for the webpage understanding and HTML code translation abilities of MLLMs.
3 code implementations • 21 Jun 2024 • Roman Vashurin, Ekaterina Fadeeva, Artem Vazhentsev, Lyudmila Rvanova, Akim Tsvigun, Daniil Vasilev, Rui Xing, Abdelrahman Boda Sadallah, Kirill Grishchenkov, Sergey Petrakov, Alexander Panchenko, Timothy Baldwin, Preslav Nakov, Maxim Panov, Artem Shelmanov
However, research to date on UQ methods for LLMs has been fragmented, in terms of the literature on UQ techniques and evaluation methods.
1 code implementation • 17 Jun 2024 • Muhammad Arslan Manzoor, Yuxia Wang, Minghan Wang, Preslav Nakov
Our systematic exploration of LMs' understanding of empathy reveals substantial opportunities for further investigation in both task formulation and modeling.
no code implementations • 16 Jun 2024 • Jad Doughman, Osama Mohammed Afzal, Hawau Olamide Toyin, Shady Shehata, Preslav Nakov, Zeerak Talat
We find that classifiers are highly sensitive to stylistic changes and differences in text complexity, and in some cases degrade entirely to random classifiers.
1 code implementation • 7 Jun 2024 • Jinyan Su, Preslav Nakov, Claire Cardie
Dense retrievers are widely used in information retrieval and have also been successfully extended to other knowledge intensive areas such as language models, e. g., Retrieval-Augmented Generation (RAG) systems.
1 code implementation • 5 Jun 2024 • Max Glockner, Yufang Hou, Preslav Nakov, Iryna Gurevych
Unlike previous fallacy detection datasets, Missci (i) focuses on implicit fallacies between the relevant content of the cited publication and the inaccurate claim, and (ii) requires models to verbalize the fallacious reasoning in addition to classifying it.
no code implementations • 18 May 2024 • Siddhant Agarwal, Shivam Sharma, Preslav Nakov, Tanmoy Chakraborty
Memes have evolved as a prevalent medium for diverse communication, ranging from humour to propaganda.
4 code implementations • 9 May 2024 • Yuxia Wang, Minghan Wang, Hasan Iqbal, Georgi Georgiev, Jiahui Geng, Preslav Nakov
The increased use of large language models (LLMs) across a variety of real-world applications calls for mechanisms to verify the factual accuracy of their outputs.
no code implementations • 26 Apr 2024 • Teresa Lynn, Malik H. Altakrori, Samar Mohamed Magdy, Rocktim Jyoti Das, Chenyang Lyu, Mohamed Nasr, Younes Samih, Alham Fikri Aji, Preslav Nakov, Shantanu Godbole, Salim Roukos, Radu Florian, Nizar Habash
The rapid evolution of Natural Language Processing (NLP) has favored major languages such as English, leaving a significant gap for many others due to limited resources.
1 code implementation • 22 Apr 2024 • Yuxia Wang, Jonibek Mansurov, Petar Ivanov, Jinyan Su, Artem Shelmanov, Akim Tsvigun, Osama Mohammed Afzal, Tarek Mahmoud, Giovanni Puccetti, Thomas Arnold, Chenxi Whitehouse, Alham Fikri Aji, Nizar Habash, Iryna Gurevych, Preslav Nakov
The task attracted a large number of participants: subtask A monolingual (126), subtask A multilingual (59), subtask B (70), and subtask C (30).
1 code implementation • 15 Mar 2024 • Rocktim Jyoti Das, Simeon Emilov Hristov, Haonan Li, Dimitar Iliyanov Dimitrov, Ivan Koychev, Preslav Nakov
Solving the problems in the dataset requires advanced perception and joint reasoning over the text and the visual content of the image.
1 code implementation • 7 Mar 2024 • Ekaterina Fadeeva, Aleksandr Rubashevskii, Artem Shelmanov, Sergey Petrakov, Haonan Li, Hamdy Mubarak, Evgenii Tsymbalov, Gleb Kuzmin, Alexander Panchenko, Timothy Baldwin, Preslav Nakov, Maxim Panov
Uncertainty scores leverage information encapsulated in the output of a neural network or its layers to detect unreliable predictions, and we show that they can be used to fact-check the atomic claims in the LLM output.
no code implementations • 6 Mar 2024 • Jiahui Geng, Yova Kementchedjhieva, Preslav Nakov, Iryna Gurevych
To the best of our knowledge, we are the first to evaluate MLLMs for real-world fact-checking.
1 code implementation • 20 Feb 2024 • Fajri Koto, Haonan Li, Sara Shatnawi, Jad Doughman, Abdelrahman Boda Sadallah, Aisha Alraeesi, Khalid Almubarak, Zaid Alyafeai, Neha Sengupta, Shady Shehata, Nizar Habash, Preslav Nakov, Timothy Baldwin
The focus of language model evaluation has transitioned towards reasoning and knowledge-intensive tasks, driven by advancements in pretraining large models.
1 code implementation • 19 Feb 2024 • Yuxia Wang, Zenan Zhai, Haonan Li, Xudong Han, Lizhi Lin, Zhenxuan Zhang, Jingru Zhao, Preslav Nakov, Timothy Baldwin
Previous studies have proposed comprehensive taxonomies of the risks posed by LLMs, as well as corresponding prompts that can be used to examine the safety mechanisms of LLMs.
1 code implementation • 17 Feb 2024 • Yuxia Wang, Jonibek Mansurov, Petar Ivanov, Jinyan Su, Artem Shelmanov, Akim Tsvigun, Osama Mohanned Afzal, Tarek Mahmoud, Giovanni Puccetti, Thomas Arnold, Alham Fikri Aji, Nizar Habash, Iryna Gurevych, Preslav Nakov
The advent of Large Language Models (LLMs) has brought an unprecedented surge in machine-generated text (MGT) across diverse channels.
no code implementations • 4 Feb 2024 • Yuxia Wang, Minghan Wang, Muhammad Arslan Manzoor, Fei Liu, Georgi Georgiev, Rocktim Jyoti Das, Preslav Nakov
Large language models (LLMs), especially when instruction-tuned for chat, have become part of our daily lives, freeing people from the process of searching, extracting, and integrating information from multiple sources by offering a straightforward answer to a variety of questions in a single place.
no code implementations • 30 Jan 2024 • Ming Shan Hee, Shivam Sharma, Rui Cao, Palash Nandi, Preslav Nakov, Tanmoy Chakraborty, Roy Ka-Wei Lee
In the evolving landscape of online communication, moderating hate speech (HS) presents an intricate challenge, compounded by the multimodal nature of digital content.
1 code implementation • 23 Jan 2024 • Iman Munire Bilal, Preslav Nakov, Rob Procter, Maria Liakata
The task of rumour verification in social media concerns assessing the veracity of a claim on the basis of conversation threads that result from it.
1 code implementation • 11 Dec 2023 • Zhengzhong Liu, Aurick Qiao, Willie Neiswanger, Hongyi Wang, Bowen Tan, Tianhua Tao, Junbo Li, Yuqi Wang, Suqi Sun, Omkar Pangarkar, Richard Fan, Yi Gu, Victor Miller, Yonghao Zhuang, Guowei He, Haonan Li, Fajri Koto, Liping Tang, Nikhil Ranjan, Zhiqiang Shen, Xuguang Ren, Roberto Iriondo, Cun Mu, Zhiting Hu, Mark Schulze, Preslav Nakov, Tim Baldwin, Eric P. Xing
The recent surge in open-source Large Language Models (LLMs), such as LLaMA, Falcon, and Mistral, provides diverse options for AI practitioners and researchers.
1 code implementation • 16 Nov 2023 • Tariq Alhindi, Smaranda Muresan, Preslav Nakov
In this study, we aim to enhance existing models for fallacy recognition by incorporating additional context and by leveraging large language models to generate synthetic data, thus increasing the representation of the infrequent classes.
2 code implementations • 15 Nov 2023 • Yuxia Wang, Revanth Gangi Reddy, Zain Muhammad Mujahid, Arnav Arora, Aleksandr Rubashevskii, Jiahui Geng, Osama Mohammed Afzal, Liangming Pan, Nadav Borenstein, Aditya Pillai, Isabelle Augenstein, Iryna Gurevych, Preslav Nakov
The increased use of large language models (LLMs) across a variety of real-world applications calls for mechanisms to verify the factual accuracy of their outputs.
no code implementations • 14 Nov 2023 • Jiahui Geng, Fengyu Cai, Yuxia Wang, Heinz Koeppl, Preslav Nakov, Iryna Gurevych
Assessing their confidence and calibrating them across different tasks can help mitigate risks and enable LLMs to produce better generations.
1 code implementation • 11 Nov 2023 • Luke Bates, Peter Ebert Christensen, Preslav Nakov, Iryna Gurevych
Here, to aid understanding of memes, we release a knowledge base of memes and information found on www. knowyourmeme. com, which we call the Know Your Meme Knowledge Base (KYMKB), composed of more than 54, 000 images.
no code implementations • 6 Nov 2023 • Maram Hasanain, Firoj Alam, Hamdy Mubarak, Samir Abdaljalil, Wajdi Zaghouani, Preslav Nakov, Giovanni Da San Martino, Abed Alhakim Freihat
We present an overview of the ArAIEval shared task, organized as part of the first ArabicNLP 2023 conference co-located with EMNLP 2023.
1 code implementation • 2 Nov 2023 • Jinyan Su, Claire Cardie, Preslav Nakov
With the proliferation of both human-written and machine-generated real and fake news, robustly and effectively discerning the veracity of news articles has become an intricate challenge.
1 code implementation • 27 Oct 2023 • Shubham Mittal, Megha Sundriyal, Preslav Nakov
Claim span identification (CSI) is an important step in fact-checking pipelines, aiming to identify text segments that contain a checkworthy claim or assertion in a social media post.
1 code implementation • 25 Oct 2023 • Saptarshi Sengupta, Connor Heaton, Shreya Ghosh, Preslav Nakov, Prasenjit Mitra
Domain adaptation, the process of training a model in one domain and applying it to another, has been extensively explored in machine learning.
1 code implementation • 22 Oct 2023 • Megha Sundriyal, Tanmoy Chakraborty, Preslav Nakov
To evaluate the effectiveness of our proposed model, we meticulously compile a comprehensive real-world dataset, CLAN, comprising more than 6k instances of social media posts alongside their respective normalized claims.
1 code implementation • 11 Oct 2023 • Liangming Pan, Xinyuan Lu, Min-Yen Kan, Preslav Nakov
Fact-checking real-world claims often requires complex, multi-step reasoning due to the absence of direct evidence to support or refute them.
no code implementations • 8 Oct 2023 • Isabelle Augenstein, Timothy Baldwin, Meeyoung Cha, Tanmoy Chakraborty, Giovanni Luca Ciampaglia, David Corney, Renee DiResta, Emilio Ferrara, Scott Hale, Alon Halevy, Eduard Hovy, Heng Ji, Filippo Menczer, Ruben Miguez, Preslav Nakov, Dietram Scheufele, Shivam Sharma, Giovanni Zagni
The emergence of tools based on Large Language Models (LLMs), such as OpenAI's ChatGPT, Microsoft's Bing Chat, and Google's Bard, has garnered immense public attention.
no code implementations • 16 Sep 2023 • Yuxia Wang, Minghan Wang, Preslav Nakov
Recent years have seen the rise of large language models (LLMs), where practitioners use task-specific prompts; this was shown to be effective for a variety of tasks.
no code implementations • 15 Sep 2023 • Jinyan Su, Terry Yue Zhuo, Jonibek Mansurov, Di Wang, Preslav Nakov
The spread of fake news has emerged as a critical challenge, undermining trust and posing threats to society.
no code implementations • 13 Sep 2023 • Georgi Pachov, Dimitar Dimitrov, Ivan Koychev, Preslav Nakov
Thus, subjectivity detection can play an important role in ensuring the objectiveness and the quality of a piece of information.
no code implementations • 30 Aug 2023 • Neha Sengupta, Sunil Kumar Sahu, Bokang Jia, Satheesh Katipomu, Haonan Li, Fajri Koto, William Marshall, Gurpreet Gosal, Cynthia Liu, Zhiming Chen, Osama Mohammed Afzal, Samta Kamboj, Onkar Pandit, Rahul Pal, Lalit Pradhan, Zain Muhammad Mujahid, Massa Baali, Xudong Han, Sondos Mahmoud Bsharat, Alham Fikri Aji, Zhiqiang Shen, Zhengzhong Liu, Natalia Vassilieva, Joel Hestness, Andy Hock, Andrew Feldman, Jonathan Lee, Andrew Jackson, Hector Xuguang Ren, Preslav Nakov, Timothy Baldwin, Eric Xing
We release two open versions of the model -- the foundation Jais model, and an instruction-tuned Jais-chat variant -- with the aim of promoting research on Arabic LLMs.
1 code implementation • 25 Aug 2023 • Yuxia Wang, Haonan Li, Xudong Han, Preslav Nakov, Timothy Baldwin
With the rapid evolution of large language models (LLMs), new and hard-to-predict harmful capabilities are emerging.
2 code implementations • 4 Jun 2023 • Momchil Hardalov, Pepa Atanasova, Todor Mihaylov, Galia Angelova, Kiril Simov, Petya Osenova, Ves Stoyanov, Ivan Koychev, Preslav Nakov, Dragomir Radev
We run the first systematic evaluation of pre-trained language models for Bulgarian, comparing and contrasting results across the nine tasks in the benchmark.
no code implementations • 28 May 2023 • Mugariya Farooq, Shahad Hardan, Aigerim Zhumbhayeva, Yujia Zheng, Preslav Nakov, Kun Zhang
The need for more usable and explainable machine learning models in healthcare increases the importance of developing and utilizing causal discovery algorithms, which aim to discover causal relations by analyzing observational data.
1 code implementation • 24 May 2023 • Petar Ivanov, Ivan Koychev, Momchil Hardalov, Preslav Nakov
Developing tools to automatically detect check-worthy claims in political debates and speeches can greatly help moderators of debates, journalists, and fact-checkers.
2 code implementations • 24 May 2023 • Yuxia Wang, Jonibek Mansurov, Petar Ivanov, Jinyan Su, Artem Shelmanov, Akim Tsvigun, Chenxi Whitehouse, Osama Mohammed Afzal, Tarek Mahmoud, Toru Sasaki, Thomas Arnold, Alham Fikri Aji, Nizar Habash, Iryna Gurevych, Preslav Nakov
These results show that the problem is far from solved and that there is a lot of room for improvement.
1 code implementation • 23 May 2023 • Jinyan Su, Terry Yue Zhuo, Di Wang, Preslav Nakov
One is called DetectLLM-LRR, which is fast and efficient, and the other is called DetectLLM-NPR, which is more accurate, but slower due to the need for perturbations.
1 code implementation • 23 May 2023 • Yikang Pan, Liangming Pan, Wenhu Chen, Preslav Nakov, Min-Yen Kan, William Yang Wang
In this paper, we comprehensively investigate the potential misuse of modern Large Language Models (LLMs) for generating credible-sounding misinformation and its subsequent impact on information-intensive applications, particularly Open-Domain Question Answering (ODQA) systems.
1 code implementation • 23 May 2023 • Muhammad Umar Salman, Asif Hanif, Shady Shehata, Preslav Nakov
Yet, it is common to find a mix of multiple languages in social media communication, a phenomenon known as code-switching.
1 code implementation • 22 May 2023 • Liangming Pan, Xiaobao Wu, Xinyuan Lu, Anh Tuan Luu, William Yang Wang, Min-Yen Kan, Preslav Nakov
Fact-checking real-world claims often requires collecting multiple pieces of evidence and applying complex multi-step reasoning.
1 code implementation • 22 May 2023 • Xinyuan Lu, Liangming Pan, Qian Liu, Preslav Nakov, Min-Yen Kan
Current scientific fact-checking benchmarks exhibit several shortcomings, such as biases arising from crowd-sourced claims and an over-reliance on text-based evidence.
no code implementations • 5 May 2023 • Maram Hasanain, Ahmed Oumar El-Shangiti, Rabindra Nath Nandi, Preslav Nakov, Firoj Alam
This paper describes our participating system to this task.
no code implementations • 20 Apr 2023 • Qisheng Liao, Meiting Lai, Preslav Nakov
This paper describes our system for SemEval-2023 Task 3 Subtask 2 on Framing Detection.
no code implementations • 13 Apr 2023 • Ashraf Haddad, Najwa Aaraj, Preslav Nakov, Septimiu Fabian Mare
In recent years, a proliferation of cyber-security threats and diversity has been on the rise culminating in an increase in their reporting and analysis.
1 code implementation • 1 Feb 2023 • Muhammad Arslan Manzoor, Sarah Albarri, Ziting Xian, Zaiqiao Meng, Preslav Nakov, Shangsong Liang
This survey presents the comprehensive literature on the evolution and enhancement of deep learning multimodal architectures to deal with textual, visual and audio features for diverse cross-modal and modern multimodal tasks.
no code implementations • 26 Jan 2023 • Shivam Sharma, Atharva Kulkarni, Tharun Suresh, Himanshi Mathur, Preslav Nakov, Md. Shad Akhtar, Tanmoy Chakraborty
A common problem associated with meme comprehension lies in detecting the entities referenced and characterizing the role of each of these entities.
no code implementations • 17 Jan 2023 • Serena Tardelli, Leonardo Nizzoli, Maurizio Tesconi, Mauro Conti, Preslav Nakov, Giovanni Da San Martino, Stefano Cresci
Large-scale online campaigns, malicious or otherwise, require a significant degree of coordination among participants, which sparked interest in the study of coordinated online behavior.
1 code implementation • 1 Dec 2022 • Shivam Sharma, Siddhant Agarwal, Tharun Suresh, Preslav Nakov, Md. Shad Akhtar, Tanmoy Chakraborty
Here, we introduce a novel task - EXCLAIM, generating explanations for visual semantic role labeling in memes.
no code implementations • 18 Nov 2022 • Firoj Alam, Hamdy Mubarak, Wajdi Zaghouani, Giovanni Da San Martino, Preslav Nakov
Thus, there has been a lot of recent research on automatic detection of propaganda techniques in text as well as in memes.
no code implementations • 10 Nov 2022 • Panayot Panayotov, Utsav Shukla, Husrev Taha Sencar, Mohamed Nabeel, Preslav Nakov
We study the problem of profiling news media on the Web with respect to their factuality of reporting and bias.
1 code implementation • 5 Nov 2022 • Zihui Gu, Ju Fan, Nan Tang, Preslav Nakov, Xiaoman Zhao, Xiaoyong Du
In particular, on the complex set of TabFact, which contains multiple operations, PASTA largely outperforms the previous state of the art by 4. 7 points (85. 6% vs. 80. 9%), and the gap between PASTA and human performance on the small TabFact test set is narrowed to just 1. 5 points (90. 6% vs. 92. 1%).
Ranked #2 on Table-based Fact Verification on TabFact
1 code implementation • 31 Oct 2022 • Shubham Mittal, Preslav Nakov
In addition to finding the techniques, Subtask 2 further asks to identify the textual span for each instance of each technique that is present in the tweet; the task can be modeled as a sequence tagging problem.
1 code implementation • 10 Oct 2022 • Momchil Hardalov, Anton Chernyavskiy, Ivan Koychev, Dmitry Ilvovsky, Preslav Nakov
Thus, an interesting approach has emerged: to perform automatic fact-checking by verifying whether an input claim has been previously fact-checked by professional fact-checkers and to return back an article that explains their decision.
no code implementations • 1 Oct 2022 • Sanjay Chawla, Preslav Nakov, Ahmed Ali, Wendy Hall, Issa Khalil, Xiaosong Ma, Husrev Taha Sencar, Ingmar Weber, Michael Wooldridge, Ting Yu
The rise of attention networks, self-supervised learning, generative modeling, and graph neural networks has widened the application space of AI.
1 code implementation • Findings (NAACL) 2022 • Shivam Sharma, Md. Shad Akhtar, Preslav Nakov, Tanmoy Chakraborty
Finally, we show that DISARM is interpretable and comparatively more generalizable and that it can reduce the relative error rate for harmful target identification by up to 9 points absolute over several strong multimodal rivals.
no code implementations • DravidianLangTech (ACL) 2022 • Rabindra Nath Nandi, Firoj Alam, Preslav Nakov
The spread of fake news, propaganda, misinformation, disinformation, and harmful content online raised concerns among social media platforms, government agencies, policymakers, and society as a whole.
1 code implementation • CONSTRAINT (ACL) 2022 • Rabindra Nath Nandi, Firoj Alam, Preslav Nakov
The content that is posted and shared online can be textual, visual, or a combination of both, e. g., in a meme.
1 code implementation • 9 May 2022 • Shivam Sharma, Firoj Alam, Md. Shad Akhtar, Dimitar Dimitrov, Giovanni Da San Martino, Hamed Firooz, Alon Halevy, Fabrizio Silvestri, Preslav Nakov, Tanmoy Chakraborty
One interesting finding is that many types of harmful memes are not really studied, e. g., such featuring self-harm and extremism, partly due to the lack of suitable datasets.
1 code implementation • 10 Mar 2022 • Kung-Hsiang Huang, Kathleen McKeown, Preslav Nakov, Yejin Choi, Heng Ji
Despite recent advances in detecting fake news generated by neural models, their results are not readily applicable to effective detection of human-written disinformation.
no code implementations • 8 Mar 2022 • Preslav Nakov, Firoj Alam, Yifan Zhang, Animesh Prakash, Fahim Dalvi
Fighting the ongoing COVID-19 infodemic has been declared as one of the most important focus areas by the World Health Organization since the onset of the COVID-19 pandemic.
1 code implementation • 22 Jan 2022 • Kristiyan Vachev, Momchil Hardalov, Georgi Karadzhov, Georgi Georgiev, Ivan Koychev, Preslav Nakov
Testing with quiz questions has proven to be an effective way to assess and improve the educational process.
2 code implementations • 16 Dec 2021 • Revanth Gangi Reddy, Sai Chetan, Zhenhailong Wang, Yi R. Fung, Kathryn Conger, Ahmed Elsayed, Martha Palmer, Preslav Nakov, Eduard Hovy, Kevin Small, Heng Ji
In this work, we present NewsClaims, a new benchmark for attribute-aware claim detection in the news domain.
no code implementations • NAACL 2022 • Anton Chernyavskiy, Dmitry Ilvovsky, Pavel Kalinin, Preslav Nakov
The use of contrastive loss for representation learning has become prominent in computer vision, and it is now getting attention in Natural Language Processing (NLP).
no code implementations • 27 Sep 2021 • Preslav Nakov, Hwee Tou Ng
We propose a novel approach to translating from a morphologically complex language.
no code implementations • RANLP 2013 • Jörg Tiedemann, Preslav Nakov
This paper provides an analysis of character-level machine translation models used in pivot-based translation when applied to sparse and noisy datasets, such as crowdsourced movie subtitles.
no code implementations • RANLP 2015 • Dame Jovanoski, Veno Pachovski, Preslav Nakov
We present work on sentiment analysis in Twitter for Macedonian.
no code implementations • 27 Sep 2021 • Kristina Hristakieva, Stefano Cresci, Giovanni Da San Martino, Mauro Conti, Preslav Nakov
Large-scale manipulations on social media have two important characteristics: (i) use of propaganda to influence others, and (ii) adoption of coordinated behavior to spread it and to amplify its impact.
no code implementations • RANLP 2015 • Todor Mihaylov, Ivan Koychev, Georgi Georgiev, Preslav Nakov
Recently, Web forums have been invaded by opinion manipulation trolls.
no code implementations • 26 Sep 2021 • Georgi Georgiev, Preslav Nakov, Kuzman Ganchev, Petya Osenova, Kiril Ivanov Simov
The paper presents a feature-rich approach to the automatic recognition and categorization of named entities (persons, organizations, locations, and miscellaneous) in news text for Bulgarian.
no code implementations • SemEval (ACL) 2016 • Tsvetomila Mihaylova, Pepa Gencheva, Martin Boyanov, Ivana Yovcheva, Todor Mihaylov, Momchil Hardalov, Yasen Kiprov, Daniel Balchev, Ivan Koychev, Preslav Nakov, Ivelina Nikolova, Galia Angelova
We present the system we built for participating in SemEval-2016 Task 3 on Community Question Answering.
no code implementations • 25 Sep 2021 • Preslav Nakov
We propose a novel monolingual sentence paraphrasing method for augmenting the training data for statistical machine translation systems "for free" -- by creating it from data that is already available rather than having to create more aligned data.
no code implementations • 25 Sep 2021 • Tamer Elsayed, Preslav Nakov, Alberto Barrón-Cedeño, Maram Hasanain, Reem Suwaileh, Giovanni Da San Martino, Pepa Atanasova
We present an overview of the second edition of the CheckThat!
no code implementations • Findings (ACL) 2021 • Shraman Pramanick, Dimitar Dimitrov, Rituparna Mukherjee, Shivam Sharma, Md. Shad Akhtar, Preslav Nakov, Tanmoy Chakraborty
In this work, we propose two novel problem formulations: detecting harmful memes and the social entities that these harmful memes target.
1 code implementation • EMNLP 2021 • Mohammed Saeed, Naser Ahmadi, Preslav Nakov, Paolo Papotti
While pre-trained language models (PLMs) are the go-to solution to tackle many natural language processing problems, they are still very limited in their ability to capture and to use common-sense knowledge.
no code implementations • 23 Sep 2021 • Preslav Nakov, Giovanni Da San Martino, Tamer Elsayed, Alberto Barrón-Cedeño, Rubén Míguez, Shaden Shaar, Firoj Alam, Fatima Haouari, Maram Hasanain, Watheq Mansour, Bayan Hamdan, Zien Sheikh Ali, Nikolay Babulkov, Alex Nikolov, Gautam Kishore Shahi, Julia Maria Struß, Thomas Mandl, Mucahid Kutlu, Yavuz Selim Kartal
We describe the fourth edition of the CheckThat!
no code implementations • NAACL (NLP4IF) 2021 • Shaden Shaar, Firoj Alam, Giovanni Da San Martino, Alex Nikolov, Wajdi Zaghouani, Preslav Nakov, Anna Feldman
Here, we present the tasks, analyze the results, and discuss the system submissions and the methods they used.
no code implementations • RANLP 2021 • Preslav Nakov, Firoj Alam, Shaden Shaar, Giovanni Da San Martino, Yifan Zhang
While COVID-19 vaccines are finally becoming widely available, a second pandemic that revolves around the circulation of anti-vaxxer fake news may hinder efforts to recover from the first one.
1 code implementation • 14 Sep 2021 • Shaden Shaar, Nikola Georgiev, Firoj Alam, Giovanni Da San Martino, Aisha Mohamed, Preslav Nakov
The output is a re-ranked list of the document sentences, so that those that can be verified are ranked as high as possible, together with corresponding evidence.
1 code implementation • 13 Sep 2021 • Momchil Hardalov, Arnav Arora, Preslav Nakov, Isabelle Augenstein
Most research in stance detection, however, has been limited to working with a single language and on a few limited targets, with little work on cross-lingual stance detection.
1 code implementation • Findings (EMNLP) 2021 • Shraman Pramanick, Shivam Sharma, Dimitar Dimitrov, Md Shad Akhtar, Preslav Nakov, Tanmoy Chakraborty
We focus on two tasks: (i)detecting harmful memes, and (ii)identifying the social entities they target.
no code implementations • RANLP 2021 • Seunghak Yu, Giovanni Da San Martino, Mitra Mohtarami, James Glass, Preslav Nakov
Online users today are exposed to misleading and propagandistic news articles and media posts on a daily basis.
no code implementations • RANLP 2021 • Kristiyan Vachev, Momchil Hardalov, Georgi Karadzhov, Georgi Georgiev, Ivan Koychev, Preslav Nakov
In education, open-ended quiz questions have become an important tool for assessing the knowledge of students.
no code implementations • RANLP 2021 • Krasimira Bozhanova, Yoan Dinkov, Ivan Koychev, Maria Castaldo, Tommaso Venturini, Preslav Nakov
We propose a novel framework for predicting the factuality of reporting of news media outlets by studying the user attention cycles in their YouTube channels.
1 code implementation • ACL 2021 • Dimitar Dimitrov, Bishr Bin Ali, Shaden Shaar, Firoj Alam, Fabrizio Silvestri, Hamed Firooz, Preslav Nakov, Giovanni Da San Martino
We further create and release a new corpus of 950 memes, carefully annotated with 22 propaganda techniques, which can appear in the text, in the image, or in both.
1 code implementation • NAACL (NLP4IF) 2021 • Tariq Alhindi, Amal Alabdulkarim, Ali Alshehri, Muhammad Abdul-Mageed, Preslav Nakov
With the continuing spread of misinformation and disinformation online, it is of increasing importance to develop combating mechanisms at scale in the form of automated systems that support multiple languages.
1 code implementation • SEMEVAL 2021 • Dimitar Dimitrov, Bishr Bin Ali, Shaden Shaar, Firoj Alam, Fabrizio Silvestri, Hamed Firooz, Preslav Nakov, Giovanni Da San Martino
We describe SemEval-2021 task 6 on Detection of Persuasion Techniques in Texts and Images: the data, the annotation guidelines, the evaluation setup, the results, and the participating systems.
no code implementations • 16 Apr 2021 • Anton Chernyavskiy, Dmitry Ilvovsky, Preslav Nakov
The rise of Internet has made it a major source of information.
2 code implementations • EMNLP 2021 • Momchil Hardalov, Arnav Arora, Preslav Nakov, Isabelle Augenstein
In this paper, we perform an in-depth analysis of 16 stance detection datasets, and we explore the possibility for cross-domain learning from them.
1 code implementation • Findings (NAACL) 2022 • Shaden Shaar, Firoj Alam, Giovanni Da San Martino, Preslav Nakov
Recent years have seen the proliferation of disinformation and fake news online.
no code implementations • 9 Apr 2021 • Anton Chernyavskiy, Dmitry Ilvovsky, Preslav Nakov
Recent advances in neural architectures, such as the Transformer, coupled with the emergence of large-scale pre-trained models such as BERT, have revolutionized the field of Natural Language Processing (NLP), pushing the state of the art for a number of NLP tasks.
no code implementations • 31 Mar 2021 • Sheikh Muhammad Sarwar, Dimitrina Zlatkova, Momchil Hardalov, Yoan Dinkov, Isabelle Augenstein, Preslav Nakov
The framework is based on a nearest-neighbour architecture.
no code implementations • 16 Mar 2021 • Preslav Nakov, Husrev Taha Sencar, Jisun An, Haewoon Kwak
The present level of proliferation of fake, biased, and propagandistic content online has made it impossible to fact-check every single suspicious claim or article, either manually or automatically.
no code implementations • COLING 2022 • Firoj Alam, Stefano Cresci, Tanmoy Chakraborty, Fabrizio Silvestri, Dimiter Dimitrov, Giovanni Da San Martino, Shaden Shaar, Hamed Firooz, Preslav Nakov
As a result, researchers started leveraging different modalities and combinations thereof to tackle online multimodal offensive content.
no code implementations • 13 Mar 2021 • Preslav Nakov, David Corney, Maram Hasanain, Firoj Alam, Tamer Elsayed, Alberto Barrón-Cedeño, Paolo Papotti, Shaden Shaar, Giovanni Da San Martino
The reporting and the analysis of current events around the globe has expanded from professional, editor-lead journalism all the way to citizen journalism.
no code implementations • 27 Feb 2021 • Arnav Arora, Preslav Nakov, Momchil Hardalov, Sheikh Muhammad Sarwar, Vibha Nayak, Yoan Dinkov, Dimitrina Zlatkova, Kyle Dent, Ameya Bhatawdekar, Guillaume Bouchard, Isabelle Augenstein
The proliferation of harmful content on online platforms is a major societal problem, which comes in many different forms including hate speech, offensive language, bullying and harassment, misinformation, spam, violence, graphic content, sexual abuse, self harm, and many other.
no code implementations • Findings (NAACL) 2022 • Momchil Hardalov, Arnav Arora, Preslav Nakov, Isabelle Augenstein
Understanding attitudes expressed in texts, also known as stance detection, plays an important role in systems for detecting false information online, be it misinformation (unintentionally false) or disinformation (intentionally false information).
no code implementations • SEMEVAL 2020 • Giovanni Da San Martino, Alberto Barr{\'o}n-Cede{\~n}o, Henning Wachsmuth, Rostislav Petrov, Preslav Nakov
We present the results and the main findings of SemEval-2020 Task 11 on Detection of Propaganda Techniques in News Articles.
2 code implementations • 20 Nov 2020 • Ileana Rugina, Rumen Dangovski, Li Jing, Preslav Nakov, Marin Soljačić
Attention mechanisms play a crucial role in the neural revolution of Natural Language Processing (NLP).
2 code implementations • EMNLP 2020 • Momchil Hardalov, Todor Mihaylov, Dimitrina Zlatkova, Yoan Dinkov, Ivan Koychev, Preslav Nakov
We perform various experiments with existing top-performing multilingual pre-trained models and we show that EXAMS offers multiple challenges that require multilingual knowledge and reasoning in multiple domains.
no code implementations • EMNLP 2020 • Preslav Nakov, Giovanni Da San Martino
The rise of social media has democratized content creation and has made it easy for everybody to share and spread information online.
1 code implementation • EMNLP 2020 • Ramy Baly, Giovanni Da San Martino, James Glass, Preslav Nakov
We explore the task of predicting the leading political ideology or bias of news articles.
no code implementations • EMNLP 2020 • Matthew Khoury, Rumen Dangovski, Longwu Ou, Preslav Nakov, Yichen Shen, Li Jing
To address this issue, we propose a novel vector-vector-matrix architecture (VVMA), which greatly reduces the latency at inference time for NMT.
3 code implementations • 7 Sep 2020 • Alex Nikolov, Giovanni Da San Martino, Ivan Koychev, Preslav Nakov
While misinformation and disinformation have been thriving in social media for years, with the emergence of the COVID-19 pandemic, the political and the health misinformation merged, thus elevating the problem to a whole new level and giving rise to the first global infodemic.
1 code implementation • 18 Aug 2020 • Van-Hoang Nguyen, Kazunari Sugiyama, Preslav Nakov, Min-Yen Kan
In particular, FANG yields significant improvements for the task of fake news detection, and it is robust in the case of limited training data.
no code implementations • 10 Aug 2020 • Preslav Nakov
Given the recent proliferation of disinformation online, there has been also growing research interest in automatically debunking rumors, false claims, and "fake news."
1 code implementation • SEMEVAL 2020 • Anton Chernyavskiy, Dmitry Ilvovsky, Preslav Nakov
We describe our system for SemEval-2020 Task 11 on Detection of Propaganda Techniques in News Articles.
1 code implementation • 18 Jul 2020 • Guillem Ramírez, Rumen Dangovski, Preslav Nakov, Marin Soljačić
We believe that our rethinking of the Wasserstein-Procrustes problem could enable further research, thus helping to develop better algorithms for aligning word embeddings across languages.
1 code implementation • 15 Jul 2020 • Firoj Alam, Fahim Dalvi, Shaden Shaar, Nadir Durrani, Hamdy Mubarak, Alex Nikolov, Giovanni Da San Martino, Ahmed Abdelali, Hassan Sajjad, Kareem Darwish, Preslav Nakov
With the outbreak of the COVID-19 pandemic, people turned to social media to read and to share timely information including statistics, warnings, advice, and inspirational stories.
3 code implementations • 15 Jul 2020 • Alberto Barron-Cedeno, Tamer Elsayed, Preslav Nakov, Giovanni Da San Martino, Maram Hasanain, Reem Suwaileh, Fatima Haouari, Nikolay Babulkov, Bayan Hamdan, Alex Nikolov, Shaden Shaar, Zien Sheikh Ali
The first four tasks compose the full pipeline of claim verification in social media: Task 1 on check-worthiness estimation, Task 2 on retrieving previously fact-checked claims, Task 3 on evidence retrieval, and Task 4 on claim verification.
no code implementations • 15 Jul 2020 • Giovanni Da San Martino, Stefano Cresci, Alberto Barron-Cedeno, Seunghak Yu, Roberto Di Pietro, Preslav Nakov
Propaganda campaigns aim at influencing people's mindset with the purpose of advancing a specific agenda.
no code implementations • ACL 2020 • Peter Stefanov, Kareem Darwish, Atanas Atanasov, Preslav Nakov
Discovering the stances of media outlets and influential people on current, debatable topics is important for social statisticians and policy makers.
no code implementations • SEMEVAL 2020 • Marcos Zampieri, Preslav Nakov, Sara Rosenthal, Pepa Atanasova, Georgi Karadzhov, Hamdy Mubarak, Leon Derczynski, Zeses Pitenis, Çağrı Çöltekin
We present the results and main findings of SemEval-2020 Task 12 on Multilingual Offensive Language Identification in Social Media (OffensEval 2020).
2 code implementations • ACL 2020 • Shaden Shaar, Giovanni Da San Martino, Nikolay Babulkov, Preslav Nakov
Interestingly, despite the importance of the task, it has been largely ignored by the research community so far.
no code implementations • ACL 2020 • Giovanni Da San Martino, Shaden Shaar, Yifan Zhang, Seunghak Yu, Alberto Barrón-Cedeño, Preslav Nakov
However, little attention has been paid to the specific rhetorical and psychological techniques used to convey propaganda messages.
1 code implementation • ACL 2020 • Ramy Baly, Georgi Karadzhov, Jisun An, Haewoon Kwak, Yoan Dinkov, Ahmed Ali, James Glass, Preslav Nakov
Alternatively, we can profile entire news outlets and look for those that are likely to publish fake or biased content.
no code implementations • 30 Apr 2020 • Momchil Hardalov, Ivan Koychev, Preslav Nakov
Recently, the advances in pre-trained language models, namely contextualized models such as ELMo and BERT have revolutionized the field by tapping the potential of training very large models with just a few steps of fine-tuning on a task-specific dataset.
2 code implementations • Findings (EMNLP) 2021 • Firoj Alam, Shaden Shaar, Fahim Dalvi, Hassan Sajjad, Alex Nikolov, Hamdy Mubarak, Giovanni Da San Martino, Ahmed Abdelali, Nadir Durrani, Kareem Darwish, Abdulaziz Al-Homaid, Wajdi Zaghouani, Tommaso Caselli, Gijs Danoe, Friso Stolk, Britt Bruntink, Preslav Nakov
With the emergence of the COVID-19 pandemic, the political and the medical aspects of disinformation merged as the problem got elevated to a whole new level to become the first global infodemic.
no code implementations • Findings (ACL) 2021 • Sara Rosenthal, Pepa Atanasova, Georgi Karadzhov, Marcos Zampieri, Preslav Nakov
The widespread use of offensive content in social media has led to an abundance of research in detecting language such as hate speech, cyberbullying, and cyber-aggression.
4 code implementations • 8 Apr 2020 • Hassan Sajjad, Fahim Dalvi, Nadir Durrani, Preslav Nakov
Transformer-based NLP models are trained using hundreds of millions or even billions of parameters, limiting their applicability in computationally constrained environments.
no code implementations • 27 Feb 2020 • Prakhar Ganesh, Yao Chen, Xin Lou, Mohammad Ali Khan, Yin Yang, Hassan Sajjad, Preslav Nakov, Deming Chen, Marianne Winslett
Pre-trained Transformer-based models have achieved state-of-the-art performance for various Natural Language Processing (NLP) tasks.
3 code implementations • 21 Jan 2020 • Alberto Barron-Cedeno, Tamer Elsayed, Preslav Nakov, Giovanni Da San Martino, Maram Hasanain, Reem Suwaileh, Fatima Haouari
Finally, the lab offers a fifth task that asks to predict the check-worthiness of the claims made in English political debates and speeches.
no code implementations • 14 Dec 2019 • Pepa Gencheva, Ivan Koychev, Lluís Màrquez, Alberto Barrón-Cedeño, Preslav Nakov
In the context of investigative journalism, we address the problem of automatically identifying which claims in a given document are most worthy and should be prioritized for fact-checking.
no code implementations • 14 Dec 2019 • Alberto Barrón-Cedeño, Giovanni Da San Martino, Israa Jaradat, Preslav Nakov
We present proppy, the first publicly available real-world, real-time propaganda detection system for online news, which aims at raising awareness, thus potentially limiting the impact of propaganda and helping fight disinformation.
no code implementations • SEMEVAL 2013 • Preslav Nakov, Zornitsa Kozareva, Alan Ritter, Sara Rosenthal, Veselin Stoyanov, Theresa Wilson
To address this issue, we have proposed SemEval-2013 Task 2: Sentiment Analysis in Twitter, which included two subtasks: A, an expression-level subtask, and B, a message-level subtask.