1 code implementation • EMNLP (WNUT) 2020 • Tanvirul Alam, Akib Khan, Firoj Alam
In this work, we explore different transformer-based models and propose an augmentation strategy for this task, focusing on high-resource (English) and low-resource (Bangla) languages.
1 code implementation • RANLP 2021 • Preslav Nakov, Firoj Alam, Shaden Shaar, Giovanni Da San Martino, Yifan Zhang
With the emergence of the COVID-19 pandemic, the political and the medical aspects of disinformation merged as the problem got elevated to a whole new level to become the first global infodemic.
no code implementations • 20 Oct 2024 • Mohamed Bayan Kmainasi, Ali Ezzat Shahroor, Maram Hasanain, Sahinur Rahman Laskar, Naeemul Hassan, Firoj Alam
To address this gap, this study focuses on developing a specialized LLM, LlamaLens, for analyzing news and social media content in a multilingual context.
no code implementations • 17 Sep 2024 • Basel Mousi, Nadir Durrani, Fatema Ahmad, Md. Arid Hasan, Maram Hasanain, Tameem Kabbani, Fahim Dalvi, Shammur Absar Chowdhury, Firoj Alam
Arabic, with its rich diversity of dialects, remains significantly underrepresented in Large Language Models, particularly in dialectal variations.
no code implementations • 11 Sep 2024 • Mohamed Bayan Kmainasi, Rakif Khan, Ali Ezzat Shahroor, Boushra Bendou, Maram Hasanain, Firoj Alam
Since prompts play a crucial role in understanding their capabilities, the language used for prompts remains an important research question.
no code implementations • 11 Sep 2024 • Firoj Alam, Md. Rafiul Biswas, Uzair Shah, Wajdi Zaghouani, Georgios Mikros
In the current literature, there have been efforts to individually detect such content in memes.
no code implementations • 13 Jul 2024 • Md. Arid Hasan, Maram Hasanain, Fatema Ahmad, Sahinur Rahman Laskar, Sunaya Upadhyay, Vrunda N Sukhadia, Mucahid Kutlu, Shammur Absar Chowdhury, Firoj Alam
Natural Question Answering (QA) datasets play a crucial role in evaluating the capabilities of large language models (LLMs), ensuring their effectiveness in real-world applications.
no code implementations • 5 Jul 2024 • Maram Hasanain, Md. Arid Hasan, Fatema Ahmed, Reem Suwaileh, Md. Rafiul Biswas, Wajdi Zaghouani, Firoj Alam
We further provide a brief overview of the participating systems.
no code implementations • 8 Jun 2024 • Reem Suwaileh, Maram Hasanain, Fatema Hubail, Wajdi Zaghouani, Firoj Alam
In this study, we present the first large dataset for subjectivity detection in Arabic, consisting of ~3.6K manually annotated sentences and GPT-4o-based explanations.
no code implementations • 6 Jun 2024 • Firoj Alam, Abul Hasnat, Fatema Ahmed, Md Arid Hasan, Maram Hasanain
Identification of such misleading and persuasive multimodal content has become more important among various stakeholders, including social media platforms, policymakers, and the broader society as they often cause harm to individuals, organizations, and/or society.
no code implementations • 27 Feb 2024 • Maram Hasanain, Fatema Ahmed, Firoj Alam
Finally, we evaluate GPT-4 on a dataset consisting of six other languages for span detection, and results suggest that the model struggles with the task across languages.
1 code implementation • 16 Nov 2023 • Maram Hasanain, Fatema Ahmad, Firoj Alam
Finally, we examine the effectiveness of labels provided by GPT-4 in training smaller language models for the task.
no code implementations • 6 Nov 2023 • Maram Hasanain, Firoj Alam, Hamdy Mubarak, Samir Abdaljalil, Wajdi Zaghouani, Preslav Nakov, Giovanni Da San Martino, Abed Alhakim Freihat
We present an overview of the ArAIEval shared task, organized as part of the first ArabicNLP 2023 conference co-located with EMNLP 2023.
no code implementations • 6 Nov 2023 • Yunze Xiao, Firoj Alam
The spread of disinformation and propagandistic content poses a threat to societal harmony, undermining informed decision-making and trust in reliable sources.
1 code implementation • 6 Nov 2023 • Rabindra Nath Nandi, Mehadi Hasan Menon, Tareq Al Muntasir, Sagor Sarker, Quazi Sarwar Muhtaseem, Md. Tariqul Islam, Shammur Absar Chowdhury, Firoj Alam
Our results demonstrate the efficacy of the model trained on pseudo-label data for the designed test set along with publicly available Bangla datasets.
no code implementations • 24 Oct 2023 • Md. Arid Hasan, Firoj Alam, Anika Anjum, Shudipta Das, Afiyat Anjum
Additionally, we provide a brief overview of the systems submitted by the participants.
1 code implementation • 21 Aug 2023 • Md. Arid Hasan, Shudipta Das, Afiyat Anjum, Firoj Alam, Anika Anjum, Avijit Sarker, Sheak Rashed Haider Noori
The rapid expansion of the digital world has propelled sentiment analysis into a critical tool across diverse sectors such as marketing, politics, customer service, and healthcare.
1 code implementation • 9 Aug 2023 • Fahim Dalvi, Maram Hasanain, Sabri Boughorbel, Basel Mousi, Samir Abdaljalil, Nizi Nazar, Ahmed Abdelali, Shammur Absar Chowdhury, Hamdy Mubarak, Ahmed Ali, Majd Hawasly, Nadir Durrani, Firoj Alam
In this study, we introduce the LLMeBench framework, which can be seamlessly customized to evaluate LLMs for any NLP task, regardless of language.
no code implementations • 24 May 2023 • Ahmed Abdelali, Hamdy Mubarak, Shammur Absar Chowdhury, Maram Hasanain, Basel Mousi, Sabri Boughorbel, Yassine El Kheir, Daniel Izham, Fahim Dalvi, Majd Hawasly, Nizi Nazar, Yousseif Elshahawy, Ahmed Ali, Nadir Durrani, Natasa Milic-Frayling, Firoj Alam
Our findings provide valuable insights into the applicability of LLMs for Arabic NLP and speech processing tasks.
no code implementations • 5 May 2023 • Hamdy Mubarak, Samir Abdaljalil, Azza Nassar, Firoj Alam
Social media platforms empower us in several ways, from information dissemination to consumption.
no code implementations • 5 May 2023 • Maram Hasanain, Ahmed Oumar El-Shangiti, Rabindra Nath Nandi, Preslav Nakov, Firoj Alam
This paper describes the system we submitted to this task.
no code implementations • 18 Nov 2022 • Firoj Alam, Hamdy Mubarak, Wajdi Zaghouani, Giovanni Da San Martino, Preslav Nakov
Thus, there has been a lot of recent research on automatic detection of propaganda techniques in text as well as in memes.
no code implementations • 12 Nov 2022 • Firoj Alam, Fahim Dalvi, Nadir Durrani, Hassan Sajjad, Abdul Rafae Khan, Jia Xu
We use an unsupervised method to discover concepts learned in these models and enable a graphical interface for humans to generate explanations for the concepts.
no code implementations • 23 Oct 2022 • Nadir Durrani, Hassan Sajjad, Fahim Dalvi, Firoj Alam
We study the evolution of latent space in fine-tuned NLP models.
no code implementations • 15 Jul 2022 • Prerona Tarannum, Firoj Alam, Md. Arid Hasan, Sheak Rashed Haider Noori
In further experiments, our evaluation shows that transformer models (BERT-m and XLM-RoBERTa-base) outperform SVM and RF for Dutch and English, whereas a different pattern is observed for Spanish.
1 code implementation • NAACL 2022 • Hassan Sajjad, Nadir Durrani, Fahim Dalvi, Firoj Alam, Abdul Rafae Khan, Jia Xu
We propose a novel framework ConceptX, to analyze how latent concepts are encoded in representations learned within pre-trained language models.
no code implementations • ICLR 2022 • Fahim Dalvi, Abdul Rafae Khan, Firoj Alam, Nadir Durrani, Jia Xu, Hassan Sajjad
We address this limitation by discovering and analyzing latent concepts learned in neural network models in an unsupervised fashion and provide interpretations from the model's perspective.
no code implementations • DravidianLangTech (ACL) 2022 • Rabindra Nath Nandi, Firoj Alam, Preslav Nakov
The spread of fake news, propaganda, misinformation, disinformation, and harmful content online raised concerns among social media platforms, government agencies, policymakers, and society as a whole.
1 code implementation • 9 May 2022 • Shivam Sharma, Firoj Alam, Md. Shad Akhtar, Dimitar Dimitrov, Giovanni Da San Martino, Hamed Firooz, Alon Halevy, Fabrizio Silvestri, Preslav Nakov, Tanmoy Chakraborty
One interesting finding is that many types of harmful memes are not really studied, e.g., those featuring self-harm and extremism, partly due to the lack of suitable datasets.
1 code implementation • CONSTRAINT (ACL) 2022 • Rabindra Nath Nandi, Firoj Alam, Preslav Nakov
The content that is posted and shared online can be textual, visual, or a combination of both, e.g., in a meme.
no code implementations • 8 Mar 2022 • Preslav Nakov, Firoj Alam, Yifan Zhang, Animesh Prakash, Fahim Dalvi
Fighting the ongoing COVID-19 infodemic has been declared as one of the most important focus areas by the World Health Organization since the onset of the COVID-19 pandemic.
no code implementations • COLING (WNUT) 2022 • Hamdy Mubarak, Shammur Absar Chowdhury, Firoj Alam
Gender analysis of Twitter can reveal important socio-cultural differences between male and female users.
no code implementations • LREC 2022 • Hamdy Mubarak, Sabit Hassan, Shammur Absar Chowdhury, Firoj Alam
We studied the data for individual types of tweets and temporal changes in stance towards vaccines.
no code implementations • 26 Oct 2021 • Izzat Alsmadi, Kashif Ahmad, Mahmoud Nazzal, Firoj Alam, Ala Al-Fuqaha, Abdallah Khreishah, Abdulelah Algosaibi
These vulnerabilities allow adversaries to launch a diversified set of adversarial attacks on these algorithms in different applications of social media text processing.
no code implementations • 23 Sep 2021 • Preslav Nakov, Giovanni Da San Martino, Tamer Elsayed, Alberto Barrón-Cedeño, Rubén Míguez, Shaden Shaar, Firoj Alam, Fatima Haouari, Maram Hasanain, Watheq Mansour, Bayan Hamdan, Zien Sheikh Ali, Nikolay Babulkov, Alex Nikolov, Gautam Kishore Shahi, Julia Maria Struß, Thomas Mandl, Mucahid Kutlu, Yavuz Selim Kartal
We describe the fourth edition of the CheckThat!
no code implementations • NAACL (NLP4IF) 2021 • Shaden Shaar, Firoj Alam, Giovanni Da San Martino, Alex Nikolov, Wajdi Zaghouani, Preslav Nakov, Anna Feldman
Here, we present the tasks, analyze the results, and discuss the system submissions and the methods they used.
no code implementations • RANLP 2021 • Preslav Nakov, Firoj Alam, Shaden Shaar, Giovanni Da San Martino, Yifan Zhang
While COVID-19 vaccines are finally becoming widely available, a second pandemic that revolves around the circulation of anti-vaxxer fake news may hinder efforts to recover from the first one.
1 code implementation • 14 Sep 2021 • Shaden Shaar, Nikola Georgiev, Firoj Alam, Giovanni Da San Martino, Aisha Mohamed, Preslav Nakov
The output is a re-ranked list of the document sentences, so that those that can be verified are ranked as high as possible, together with corresponding evidence.
1 code implementation • 29 Aug 2021 • Firoj Alam, Tanvirul Alam, Md. Arid Hasan, Abul Hasnat, Muhammad Imran, Ferda Ofli
This is the first dataset of its kind: social media images, disaster response, and multi-task learning research.
1 code implementation • ACL 2021 • Dimitar Dimitrov, Bishr Bin Ali, Shaden Shaar, Firoj Alam, Fabrizio Silvestri, Hamed Firooz, Preslav Nakov, Giovanni Da San Martino
We further create and release a new corpus of 950 memes, carefully annotated with 22 propaganda techniques, which can appear in the text, in the image, or in both.
2 code implementations • 8 Jul 2021 • Firoj Alam, Arid Hasan, Tanvirul Alam, Akib Khan, Janntatul Tajrin, Naira Khan, Shammur Absar Chowdhury
In this study, we first provide a review of Bangla NLP tasks, resources, and tools available to the research community; we benchmark datasets collected from various platforms for nine NLP tasks using current state-of-the-art algorithms (i.e., transformer-based models).
1 code implementation • SEMEVAL 2021 • Dimitar Dimitrov, Bishr Bin Ali, Shaden Shaar, Firoj Alam, Fabrizio Silvestri, Hamed Firooz, Preslav Nakov, Giovanni Da San Martino
We describe SemEval-2021 task 6 on Detection of Persuasion Techniques in Texts and Images: the data, the annotation guidelines, the evaluation setup, the results, and the participating systems.
1 code implementation • Findings (NAACL) 2022 • Shaden Shaar, Firoj Alam, Giovanni Da San Martino, Preslav Nakov
Recent years have seen the proliferation of disinformation and fake news online.
no code implementations • COLING 2022 • Hassan Sajjad, Firoj Alam, Fahim Dalvi, Nadir Durrani
However, post-processing for contextualized embeddings is an under-studied problem.
no code implementations • 9 Apr 2021 • Firoj Alam, Tanvirul Alam, Muhammad Imran, Ferda Ofli
Images shared on social media help crisis managers gain situational awareness and assess incurred damages, among other response tasks.
no code implementations • 7 Apr 2021 • Firoj Alam, Umair Qazi, Muhammad Imran, Ferda Ofli
Social networks are widely used for information consumption and dissemination, especially during time-critical events such as natural disasters.
no code implementations • 13 Mar 2021 • Preslav Nakov, David Corney, Maram Hasanain, Firoj Alam, Tamer Elsayed, Alberto Barrón-Cedeño, Paolo Papotti, Shaden Shaar, Giovanni Da San Martino
The reporting and the analysis of current events around the globe have expanded from professional, editor-led journalism all the way to citizen journalism.
no code implementations • COLING 2022 • Firoj Alam, Stefano Cresci, Tanmoy Chakraborty, Fabrizio Silvestri, Dimiter Dimitrov, Giovanni Da San Martino, Shaden Shaar, Hamed Firooz, Preslav Nakov
As a result, researchers started leveraging different modalities and combinations thereof to tackle online multimodal offensive content.
no code implementations • 1 Mar 2021 • Kashif Ahmad, Firoj Alam, Junaid Qadir, Basheer Qolomany, Imran Khan, Talhat Khan, Muhammad Suleman, Naina Said, Syed Zohaib Hassan, Asma Gul, Ala Al-Fuqaha
In this work, we propose a pipeline starting from manual annotation via a crowd-sourcing study and concluding with the development and training of AI models for automatic sentiment analysis of users' reviews.
no code implementations • 30 Nov 2020 • Firoj Alam, Zohaib Hassan, Kashif Ahmad, Asma Gul, Michael Reiglar, Nicola Conci, Ala Al-Fuqaha
The paper presents our proposed solutions for the MediaEval 2020 Flood-Related Multimedia Task, which aims to analyze and detect flooding events in multimedia content shared over Twitter.
1 code implementation • 19 Nov 2020 • Md. Arid Hasan, Jannatul Tajrin, Shammur Absar Chowdhury, Firoj Alam
In this study, we explore several publicly available sentiment-labeled datasets and design classifiers using both classical and deep learning algorithms.
no code implementations • 17 Nov 2020 • Firoj Alam, Ferda Ofli, Muhammad Imran, Tanvirul Alam, Umair Qazi
In this study, we propose new datasets for disaster type detection, informativeness classification, and damage severity assessment.
1 code implementation • 9 Nov 2020 • Tanvirul Alam, Akib Khan, Firoj Alam
Text classification has been one of the earliest problems in NLP.
1 code implementation • 15 Jul 2020 • Firoj Alam, Fahim Dalvi, Shaden Shaar, Nadir Durrani, Hamdy Mubarak, Alex Nikolov, Giovanni Da San Martino, Ahmed Abdelali, Hassan Sajjad, Kareem Darwish, Preslav Nakov
With the outbreak of the COVID-19 pandemic, people turned to social media to read and to share timely information including statistics, warnings, advice, and inspirational stories.
2 code implementations • Findings (EMNLP) 2021 • Firoj Alam, Shaden Shaar, Fahim Dalvi, Hassan Sajjad, Alex Nikolov, Hamdy Mubarak, Giovanni Da San Martino, Ahmed Abdelali, Nadir Durrani, Kareem Darwish, Abdulaziz Al-Homaid, Wajdi Zaghouani, Tommaso Caselli, Gijs Danoe, Friso Stolk, Britt Bruntink, Preslav Nakov
With the emergence of the COVID-19 pandemic, the political and the medical aspects of disinformation merged as the problem got elevated to a whole new level to become the first global infodemic.
no code implementations • 14 Apr 2020 • Muhammad Imran, Firoj Alam, Umair Qazi, Steve Peterson, Ferda Ofli
Rapid damage assessment is one of the core tasks that response organizations perform at the onset of a disaster to understand the scale of damage to infrastructures such as roads, bridges, and buildings.
no code implementations • 14 Apr 2020 • Firoj Alam, Hassan Sajjad, Muhammad Imran, Ferda Ofli
Time-critical analysis of social media streams is important for humanitarian organizations for planning rapid response during disasters.
1 code implementation • 14 Apr 2020 • Ferda Ofli, Firoj Alam, Muhammad Imran
Multimedia content in social media platforms provides significant information during disaster events.
Ranked #1 on Disaster Response on CrisisMMD
1 code implementation • ACL 2018 • Firoj Alam, Shafiq Joty, Muhammad Imran
In such scenarios, a DNN model can leverage labeled and unlabeled data from a related domain, but it has to deal with the shift in data distributions between the source and the target domains.
no code implementations • 2 May 2018 • Firoj Alam, Shafiq Joty, Muhammad Imran
During time-critical situations such as natural disasters, rapid classification of data posted on social networks by affected people is useful for humanitarian organizations to gain situational awareness and to plan response efforts.
2 code implementations • 2 May 2018 • Firoj Alam, Ferda Ofli, Muhammad Imran
Despite extensive research that mainly focuses on textual content to extract useful information, limited work has focused on the use of imagery content or the combination of both content types.
Social and Information Networks • Computers and Society
no code implementations • 13 May 2017 • Firoj Alam, Morena Danieli, Giuseppe Riccardi
The automatic classification system was evaluated on call center conversations where it showed significantly better performance than the baseline.
no code implementations • 9 Apr 2017 • Dat Tien Nguyen, Firoj Alam, Ferda Ofli, Muhammad Imran
The extensive use of social media platforms, especially during disasters, creates unique opportunities for humanitarian organizations to gain situational awareness and launch relief operations accordingly.
no code implementations • WS 2016 • Firoj Alam, Fabio Celli, Evgeny A. Stepanov, Arindam Ghosh, Giuseppe Riccardi
In this paper, we address the issue of automatic prediction of readers' mood from newspaper articles and comments.
no code implementations • COLING 2016 • Firoj Alam, Shammur Absar Chowdhury, Morena Danieli, Giuseppe Riccardi
In this paper, we aim to investigate the coordination of interlocutors' behavior in different emotional segments.
no code implementations • LREC 2016 • Fabio Celli, Giuseppe Riccardi, Firoj Alam
In this paper, we present a corpus of news blog conversations in Italian annotated with gold standard agreement/disagreement relations at message and sentence levels.
no code implementations • Workshop on Speech, Language and Audio in Multimedia (SLAM 2014) 2014 • Shammur Absar Chowdhury, Giuseppe Riccardi, Firoj Alam
Then, we used unsupervised clustering to find the distinct and well-separated clusters in terms of acoustic and lexical features.