Search Results for author: Firoj Alam

Found 67 papers, 22 papers with code

Punctuation Restoration using Transformer Models for High-and Low-Resource Languages

1 code implementation EMNLP (WNUT) 2020 Tanvirul Alam, Akib Khan, Firoj Alam

In this work, we explore different transformer based models and propose an augmentation strategy for this task, focusing on high-resource (English) and low-resource (Bangla) languages.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

COVID-19 in Bulgarian Social Media: Factuality, Harmfulness, Propaganda, and Framing

1 code implementation RANLP 2021 Preslav Nakov, Firoj Alam, Shaden Shaar, Giovanni Da San Martino, Yifan Zhang

With the emergence of the COVID-19 pandemic, the political and the medical aspects of disinformation merged as the problem got elevated to a whole new level to become the first global infodemic.

LlamaLens: Specialized Multilingual LLM for Analyzing News and Social Media Content

no code implementations20 Oct 2024 Mohamed Bayan Kmainasi, Ali Ezzat Shahroor, Maram Hasanain, Sahinur Rahman Laskar, Naeemul Hassan, Firoj Alam

To address this gap, this study focuses on developing a specialized LLM, LlamaLens, for analyzing news and social media content in a multilingual context.

Specificity

AraDiCE: Benchmarks for Dialectal and Cultural Capabilities in LLMs

no code implementations17 Sep 2024 Basel Mousi, Nadir Durrani, Fatema Ahmad, Md. Arid Hasan, Maram Hasanain, Tameem Kabbani, Fahim Dalvi, Shammur Absar Chowdhury, Firoj Alam

Arabic, with its rich diversity of dialects, remains significantly underrepresented in Large Language Models, particularly in dialectal variations.

Dialect Identification Diversity +2

Native vs Non-Native Language Prompting: A Comparative Analysis

no code implementations11 Sep 2024 Mohamed Bayan Kmainasi, Rakif Khan, Ali Ezzat Shahroor, Boushra Bendou, Maram Hasanain, Firoj Alam

Since prompts play a crucial role in understanding their capabilities, the language used for prompts remains an important research question.

Propaganda to Hate: A Multimodal Analysis of Arabic Memes with Multi-Agent LLMs

no code implementations11 Sep 2024 Firoj Alam, Md. Rafiul Biswas, Uzair Shah, Wajdi Zaghouani, Georgios Mikros

In the current literature, there have been efforts to individually detect such content in memes.

NativQA: Multilingual Culturally-Aligned Natural Query for LLMs

no code implementations13 Jul 2024 Md. Arid Hasan, Maram Hasanain, Fatema Ahmad, Sahinur Rahman Laskar, Sunaya Upadhyay, Vrunda N Sukhadia, Mucahid Kutlu, Shammur Absar Chowdhury, Firoj Alam

Natural Question Answering (QA) datasets play a crucial role in evaluating the capabilities of large language models (LLMs), ensuring their effectiveness in real-world applications.

Benchmarking Question Answering

ThatiAR: Subjectivity Detection in Arabic News Sentences

no code implementations8 Jun 2024 Reem Suwaileh, Maram Hasanain, Fatema Hubail, Wajdi Zaghouani, Firoj Alam

In this study, we present the first large dataset for subjectivity detection in Arabic, consisting of ~3. 6K manually annotated sentences, and GPT-4o based explanation.

In-Context Learning Misinformation

ArMeme: Propagandistic Content in Arabic Memes

no code implementations6 Jun 2024 Firoj Alam, Abul Hasnat, Fatema Ahmed, Md Arid Hasan, Maram Hasanain

Identification of such misleading and persuasive multimodal content has become more important among various stakeholders, including social media platforms, policymakers, and the broader society as they often cause harm to individuals, organizations, and/or society.

Can GPT-4 Identify Propaganda? Annotation and Detection of Propaganda Spans in News Articles

no code implementations27 Feb 2024 Maram Hasanain, Fatema Ahmed, Firoj Alam

Finally, we evaluate GPT-4 on a dataset consisting of six other languages for span detection, and results suggest that the model struggles with the task across languages.

8k Propaganda detection

Large Language Models for Propaganda Span Annotation

1 code implementation16 Nov 2023 Maram Hasanain, Fatema Ahmad, Firoj Alam

Finally, we examine the effectiveness of labels provided by GPT-4 in training smaller language models for the task.

Propaganda detection

ArAIEval Shared Task: Persuasion Techniques and Disinformation Detection in Arabic Text

no code implementations6 Nov 2023 Maram Hasanain, Firoj Alam, Hamdy Mubarak, Samir Abdaljalil, Wajdi Zaghouani, Preslav Nakov, Giovanni Da San Martino, Abed Alhakim Freihat

We present an overview of the ArAIEval shared task, organized as part of the first ArabicNLP 2023 conference co-located with EMNLP 2023.

Nexus at ArAIEval Shared Task: Fine-Tuning Arabic Language Models for Propaganda and Disinformation Detection

no code implementations6 Nov 2023 Yunze Xiao, Firoj Alam

The spread of disinformation and propagandistic content poses a threat to societal harmony, undermining informed decision-making and trust in reliable sources.

Decision Making Few-Shot Learning

Zero- and Few-Shot Prompting with LLMs: A Comparative Study with Fine-tuned Models for Bangla Sentiment Analysis

1 code implementation21 Aug 2023 Md. Arid Hasan, Shudipta Das, Afiyat Anjum, Firoj Alam, Anika Anjum, Avijit Sarker, Sheak Rashed Haider Noori

The rapid expansion of the digital world has propelled sentiment analysis into a critical tool across diverse sectors such as marketing, politics, customer service, and healthcare.

In-Context Learning Marketing +1

Detecting and Reasoning of Deleted Tweets before they are Posted

no code implementations5 May 2023 Hamdy Mubarak, Samir Abdaljalil, Azza Nassar, Firoj Alam

Social media platforms empower us in several ways, from information dissemination to consumption.

Overview of the WANLP 2022 Shared Task on Propaganda Detection in Arabic

no code implementations18 Nov 2022 Firoj Alam, Hamdy Mubarak, Wajdi Zaghouani, Giovanni Da San Martino, Preslav Nakov

Thus, there has been a lot of recent research on automatic detection of propaganda techniques in text as well as in memes.

Propaganda detection

ConceptX: A Framework for Latent Concept Analysis

no code implementations12 Nov 2022 Firoj Alam, Fahim Dalvi, Nadir Durrani, Hassan Sajjad, Abdul Rafae Khan, Jia Xu

We use an unsupervised method to discover concepts learned in these models and enable a graphical interface for humans to generate explanations for the concepts.

Z-Index at CheckThat! Lab 2022: Check-Worthiness Identification on Tweet Text

no code implementations15 Jul 2022 Prerona Tarannum, Firoj Alam, Md. Arid Hasan, Sheak Rashed Haider Noori

In further experiments, our evaluation shows that transformer models (BERT-m and XLM-RoBERTa-base) outperform the SVM and RF in Dutch and English languages where a different scenario is observed for Spanish.

Fact Checking

Analyzing Encoded Concepts in Transformer Language Models

1 code implementation NAACL 2022 Hassan Sajjad, Nadir Durrani, Fahim Dalvi, Firoj Alam, Abdul Rafae Khan, Jia Xu

We propose a novel framework ConceptX, to analyze how latent concepts are encoded in representations learned within pre-trained language models.

Clustering

Discovering Latent Concepts Learned in BERT

no code implementations ICLR 2022 Fahim Dalvi, Abdul Rafae Khan, Firoj Alam, Nadir Durrani, Jia Xu, Hassan Sajjad

We address this limitation by discovering and analyzing latent concepts learned in neural network models in an unsupervised fashion and provide interpretations from the model's perspective.

Novel Concepts POS

TeamX@DravidianLangTech-ACL2022: A Comparative Analysis for Troll-Based Meme Classification

no code implementations DravidianLangTech (ACL) 2022 Rabindra Nath Nandi, Firoj Alam, Preslav Nakov

The spread of fake news, propaganda, misinformation, disinformation, and harmful content online raised concerns among social media platforms, government agencies, policymakers, and society as a whole.

Meme Classification Misinformation

Detecting and Understanding Harmful Memes: A Survey

1 code implementation9 May 2022 Shivam Sharma, Firoj Alam, Md. Shad Akhtar, Dimitar Dimitrov, Giovanni Da San Martino, Hamed Firooz, Alon Halevy, Fabrizio Silvestri, Preslav Nakov, Tanmoy Chakraborty

One interesting finding is that many types of harmful memes are not really studied, e. g., such featuring self-harm and extremism, partly due to the lack of suitable datasets.

Survey

Detecting the Role of an Entity in Harmful Memes: Techniques and Their Limitations

1 code implementation CONSTRAINT (ACL) 2022 Rabindra Nath Nandi, Firoj Alam, Preslav Nakov

The content that is posted and shared online can be textual, visual, or a combination of both, e. g., in a meme.

QCRI's COVID-19 Disinformation Detector: A System to Fight the COVID-19 Infodemic in Social Media

no code implementations8 Mar 2022 Preslav Nakov, Firoj Alam, Yifan Zhang, Animesh Prakash, Fahim Dalvi

Fighting the ongoing COVID-19 infodemic has been declared as one of the most important focus areas by the World Health Organization since the onset of the COVID-19 pandemic.

ArabGend: Gender Analysis and Inference on Arabic Twitter

no code implementations COLING (WNUT) 2022 Hamdy Mubarak, Shammur Absar Chowdhury, Firoj Alam

Gender analysis of Twitter can reveal important socio-cultural differences between male and female users.

A Second Pandemic? Analysis of Fake News About COVID-19 Vaccines in Qatar

no code implementations RANLP 2021 Preslav Nakov, Firoj Alam, Shaden Shaar, Giovanni Da San Martino, Yifan Zhang

While COVID-19 vaccines are finally becoming widely available, a second pandemic that revolves around the circulation of anti-vaxxer fake news may hinder efforts to recover from the first one.

Assisting the Human Fact-Checkers: Detecting All Previously Fact-Checked Claims in a Document

1 code implementation14 Sep 2021 Shaden Shaar, Nikola Georgiev, Firoj Alam, Giovanni Da San Martino, Aisha Mohamed, Preslav Nakov

The output is a re-ranked list of the document sentences, so that those that can be verified are ranked as high as possible, together with corresponding evidence.

Fact Checking Learning-To-Rank +2

MEDIC: A Multi-Task Learning Dataset for Disaster Image Classification

1 code implementation29 Aug 2021 Firoj Alam, Tanvirul Alam, Md. Arid Hasan, Abul Hasnat, Muhammad Imran, Ferda Ofli

This is the first dataset of its kind: social media images, disaster response, and multi-task learning research.

Classification Disaster Response +4

Detecting Propaganda Techniques in Memes

1 code implementation ACL 2021 Dimitar Dimitrov, Bishr Bin Ali, Shaden Shaar, Firoj Alam, Fabrizio Silvestri, Hamed Firooz, Preslav Nakov, Giovanni Da San Martino

We further create and release a new corpus of 950 memes, carefully annotated with 22 propaganda techniques, which can appear in the text, in the image, or in both.

A Review of Bangla Natural Language Processing Tasks and the Utility of Transformer Models

2 code implementations8 Jul 2021 Firoj Alam, Arid Hasan, Tanvirul Alam, Akib Khan, Janntatul Tajrin, Naira Khan, Shammur Absar Chowdhury

In this study, we first provide a review of Bangla NLP tasks, resources, and tools available to the research community; we benchmark datasets collected from various platforms for nine NLP tasks using current state-of-the-art algorithms (i. e., transformer-based models).

Survey

SemEval-2021 Task 6: Detection of Persuasion Techniques in Texts and Images

1 code implementation SEMEVAL 2021 Dimitar Dimitrov, Bishr Bin Ali, Shaden Shaar, Firoj Alam, Fabrizio Silvestri, Hamed Firooz, Preslav Nakov, Giovanni Da San Martino

We describe SemEval-2021 task 6 on Detection of Persuasion Techniques in Texts and Images: the data, the annotation guidelines, the evaluation setup, the results, and the participating systems.

Robust Training of Social Media Image Classification Models for Rapid Disaster Response

no code implementations9 Apr 2021 Firoj Alam, Tanvirul Alam, Muhammad Imran, Ferda Ofli

Images shared on social media help crisis managers gain situational awareness and assess incurred damages, among other response tasks.

Data Augmentation Disaster Response +3

HumAID: Human-Annotated Disaster Incidents Data from Twitter with Deep Learning Benchmarks

no code implementations7 Apr 2021 Firoj Alam, Umair Qazi, Muhammad Imran, Ferda Ofli

Social networks are widely used for information consumption and dissemination, especially during time-critical events such as natural disasters.

Decision Making

Automated Fact-Checking for Assisting Human Fact-Checkers

no code implementations13 Mar 2021 Preslav Nakov, David Corney, Maram Hasanain, Firoj Alam, Tamer Elsayed, Alberto Barrón-Cedeño, Paolo Papotti, Shaden Shaar, Giovanni Da San Martino

The reporting and the analysis of current events around the globe has expanded from professional, editor-lead journalism all the way to citizen journalism.

Fact Checking

Sentiment Analysis of Users' Reviews on COVID-19 Contact Tracing Apps with a Benchmark Dataset

no code implementations1 Mar 2021 Kashif Ahmad, Firoj Alam, Junaid Qadir, Basheer Qolomany, Imran Khan, Talhat Khan, Muhammad Suleman, Naina Said, Syed Zohaib Hassan, Asma Gul, Ala Al-Fuqaha

In this work, we propose a pipeline starting from manual annotation via a crowd-sourcing study and concluding on the development and training of AI models for automatic sentiment analysis of users' reviews.

Sentiment Analysis

Flood Detection via Twitter Streams using Textual and Visual Features

no code implementations30 Nov 2020 Firoj Alam, Zohaib Hassan, Kashif Ahmad, Asma Gul, Michael Reiglar, Nicola Conci, Ala Al-Fuqaha

The paper presents our proposed solutions for the MediaEval 2020 Flood-Related Multimedia Task, which aims to analyze and detect flooding events in multimedia content shared over Twitter.

Sentiment Classification in Bangla Textual Content: A Comparative Study

1 code implementation19 Nov 2020 Md. Arid Hasan, Jannatul Tajrin, Shammur Absar Chowdhury, Firoj Alam

In this study, we explore several publicly available sentiment labeled datasets and designed classifiers using both classical and deep learning algorithms.

Classification General Classification +2

Deep Learning Benchmarks and Datasets for Social Media Image Classification for Disaster Response

no code implementations17 Nov 2020 Firoj Alam, Ferda Ofli, Muhammad Imran, Tanvirul Alam, Umair Qazi

In this study, we propose new datasets for disaster type detection, and informativeness classification, and damage severity assessment.

Disaster Response General Classification +3

Fighting the COVID-19 Infodemic in Social Media: A Holistic Perspective and a Call to Arms

1 code implementation15 Jul 2020 Firoj Alam, Fahim Dalvi, Shaden Shaar, Nadir Durrani, Hamdy Mubarak, Alex Nikolov, Giovanni Da San Martino, Ahmed Abdelali, Hassan Sajjad, Kareem Darwish, Preslav Nakov

With the outbreak of the COVID-19 pandemic, people turned to social media to read and to share timely information including statistics, warnings, advice, and inspirational stories.

Misinformation

Rapid Damage Assessment Using Social Media Images by Combining Human and Machine Intelligence

no code implementations14 Apr 2020 Muhammad Imran, Firoj Alam, Umair Qazi, Steve Peterson, Ferda Ofli

Rapid damage assessment is one of the core tasks that response organizations perform at the onset of a disaster to understand the scale of damage to infrastructures such as roads, bridges, and buildings.

CrisisBench: Benchmarking Crisis-related Social Media Datasets for Humanitarian Information Processing

no code implementations14 Apr 2020 Firoj Alam, Hassan Sajjad, Muhammad Imran, Ferda Ofli

Time-critical analysis of social media streams is important for humanitarian organizations for planing rapid response during disasters.

Benchmarking General Classification +2

Domain Adaptation with Adversarial Training and Graph Embeddings

1 code implementation ACL 2018 Firoj Alam, Shafiq Joty, Muhammad Imran

In such scenarios, a DNN model can leverage labeled and unlabeled data from a related domain, but it has to deal with the shift in data distributions between the source and the target domains.

Domain Adaptation

Graph Based Semi-supervised Learning with Convolution Neural Networks to Classify Crisis Related Tweets

no code implementations2 May 2018 Firoj Alam, Shafiq Joty, Muhammad Imran

During time-critical situations such as natural disasters, rapid classification of data posted on social networks by affected people is useful for humanitarian organizations to gain situational awareness and to plan response efforts.

General Classification Humanitarian

CrisisMMD: Multimodal Twitter Datasets from Natural Disasters

2 code implementations2 May 2018 Firoj Alam, Ferda Ofli, Muhammad Imran

Despite extensive research that mainly focuses on textual content to extract useful information, limited work has focused on the use of imagery content or the combination of both content types.

Social and Information Networks Computers and Society

Annotating and Modeling Empathy in Spoken Conversations

no code implementations13 May 2017 Firoj Alam, Morena Danieli, Giuseppe Riccardi

The automatic classification system was evaluated on call center conversations where it showed significantly better performance than the baseline.

General Classification

Automatic Image Filtering on Social Networks Using Deep Learning and Perceptual Hashing During Crises

no code implementations9 Apr 2017 Dat Tien Nguyen, Firoj Alam, Ferda Ofli, Muhammad Imran

The extensive use of social media platforms, especially during disasters, creates unique opportunities for humanitarian organizations to gain situational awareness and launch relief operations accordingly.

Humanitarian

How Interlocutors Coordinate with each other within Emotional Segments?

no code implementations COLING 2016 Firoj Alam, Shammur Absar Chowdhury, Morena Danieli, Giuseppe Riccardi

In this paper, we aim to investigate the coordination of interlocutors behavior in different emotional segments.

Multilevel Annotation of Agreement and Disagreement in Italian News Blogs

no code implementations LREC 2016 Fabio Celli, Giuseppe Riccardi, Firoj Alam

In this paper, we present a corpus of news blog conversations in Italian annotated with gold standard agreement/disagreement relations at message and sentence levels.

Sentence

Cannot find the paper you are looking for? You can Submit a new open access paper.