1 code implementation • Findings (EMNLP) 2021 • Khondoker Ittehadul Islam, Sudipta Kar, Md Saiful Islam, Mohammad Ruhul Amin
In this paper, we propose an annotated sentiment analysis dataset made of informally written Bangla texts.
no code implementations • RANLP 2021 • Henry Gorelick, Biddut Sarker Bijoy, Syeda Jannatus Saba, Sudipta Kar, Md Saiful Islam, Mohammad Ruhul Amin
By analyzing the model parameters, we extracted the successful semantic relationships from books of 12 different genres.
1 code implementation • 30 Jan 2024 • Stepan Tytarenko, Mohammad Ruhul Amin
We show that a linear transformation of the text representation from any transformer model using the task-specific concept operator results in a projection onto the latent concept space, referred to as context attribution in this paper.
Ranked #1 on Sentiment Analysis on IMDb Movie Reviews
1 code implementation • 6 Nov 2023 • Sadia Afrin, Md. Shahad Mahmud Chowdhury, Md. Ekramul Islam, Faisal Ahamed Khan, Labib Imam Chowdhury, MD. Motahar Mahtab, Nazifa Nuha Chowdhury, Massud Forkan, Neelima Kundu, Hakim Arif, Mohammad Mamun Or Rashid, Mohammad Ruhul Amin, Nabeel Mohammed
Lemmatization holds significance in both natural language processing (NLP) and linguistics, as it effectively decreases data density and aids in comprehending contextual meaning.
no code implementations • 9 Jun 2023 • Md. Ekramul Islam, Labib Chowdhury, Faisal Ahamed Khan, Shazzad Hossain, Sourave Hossain, Mohammad Mamun Or Rashid, Nabeel Mohammed, Mohammad Ruhul Amin
This study introduces SentiGOLD, a Bangla multi-domain sentiment analysis dataset.
1 code implementation • LREC 2022 • Nauros Romim, Mosahed Ahmed, Md. Saiful Islam, Arnab Sen Sharma, Hriteshwar Talukder, Mohammad Ruhul Amin
In this paper, we identify the shortcomings of existing Bangla HS datasets and introduce a large manually labeled dataset BD-SHS that includes HS in different social contexts.
no code implementations • 3 Dec 2021 • Nauros Romim, Mosahed Ahmed, Md Saiful Islam, Arnab Sen Sharma, Hriteshwar Talukder, Mohammad Ruhul Amin
In this paper, we present HS-BAN, a binary class hate speech (HS) dataset in Bangla language consisting of more than 50, 000 labeled comments, including 40. 17% hate and rest are non hate speech.
no code implementations • Joint Conference on Lexical and Computational Semantics 2021 • Syeda Jannatus Saba, Biddut Sarker Bijoy, Henry Gorelick, Sabir Ismail, Md Saiful Islam, Mohammad Ruhul Amin
This article presents the study of semantic word associations using the word embedding of book content for a set of Roget{'}s thesaurus concepts for book success prediction.