Search Results for author: Farig Sadeque

Found 18 papers, 5 papers with code

BanglaBait: Semi-Supervised Adversarial Approach for Clickbait Detection on Bangla Clickbait Dataset

1 code implementation 10 Nov 2023 MD. Motahar Mahtab, Monirul Haque, Mehedi Hasan, Farig Sadeque

We expect that this dataset and the detailed analysis and comparison of these clickbait detection models will provide a fundamental basis for future research into detecting clickbait titles in Bengali articles.

Clickbait Detection

Abugida Normalizer and Parser for Unicode texts

1 code implementation 11 May 2023 Nazmuddoha Ansary, Quazi Adibur Rahman Adib, Tahsin Reasat, Sazia Mehnaz, Asif Shahriyar Sushmit, Ahmed Imtiaz Humayun, Mohammad Mamun Or Rashid, Farig Sadeque

This paper proposes two libraries to address common and uncommon issues with Unicode-based writing schemes for Indic languages.

Language Modelling

BaDLAD: A Large Multi-Domain Bengali Document Layout Analysis Dataset

1 code implementation 9 Mar 2023 Md. Istiak Hossain Shihab, Md. Rakibul Hasan, Mahfuzur Rahman Emon, Syed Mobassir Hossen, MD. Nazmuddoha Ansary, Intesur Ahmed, Fazle Rabbi Rakib, Shahriar Elahi Dhruvo, Souhardya Saha Dip, Akib Hasan Pavel, Marsia Haque Meghla, Md. Rezwanul Haque, Sayma Sultana Chowdhury, Farig Sadeque, Tahsin Reasat, Ahmed Imtiaz Humayun, Asif Shahriyar Sushmit

While strides have been made in deep learning based Bengali Optical Character Recognition (OCR) in the past decade, the absence of large Document Layout Analysis (DLA) datasets has hindered the application of OCR in document transcription, e.g., transcribing historical documents and newspapers.

Benchmarking Document Layout Analysis +2

TEAM-Atreides at SemEval-2022 Task 11: On leveraging data augmentation and ensemble to recognize complex Named Entities in Bangla

no code implementations SemEval (NAACL) 2022 Nazia Tasnim, Md. Istiak Hossain Shihab, Asif Shahriyar Sushmit, Steven Bethard, Farig Sadeque

Many areas, such as the biological and healthcare domain, artistic works, and organization names, have nested, overlapping, discontinuous entity mentions that may even be syntactically or semantically ambiguous in practice.

Data Augmentation

Predicting engagement in online social networks: Challenges and opportunities

no code implementations 11 Jul 2019 Farig Sadeque, Steven Bethard

We classified these works based on our task definitions, and explored the machine learning models that have been used for any kind of participation prediction.

BIG-bench Machine Learning Domain Adaptation +1

Incivility Detection in Online Comments

no code implementations SEMEVAL 2019 Farig Sadeque, Stephen Rains, Yotam Shmargad, Kate Kenski, Kevin Coe, Steven Bethard

Incivility in public discourse has been a major concern in recent times as it can affect the quality and tenacity of the discourse negatively.

regression
