Search Results for author: Farig Sadeque

Found 18 papers, 5 papers with code

Mapping Violence: Developing an Extensive Framework to Build a Bangla Sectarian Expression Dataset from Social Media Interactions

no code implementations • 17 Apr 2024 • Nazia Tasnim, Sujan Sen Gupta, Md. Istiak Hossain Shihab, Fatiha Islam Juee, Arunima Tahsin, Pritom Ghum, Kanij Fatema, Marshia Haque, Wasema Farzana, Prionti Nasir, Ashique KhudaBukhsh, Farig Sadeque, Asif Sushmit

Communal violence in online forums has become extremely prevalent in South Asia, where many communities of different cultures coexist and share resources.

Benchmarking

IPA Transcription of Bengali Texts

no code implementations • 29 Mar 2024 • Kanij Fatema, Fazle Dawood Haider, Nirzona Ferdousi Turpa, Tanveer Azmal, Sourav Ahmed, Navid Hasan, Mohammad Akhlaqur Rahman, Biplab Kumar Sarkar, Afrar Jahin, Md. Rezuwan Hassan, Md Foriduzzaman Zihad, Rubayet Sabbir Faruque, Asif Sushmit, Mashrur Imtiaz, Farig Sadeque, Syed Shahrier Rahman

The International Phonetic Alphabet (IPA) serves to systematize phonemes in language, enabling precise textual representation of pronunciation.

Gazetteer-Enhanced Bangla Named Entity Recognition with BanglaBERT Semantic Embeddings K-Means-Infused CRF Model

1 code implementation • 30 Jan 2024 • Niloy Farhan, Saman Sarker Joy, Tafseer Binte Mannan, Farig Sadeque

In this research, we explored the existing state of research in Bangla Named Entity Recognition.

Named Entity Recognition +1

Involution Fused ConvNet for Classifying Eye-Tracking Patterns of Children with Autism Spectrum Disorder

no code implementations • 7 Jan 2024 • Md. Farhadul Islam, Meem Arafat Manab, Joyanta Jyoti Mondal, Sarah Zabeen, Fardin Bin Rahman, Md. Zahidul Hasan, Farig Sadeque, Jannatun Noor

Our proposed model takes a simple yet effective approach, which makes it easier to apply in real-life settings.

BanglaBait: Semi-Supervised Adversarial Approach for Clickbait Detection on Bangla Clickbait Dataset

1 code implementation • 10 Nov 2023 • MD. Motahar Mahtab, Monirul Haque, Mehedi Hasan, Farig Sadeque

We expect that this dataset and the detailed analysis and comparison of these clickbait detection models will provide a fundamental basis for future research into detecting clickbait titles in Bengali articles.

Clickbait Detection

bbOCR: An Open-source Multi-domain OCR Pipeline for Bengali Documents

1 code implementation • 21 Aug 2023 • Imam Mohammad Zulkarnain, Shayekh Bin Islam, Md. Zami Al Zunaed Farabe, Md. Mehedi Hasan Shawon, Jawaril Munshad Abedin, Beig Rajibul Hasan, Marsia Haque, Istiak Shihab, Syed Mobassir, MD. Nazmuddoha Ansary, Asif Sushmit, Farig Sadeque

We present extensive component-level and system-level evaluation: both use a novel diversified evaluation dataset and comprehensive evaluation metrics.

Optical Character Recognition (OCR)

OOD-Speech: A Large Bengali Speech Recognition Dataset for Out-of-Distribution Benchmarking

no code implementations • 15 May 2023 • Fazle Rabbi Rakib, Souhardya Saha Dip, Samiul Alam, Nazia Tasnim, Md. Istiak Hossain Shihab, MD. Nazmuddoha Ansary, Syed Mobassir Hossen, Marsia Haque Meghla, Mamunur Mamun, Farig Sadeque, Sayma Sultana Chowdhury, Tahsin Reasat, Asif Sushmit, Ahmed Imtiaz Humayun

Our test dataset comprises 23.03 hours of speech collected and manually annotated from 17 different sources, e.g., Bengali TV drama, Audiobook, Talk show, Online class, and Islamic sermons to name a few.

Automatic Speech Recognition (ASR) +2

Abugida Normalizer and Parser for Unicode texts

1 code implementation • 11 May 2023 • Nazmuddoha Ansary, Quazi Adibur Rahman Adib, Tahsin Reasat, Sazia Mehnaz, Asif Shahriyar Sushmit, Ahmed Imtiaz Humayun, Mohammad Mamun Or Rashid, Farig Sadeque

This paper proposes two libraries to address common and uncommon issues with Unicode-based writing schemes for Indic languages.

Language Modelling

BaDLAD: A Large Multi-Domain Bengali Document Layout Analysis Dataset

1 code implementation • 9 Mar 2023 • Md. Istiak Hossain Shihab, Md. Rakibul Hasan, Mahfuzur Rahman Emon, Syed Mobassir Hossen, MD. Nazmuddoha Ansary, Intesur Ahmed, Fazle Rabbi Rakib, Shahriar Elahi Dhruvo, Souhardya Saha Dip, Akib Hasan Pavel, Marsia Haque Meghla, Md. Rezwanul Haque, Sayma Sultana Chowdhury, Farig Sadeque, Tahsin Reasat, Ahmed Imtiaz Humayun, Asif Shahriyar Sushmit

While strides have been made in deep-learning-based Bengali Optical Character Recognition (OCR) in the past decade, the absence of large Document Layout Analysis (DLA) datasets has hindered the application of OCR in document transcription, e.g., transcribing historical documents and newspapers.

Benchmarking Document Layout Analysis +2

TEAM-Atreides at SemEval-2022 Task 11: On leveraging data augmentation and ensemble to recognize complex Named Entities in Bangla

no code implementations • SemEval (NAACL) 2022 • Nazia Tasnim, Md. Istiak Hossain Shihab, Asif Shahriyar Sushmit, Steven Bethard, Farig Sadeque

Many areas, such as the biological and healthcare domain, artistic works, and organization names, have nested, overlapping, discontinuous entity mentions that may even be syntactically or semantically ambiguous in practice.

Data Augmentation

A BERT-based One-Pass Multi-Task Model for Clinical Temporal Relation Extraction

no code implementations • WS 2020 • Chen Lin, Timothy Miller, Dmitriy Dligach, Farig Sadeque, Steven Bethard, Guergana Savova

Recently, BERT has achieved state-of-the-art performance in temporal relation extraction from clinical Electronic Medical Records text.

Multi-Task Learning Relation +2

Predicting engagement in online social networks: Challenges and opportunities

no code implementations • 11 Jul 2019 • Farig Sadeque, Steven Bethard

We classified these works based on our task definitions, and explored the machine learning models that have been used for any kind of participation prediction.

BIG-bench Machine Learning Domain Adaptation +1