Search Results for author: Abdul Waheed

Found 19 papers, 5 papers with code

On the Robust Approximation of ASR Metrics

no code implementations18 Feb 2025 Abdul Waheed, Hanin Atwany, Rita Singh, Bhiksha Raj

Traditionally, ASR models are evaluated using metrics like Word Error Rate (WER) and Character Error Rate (CER), which depend on ground truth labels.

speech-recognition Speech Recognition

Lost in Transcription, Found in Distribution Shift: Demystifying Hallucination in Speech Foundation Models

no code implementations18 Feb 2025 Hanin Atwany, Abdul Waheed, Rita Singh, Monojit Choudhury, Bhiksha Raj

We examine how factors such as distribution shifts, model size, and model architecture influence the hallucination error rate (HER), a metric we introduce to quantify hallucinations.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

On the Diversity of Synthetic Data and its Impact on Training Large Language Models

no code implementations19 Oct 2024 Hao Chen, Abdul Waheed, Xiang Li, Yidong Wang, Jindong Wang, Bhiksha Raj, Marah I. Abdin

The rise of Large Language Models (LLMs) has accentuated the need for diverse, high-quality pre-training data.

Diversity

What Do Speech Foundation Models Not Learn About Speech?

no code implementations16 Oct 2024 Abdul Waheed, Hanin Atwany, Bhiksha Raj, Rita Singh

The analysis of layer-wise features demonstrates that some models exhibit a convex relationship between the separability of the learned representations and model depth, with different layers capturing task-specific features.

uDistil-Whisper: Label-Free Data Filtering for Knowledge Distillation in Low-Data Regimes

1 code implementation1 Jul 2024 Abdul Waheed, Karima Kadaoui, Bhiksha Raj, Muhammad Abdul-Mageed

Our models are also 25-50% more compute- and memory-efficient while maintaining performance equal to or better than that of the teacher model.

Knowledge Distillation

Towards Zero-Shot Text-To-Speech for Arabic Dialects

no code implementations24 Jun 2024 Khai Duy Doan, Abdul Waheed, Muhammad Abdul-Mageed

We then evaluate our models on a dataset comprising 31 unseen speakers and an in-house dialectal dataset.

Dialect Identification Speech Synthesis +2

To Distill or Not to Distill? On the Robustness of Robust Knowledge Distillation

1 code implementation6 Jun 2024 Abdul Waheed, Karima Kadaoui, Muhammad Abdul-Mageed

Our best-distilled model's overall performance ($45. 0$\% WER) surpasses that of a SoTA model twice its size (SeamlessM4T-large-v2, WER=$47. 0$\%) and its teacher model (Whisper-large-v2, WER=$55. 1$\%), and its average performance on our new dialectal data ($56. 9$\% WER) outperforms all other models.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

A Novel Defocus-Blur Region Detection Approach Based on DCT Feature and PCNN Structure

no code implementations12 Oct 2023 Sadia Basar, Mushtaq Ali, Abdul Waheed, Muneer Ahmad, Mahdi H. Miraz

Therefore, it is important to detect in-focused objects in defocused-blurred images after the segmentation of blurred and non-blurred regions.

N-Shot Benchmarking of Whisper on Diverse Arabic Speech Recognition

no code implementations5 Jun 2023 Bashar Talafha, Abdul Waheed, Muhammad Abdul-Mageed

Whisper, the recently developed multilingual weakly supervised model, is reported to perform well on multiple speech recognition benchmarks in both monolingual and multilingual settings.

Arabic Speech Recognition Benchmarking +2

GPTAraEval: A Comprehensive Evaluation of ChatGPT on Arabic NLP

no code implementations24 May 2023 Md Tawkat Islam Khondaker, Abdul Waheed, El Moatez Billah Nagoudi, Muhammad Abdul-Mageed

Although we further explore and confirm the utility of employing GPT-4 as a potential alternative for human evaluation, our work adds to a growing body of research underscoring the limitations of ChatGPT.

Natural Language Understanding

BloomNet: A Robust Transformer based model for Bloom's Learning Outcome Classification

no code implementations16 Aug 2021 Abdul Waheed, Muskan Goyal, Nimisha Mittal, Deepak Gupta, Ashish Khanna, Moolchand Sharma

For the optimization of educational programs, it is crucial to design course learning outcomes (CLOs) according to the different cognitive levels of Bloom Taxonomy.

Out-of-Distribution Generalization

Domain Controlled Title Generation with Human Evaluation

no code implementations8 Mar 2021 Abdul Waheed, Muskan Goyal, Nimisha Mittal, Deepak Gupta

We study automatic title generation and present a method for generating domain-controlled titles for scientific articles.

Articles

CovidGAN: Data Augmentation Using Auxiliary Classifier GAN for Improved Covid-19 Detection

2 code implementations8 Mar 2021 Abdul Waheed, Muskan Goyal, Deepak Gupta, Ashish Khanna, Fadi Al-Turjman, Placido Rogerio Pinheiro

This has led to the introduction of a variety of deep learning systems and studies have shown that the accuracy of COVID-19 patient detection through the use of chest X-rays is strongly optimistic.

Data Augmentation Generative Adversarial Network

Cannot find the paper you are looking for? You can Submit a new open access paper.