Search Results for author: Ahmed Ali

Found 50 papers, 14 papers with code

Arabic Speech Recognition by End-to-End, Modular Systems and Human

1 code implementation 21 Jan 2021 Amir Hussein, Shinji Watanabe, Ahmed Ali

Recent advances in automatic speech recognition (ASR) have achieved accuracy levels comparable to human transcribers, which led researchers to debate if the machine has reached human performance.

Arabic Speech Recognition Automatic Speech Recognition +3

Convolutional Neural Networks and Language Embeddings for End-to-End Dialect Recognition

2 code implementations 12 Mar 2018 Suwon Shon, Ahmed Ali, James Glass

Although the Siamese network with language embeddings did not achieve as good a result as the end-to-end DID system, the two approaches had good synergy when combined together in a fused system.

Sound Audio and Speech Processing

Speech Recognition Challenge in the Wild: Arabic MGB-3

1 code implementation 21 Sep 2017 Ahmed Ali, Stephan Vogel, Steve Renals

Two hours of audio per dialect were released for development and a further two hours were used for evaluation.

Arabic Speech Recognition Dialect Identification +2

Analyzing Phonetic and Graphemic Representations in End-to-End Automatic Speech Recognition

1 code implementation 9 Jul 2019 Yonatan Belinkov, Ahmed Ali, James Glass

End-to-end neural network systems for automatic speech recognition (ASR) are trained from acoustic features to text transcriptions.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Word Error Rate Estimation for Speech Recognition: e-WER

1 code implementation ACL 2018 Ahmed Ali, Steve Renals

Measuring the performance of automatic speech recognition (ASR) systems requires manually transcribed data in order to compute the word error rate (WER), which is often time-consuming and expensive.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

Word Error Rate Estimation Without ASR Output: e-WER2

1 code implementation 8 Aug 2020 Ahmed Ali, Steve Renals

Measuring the performance of automatic speech recognition (ASR) systems requires manually transcribed data in order to compute the word error rate (WER), which is often time-consuming and expensive.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Predicting the Leading Political Ideology of YouTube Channels Using Acoustic, Textual, and Metadata Information

1 code implementation 20 Oct 2019 Yoan Dinkov, Ahmed Ali, Ivan Koychev, Preslav Nakov

Our analysis shows that the use of acoustic signal helped to improve bias detection by more than 6% absolute over using text and metadata only.

Bias Detection Multimodal Deep Learning

Creating Speech-to-Speech Corpus from Dubbed Series

1 code implementation 7 Mar 2022 Massa Baali, Wassim El-Hajj, Ahmed Ali

We propose an unsupervised approach to construct a speech-to-speech corpus, aligned at the short-segment level, to produce a parallel speech corpus in the source and target languages.

Machine Translation speech-recognition +1

MIT-QCRI Arabic Dialect Identification System for the 2017 Multi-Genre Broadcast Challenge

no code implementations 28 Aug 2017 Suwon Shon, Ahmed Ali, James Glass

In order to achieve a robust ADI system, we explored both Siamese neural network models to learn similarity and dissimilarities among Arabic dialects, as well as i-vector post-processing to adapt domain mismatches.

Arabic Speech Recognition Dialect Identification +2

The MGB-2 Challenge: Arabic Multi-Dialect Broadcast Media Recognition

no code implementations 19 Sep 2016 Ahmed Ali, Peter Bell, James Glass, Yacine Messaoui, Hamdy Mubarak, Steve Renals, Yifan Zhang

For language modelling, we made available over 110M words crawled from Aljazeera Arabic website Aljazeera.net for a 10 year duration 2000-2011.

Acoustic Modelling Language Modelling +1

Multi-view Dimensionality Reduction for Dialect Identification of Arabic Broadcast Speech

no code implementations 19 Sep 2016 Sameer Khurana, Ahmed Ali, Steve Renals

In this work, we present a new Vector Space Model (VSM) of speech utterances for the task of spoken dialect identification.

Dialect Identification Dimensionality Reduction

Domain Attentive Fusion for End-to-end Dialect Identification with Unknown Target Domain

no code implementations 4 Dec 2018 Suwon Shon, Ahmed Ali, James Glass

An important issue for end-to-end systems is to have some knowledge of the application domain, because the system can be vulnerable to use cases that were not seen in the training phase; such a scenario is often referred to as a domain mismatched condition.

Dialect Identification

Findings of the VarDial Evaluation Campaign 2017

no code implementations WS 2017 Marcos Zampieri, Shervin Malmasi, Nikola Ljubešić, Preslav Nakov, Ahmed Ali, Jörg Tiedemann, Yves Scherrer, Noëmi Aepli

We present the results of the VarDial Evaluation Campaign on Natural Language Processing (NLP) for Similar Languages, Varieties and Dialects, which we organized as part of the fourth edition of the VarDial workshop at EACL'2017.

Dependency Parsing Dialect Identification

DARTS: Dialectal Arabic Transcription System

no code implementations 26 Sep 2019 Sameer Khurana, Ahmed Ali, James Glass

We analyze the following: transfer learning from a high-resource broadcast domain to a low-resource dialectal domain, and semi-supervised learning where we use in-domain unlabeled audio data collected from YouTube.

Language Modelling Transfer Learning

Tetraquark Interpretation and Production Mechanism of the Belle $Y_b (10750)$-Resonance

no code implementations 14 Dec 2020 Ahmed Ali, Luciano Maiani, Alexander Parkhomenko, Wei Wang

Recently, the Belle Collaboration has updated the analysis of the cross sections for the processes $e^+ e^- \to \Upsilon(nS)\, \pi^+ \pi^-$ ($n = 1,\, 2,\, 3$) in the $e^+ e^-$ center-of-mass energy range from 10.52 to 11.02 GeV.

High Energy Physics - Phenomenology High Energy Physics - Experiment

Interpretation of LHCb Hidden-Charm Pentaquarks within the Compact Diquark Model

no code implementations 14 Dec 2020 Ahmed Ali, Ishtiaq Ahmed, M. Jamil Aslam, Alexander Parkhomenko, Abdur Rehman

We interpret these narrow resonances as compact hidden-charm diquark-diquark-antiquark pentaquarks.

High Energy Physics - Phenomenology High Energy Physics - Experiment

Towards One Model to Rule All: Multilingual Strategy for Dialectal Code-Switching Arabic ASR

no code implementations 31 May 2021 Shammur Absar Chowdhury, Amir Hussein, Ahmed Abdelali, Ahmed Ali

We evaluate the system performance handling: (i) monolingual (Ar, En and Fr); (ii) multi-dialectal (Modern Standard Arabic, along with dialectal variation such as Egyptian and Moroccan); (iii) code-switching -- cross-lingual (Ar-En/Fr) and dialectal (MSA-Egyptian dialect) test cases, and compare with current state-of-the-art systems.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Balanced End-to-End Monolingual pre-training for Low-Resourced Indic Languages Code-Switching Speech Recognition

no code implementations 10 Jun 2021 Amir Hussein, Shammur Chowdhury, Najim Dehak, Ahmed Ali

In this paper, we exploit the transfer learning approach to design End-to-End (E2E) CS ASR systems for the two low-resourced language pairs using different monolingual speech data and a small set of noisy CS data.

Language Modelling speech-recognition +3

Textual Data Augmentation for Arabic-English Code-Switching Speech Recognition

no code implementations 7 Jan 2022 Amir Hussein, Shammur Absar Chowdhury, Ahmed Abdelali, Najim Dehak, Ahmed Ali, Sanjeev Khudanpur

The pervasiveness of intra-utterance code-switching (CS) in spoken content requires that speech recognition (ASR) systems handle mixed language.

Language Modelling speech-recognition +5

ClassSPLOM -- A Scatterplot Matrix to Visualize Separation of Multiclass Multidimensional Data

no code implementations 30 Jan 2022 Michael Aupetit, Ahmed Ali

In multiclass classification of multidimensional data, the user wants to build a model of the classes to predict the label of unseen data.

Ten Years after ImageNet: A 360° Perspective on AI

no code implementations 1 Oct 2022 Sanjay Chawla, Preslav Nakov, Ahmed Ali, Wendy Hall, Issa Khalil, Xiaosong Ma, Husrev Taha Sencar, Ingmar Weber, Michael Wooldridge, Ting Yu

The rise of attention networks, self-supervised learning, generative modeling, and graph neural networks has widened the application space of AI.

Decision Making Fairness +1

SpeechBlender: Speech Augmentation Framework for Mispronunciation Data Generation

no code implementations 2 Nov 2022 Yassine El Kheir, Shammur Absar Chowdhury, Ahmed Ali, Hamdy Mubarak, Shazia Afzal

Our proposed technique achieves state-of-the-art results, with Speechocean762, on ASR dependent mispronunciation detection models at phoneme level, with a 2.0% gain in Pearson Correlation Coefficient (PCC) compared to the previous state-of-the-art [1].

Data Augmentation Multi-Task Learning +1

Multilingual Word Error Rate Estimation: e-WER3

no code implementations 2 Apr 2023 Shammur Absar Chowdhury, Ahmed Ali

The success of multilingual automatic speech recognition systems has empowered many voice-driven applications.

Automatic Speech Recognition speech-recognition +1

QVoice: Arabic Speech Pronunciation Learning Application

no code implementations 9 May 2023 Yassine El Kheir, Fouad Khnaisser, Shammur Absar Chowdhury, Hamdy Mubarak, Shazia Afzal, Ahmed Ali

This paper introduces a novel Arabic pronunciation learning application, QVoice, powered by an end-to-end mispronunciation detection and feedback generator module.

FOOCTTS: Generating Arabic Speech with Acoustic Environment for Football Commentator

no code implementations 7 Jun 2023 Massa Baali, Ahmed Ali

This paper presents FOOCTTS, an automatic pipeline for a football commentator that generates speech with background crowd noise.

Automatic Speech Recognition speech-recognition +1

MyVoice: Arabic Speech Resource Collaboration Platform

no code implementations 23 Jul 2023 Yousseif Elshahawy, Yassine El Kheir, Shammur Absar Chowdhury, Ahmed Ali

Furthermore, the platform offers flexibility to admin roles to add new data or tasks beyond dialectal speech and word collection, which are displayed to contributors.

The complementary roles of non-verbal cues for Robust Pronunciation Assessment

no code implementations 14 Sep 2023 Yassine El Kheir, Shammur Absar Chowdhury, Ahmed Ali

Research on pronunciation assessment systems focuses on utilizing phonetic and phonological aspects of non-native (L2) speech, often neglecting the rich layer of information hidden within the non-verbal cues.

L1-aware Multilingual Mispronunciation Detection Framework

no code implementations 14 Sep 2023 Yassine El Kheir, Shammur Absar Chowdhury, Ahmed Ali

The phonological discrepancies between a speaker's native (L1) and the non-native language (L2) serve as a major factor for mispronunciation.

Automatic Pronunciation Assessment -- A Review

no code implementations 21 Oct 2023 Yassine El Kheir, Ahmed Ali, Shammur Absar Chowdhury

Pronunciation assessment and its application in computer-aided pronunciation training (CAPT) have seen impressive progress in recent years.
