no code implementations • 16 Oct 2024 • Ahmed Ali, Chiara Gabellieri, Antonio Franchi
We propose a new multirotor aerial vehicle class of designs composed of a multi-body structure in which a main body is connected by passive joints to links equipped with propellers.
no code implementations • 5 Aug 2024 • Yassine El Kheir, Hamdy Mubarak, Ahmed Ali, Shammur Absar Chowdhury
Phonetically correct transcribed speech resources for dialectal Arabic are scarce.
no code implementations • 21 Oct 2023 • Yassine El Kheir, Ahmed Ali, Shammur Absar Chowdhury
Pronunciation assessment and its application in computer-aided pronunciation training (CAPT) have seen impressive progress in recent years.
1 code implementation • 27 Sep 2023 • Amir Hussein, Dorsa Zeinali, Ondřej Klejch, Matthew Wiesner, Brian Yan, Shammur Chowdhury, Ahmed Ali, Shinji Watanabe, Sanjeev Khudanpur
Designing effective automatic speech recognition (ASR) systems for Code-Switching (CS) often depends on the availability of the transcribed CS resources.
no code implementations • 14 Sep 2023 • Yassine El Kheir, Shammur Absar Chowdhury, Ahmed Ali
The phonological discrepancies between a speaker's native (L1) and the non-native language (L2) serves as a major factor for mispronunciation.
no code implementations • 14 Sep 2023 • Yassine El Kheir, Shammur Absar Chowdhury, Ahmed Ali
Research on pronunciation assessment systems focuses on utilizing phonetic and phonological aspects of non-native (L2) speech, often neglecting the rich layer of information hidden within the non-verbal cues.
1 code implementation • 9 Aug 2023 • Fahim Dalvi, Maram Hasanain, Sabri Boughorbel, Basel Mousi, Samir Abdaljalil, Nizi Nazar, Ahmed Abdelali, Shammur Absar Chowdhury, Hamdy Mubarak, Ahmed Ali, Majd Hawasly, Nadir Durrani, Firoj Alam
In this study, we introduce the LLMeBench framework, which can be seamlessly customized to evaluate LLMs for any NLP task, regardless of language.
no code implementations • 23 Jul 2023 • Yousseif Elshahawy, Yassine El Kheir, Shammur Absar Chowdhury, Ahmed Ali
Furthermore, the platform offers flexibility to admin roles to add new data or tasks beyond dialectal speech and word collection, which are displayed to contributors.
no code implementations • 7 Jun 2023 • Massa Baali, Ahmed Ali
This paper presents FOOCTTS, an automatic pipeline for a football commentator that generates speech with background crowd noise.
no code implementations • 24 May 2023 • Ahmed Abdelali, Hamdy Mubarak, Shammur Absar Chowdhury, Maram Hasanain, Basel Mousi, Sabri Boughorbel, Yassine El Kheir, Daniel Izham, Fahim Dalvi, Majd Hawasly, Nizi Nazar, Yousseif Elshahawy, Ahmed Ali, Nadir Durrani, Natasa Milic-Frayling, Firoj Alam
Our findings provide valuable insights into the applicability of LLMs for Arabic NLP and speech processing tasks.
no code implementations • 9 May 2023 • Yassine El Kheir, Fouad Khnaisser, Shammur Absar Chowdhury, Hamdy Mubarak, Shazia Afzal, Ahmed Ali
This paper introduces a novel Arabic pronunciation learning application QVoice, powered with end-to-end mispronunciation detection and feedback generator module.
no code implementations • 2 Apr 2023 • Shammur Absar Chowdhury, Ahmed Ali
The success of the multilingual automatic speech recognition systems empowered many voice-driven applications.
2 code implementations • 22 Jan 2023 • Massa Baali, Tomoki Hayashi, Hamdy Mubarak, Soumi Maiti, Shinji Watanabe, Wassim El-Hajj, Ahmed Ali
Several high-resource Text to Speech (TTS) systems currently produce natural, well-established human-like speech.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +3
no code implementations • 22 Nov 2022 • Injy Hamed, Amir Hussein, Oumnia Chellah, Shammur Chowdhury, Hamdy Mubarak, Sunayana Sitaram, Nizar Habash, Ahmed Ali
Code-switching poses a number of challenges and opportunities for multilingual automatic speech recognition.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +3
no code implementations • 2 Nov 2022 • Yassine El Kheir, Shammur Absar Chowdhury, Ahmed Ali, Hamdy Mubarak, Shazia Afzal
Our proposed technique achieves state-of-the-art results, with Speechocean762, on ASR dependent mispronunciation detection models at phoneme level, with a 2. 0% gain in Pearson Correlation Coefficient (PCC) compared to the previous state-of-the-art [1].
Ranked #4 on Phone-level pronunciation scoring on speechocean762
no code implementations • 1 Oct 2022 • Sanjay Chawla, Preslav Nakov, Ahmed Ali, Wendy Hall, Issa Khalil, Xiaosong Ma, Husrev Taha Sencar, Ingmar Weber, Michael Wooldridge, Ting Yu
The rise of attention networks, self-supervised learning, generative modeling, and graph neural networks has widened the application space of AI.
1 code implementation • 7 Mar 2022 • Massa Baali, Wassim El-Hajj, Ahmed Ali
We propose an unsupervised approach to construct speech-to-speech corpus, aligned on short segment levels, to produce a parallel speech corpus in the source- and target- languages.
no code implementations • 30 Jan 2022 • Michael Aupetit, Ahmed Ali
In multiclass classification of multidimensional data, the user wants to build a model of the classes to predict the label of unseen data.
no code implementations • 7 Jan 2022 • Amir Hussein, Shammur Absar Chowdhury, Ahmed Abdelali, Najim Dehak, Ahmed Ali, Sanjeev Khudanpur
The pervasiveness of intra-utterance code-switching (CS) in spoken content requires that speech recognition (ASR) systems handle mixed language.
no code implementations • ACL 2021 • Hamdy Mubarak, Amir Hussein, Shammur Absar Chowdhury, Ahmed Ali
We also report the first baseline for Arabic punctuation restoration.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +8
no code implementations • 4 Jul 2021 • Ahmed Ali, Shammur Chowdhury, Amir Hussein, Yasser Hifny
Code-switching in automatic speech recognition (ASR) is an important challenge due to globalization.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
no code implementations • 1 Jul 2021 • Shammur Absar Chowdhury, Nadir Durrani, Ahmed Ali
In our study, we conduct a post-hoc functional interpretability analysis of pretrained speech models using the probing framework [1].
no code implementations • 24 Jun 2021 • Hamdy Mubarak, Amir Hussein, Shammur Absar Chowdhury, Ahmed Ali
We also report the first baseline for Arabic punctuation restoration.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +8
no code implementations • 10 Jun 2021 • Amir Hussein, Shammur Chowdhury, Najim Dehak, Ahmed Ali
In this paper, we exploit the transfer learning approach to design End-to-End (E2E) CS ASR systems for the two low-resourced language pairs using different monolingual speech data and a small set of noisy CS data.
no code implementations • 31 May 2021 • Shammur Absar Chowdhury, Amir Hussein, Ahmed Abdelali, Ahmed Ali
We evaluate the system performance handling: (i) monolingual (Ar, En and Fr); (ii) multi-dialectal (Modern Standard Arabic, along with dialectal variation such as Egyptian and Moroccan); (iii) code-switching -- cross-lingual (Ar-En/Fr) and dialectal (MSA-Egyptian dialect) test cases, and compare with current state-of-the-art systems.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
1 code implementation • 21 Jan 2021 • Amir Hussein, Shinji Watanabe, Ahmed Ali
Recent advances in automatic speech recognition (ASR) have achieved accuracy levels comparable to human transcribers, which led researchers to debate if the machine has reached human performance.
no code implementations • 14 Dec 2020 • Ahmed Ali, Luciano Maiani, Alexander Parkhomenko, Wei Wang
Recently, the Belle Collaboration has updated the analysis of the cross sections for the processes $e^+ e^- \to \Upsilon(nS)\, \pi^+ \pi^-$ ($n = 1,\, 2,\, 3$) in the $e^+ e^-$ center-of-mass energy range from 10. 52 to 11. 02~GeV.
High Energy Physics - Phenomenology High Energy Physics - Experiment
no code implementations • 14 Dec 2020 • Ahmed Ali, Ishtiaq Ahmed, M. Jamil Aslam, Alexander Parkhomenko, Abdur Rehman
We interpret these narrow resonances as compact hidden-charm diquark-diquark-antiquark pentaquarks.
High Energy Physics - Phenomenology High Energy Physics - Experiment
1 code implementation • 31 Oct 2020 • Ahmed Ali, Yasser Hifny
The second architecture is based on a deep CNN model.
1 code implementation • 8 Aug 2020 • Ahmed Ali, Steve Renals
Measuring the performance of automatic speech recognition (ASR) systems requires manually transcribed data in order to compute the word error rate (WER), which is often time-consuming and expensive.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +3
1 code implementation • ACL 2020 • Ramy Baly, Georgi Karadzhov, Jisun An, Haewoon Kwak, Yoan Dinkov, Ahmed Ali, James Glass, Preslav Nakov
Alternatively, we can profile entire news outlets and look for those that are likely to publish fake or biased content.
1 code implementation • 20 Oct 2019 • Yoan Dinkov, Ahmed Ali, Ivan Koychev, Preslav Nakov
Our analysis shows that the use of acoustic signal helped to improve bias detection by more than 6% absolute over using text and metadata only.
no code implementations • 4 Oct 2019 • Daniel Kopev, Ahmed Ali, Ivan Koychev, Preslav Nakov
We present work on deception detection, where, given a spoken claim, we aim to predict its factuality.
no code implementations • 26 Sep 2019 • Sameer Khurana, Ahmed Ali, James Glass
We analyze the following; transfer learning from high resource broadcast domain to low-resource dialectal domain and semi-supervised learning where we use in-domain unlabeled audio data collected from YouTube.
1 code implementation • 9 Jul 2019 • Yonatan Belinkov, Ahmed Ali, James Glass
End-to-end neural network systems for automatic speech recognition (ASR) are trained from acoustic features to text transcriptions.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 4 Dec 2018 • Suwon Shon, Ahmed Ali, James Glass
An important issue for end-to-end systems is to have some knowledge of the application domain, because the system can be vulnerable to use cases that were not seen in the training phase; such a scenario is often referred to as a domain mismatched condition.
no code implementations • COLING 2018 • Marcos Zampieri, Shervin Malmasi, Preslav Nakov, Ahmed Ali, Suwon Shon, James Glass, Yves Scherrer, Tanja Samard{\v{z}}i{\'c}, Nikola Ljube{\v{s}}i{\'c}, J{\"o}rg Tiedemann, Chris van der Lee, Stefan Grondelaers, Nelleke Oostdijk, Dirk Speelman, Antal Van den Bosch, Ritesh Kumar, Bornini Lahiri, Mayank Jain
We present the results and the findings of the Second VarDial Evaluation Campaign on Natural Language Processing (NLP) for Similar Languages, Varieties and Dialects.
1 code implementation • ACL 2018 • Ahmed Ali, Steve Renals
Measuring the performance of automatic speech recognition (ASR) systems requires manually transcribed data in order to compute the word error rate (WER), which is often time-consuming and expensive.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +5
2 code implementations • 12 Mar 2018 • Suwon Shon, Ahmed Ali, James Glass
Although the Siamese network with language embeddings did not achieve as good a result as the end-to-end DID system, the two approaches had good synergy when combined together in a fused system.
Sound Audio and Speech Processing
no code implementations • 21 Sep 2017 • Ahmed Ali, Preslav Nakov, Peter Bell, Steve Renals
We study the problem of evaluating automatic speech recognition (ASR) systems that target dialectal speech input.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +3
1 code implementation • 21 Sep 2017 • Ahmed Ali, Stephan Vogel, Steve Renals
Two hours of audio per dialect were released for development and a further two hours were used for evaluation.
no code implementations • 28 Aug 2017 • Suwon Shon, Ahmed Ali, James Glass
In order to achieve a robust ADI system, we explored both Siamese neural network models to learn similarity and dissimilarities among Arabic dialects, as well as i-vector post-processing to adapt domain mismatches.
no code implementations • WS 2017 • Marcos Zampieri, Shervin Malmasi, Nikola Ljube{\v{s}}i{\'c}, Preslav Nakov, Ahmed Ali, J{\"o}rg Tiedemann, Yves Scherrer, No{\"e}mi Aepli
We present the results of the VarDial Evaluation Campaign on Natural Language Processing (NLP) for Similar Languages, Varieties and Dialects, which we organized as part of the fourth edition of the VarDial workshop at EACL{'}2017.
no code implementations • EACL 2017 • Fahim Dalvi, Yifan Zhang, Sameer Khurana, Nadir Durrani, Hassan Sajjad, Ahmed Abdelali, Hamdy Mubarak, Ahmed Ali, Stephan Vogel
This paper presents QCRI{'}s Arabic-to-English live speech translation system.
no code implementations • EACL 2017 • Renars Liepins, Ulrich Germann, Guntis Barzdins, Alex Birch, ra, Steve Renals, Susanne Weber, Peggy van der Kreeft, Herv{\'e} Bourlard, Jo{\~a}o Prieto, Ond{\v{r}}ej Klejch, Peter Bell, Alex Lazaridis, ros, Alfonso Mendes, Sebastian Riedel, Mariana S. C. Almeida, Pedro Balage, Shay B. Cohen, Tomasz Dwojak, Philip N. Garner, Andreas Giefer, Marcin Junczys-Dowmunt, Hina Imran, David Nogueira, Ahmed Ali, Mir, Sebasti{\~a}o a, Andrei Popescu-Belis, Lesly Miculicich Werlen, Nikos Papasarantopoulos, Abiola Obamuyide, Clive Jones, Fahim Dalvi, Andreas Vlachos, Yang Wang, Sibo Tong, Rico Sennrich, Nikolaos Pappas, Shashi Narayan, Marco Damonte, Nadir Durrani, Sameer Khurana, Ahmed Abdelali, Hassan Sajjad, Stephan Vogel, David Sheppey, Chris Hernon, Jeff Mitchell
We present the first prototype of the SUMMA Platform: an integrated platform for multilingual media monitoring.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +5
no code implementations • WS 2016 • Shervin Malmasi, Marcos Zampieri, Nikola Ljube{\v{s}}i{\'c}, Preslav Nakov, Ahmed Ali, J{\"o}rg Tiedemann
We present the results of the third edition of the Discriminating between Similar Languages (DSL) shared task, which was organized as part of the VarDial{'}2016 workshop at COLING{'}2016.
no code implementations • 19 Sep 2016 • Ahmed Ali, Peter Bell, James Glass, Yacine Messaoui, Hamdy Mubarak, Steve Renals, Yifan Zhang
For language modelling, we made available over 110M words crawled from Aljazeera Arabic website Aljazeera. net for a 10 year duration 2000-2011.
no code implementations • 19 Sep 2016 • Sameer Khurana, Ahmed Ali, Steve Renals
In this work, we present a new Vector Space Model (VSM) of speech utterances for the task of spoken dialect identification.
1 code implementation • 23 Sep 2015 • Ahmed Ali, Najim Dehak, Patrick Cardinal, Sameer Khurana, Sree Harsha Yella, James Glass, Peter Bell, Steve Renals
We used these features in a binary classifier to discriminate between Modern Standard Arabic (MSA) and Dialectal Arabic, with an accuracy of 100%.