Search Results for author: Ahmed Adel Attia

Found 5 papers, 1 papers with code

Improving Speech Inversion Through Self-Supervised Embeddings and Enhanced Tract Variables

no code implementations • 17 Sep 2023 • Ahmed Adel Attia, Yashish M. Siriwardena, Carol Espy-Wilson

The performance of deep learning models depends significantly on their capacity to encode input features efficiently and decode them into meaningful outputs.

Self-Supervised Learning

Paper
Add Code

Kid-Whisper: Towards Bridging the Performance Gap in Automatic Speech Recognition for Children VS. Adults

no code implementations • 12 Sep 2023 • Ahmed Adel Attia, Jing Liu, Wei Ai, Dorottya Demszky, Carol Espy-Wilson

Recent advancements in Automatic Speech Recognition (ASR) systems, exemplified by Whisper, have demonstrated the potential of these systems to approach human-level performance given sufficient data.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

Enhancing Speech Articulation Analysis using a Geometric Transformation of the X-ray Microbeam Dataset

no code implementations • 18 May 2023 • Ahmed Adel Attia, Mark Tiede, Carol Y. Espy-Wilson

Accurate analysis of speech articulation is crucial for speech analysis.

Anatomy

Paper
Add Code

Masked Autoencoders Are Articulatory Learners

1 code implementation • 27 Oct 2022 • Ahmed Adel Attia, Carol Espy-Wilson

Articulatory recordings track the positions and motion of different articulators along the vocal tract and are widely used to study speech production and to develop speech technologies such as articulatory based speech synthesizers and speech inversion systems.

Paper
Code

Audio Data Augmentation for Acoustic-to-articulatory Speech Inversion using Bidirectional Gated RNNs

no code implementations • 25 May 2022 • Yashish M. Siriwardena, Ahmed Adel Attia, Ganesh Sivaraman, Carol Espy-Wilson

In this work, we compare and contrast different ways of doing data augmentation and show how this technique improves the performance of articulatory speech inversion not only on noisy speech, but also on clean speech data.

Data Augmentation

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.