1 code implementation • 16 Nov 2023 • Helin Wang, Venkatesh Ravichandran, Milind Rao, Becky Lammers, Myra Sydnor, Nicholas Maragakis, Ankur A. Butala, Jayne Zhang, Lora Clawson, Victoria Chovaz, Laureano Moro-Velazquez
Spoken language understanding (SLU) systems often exhibit suboptimal performance in processing atypical speech, which is typically caused by neurological conditions and motor impairments.
no code implementations • 10 Nov 2023 • Trevor Meyer, Camden Shultz, Najim Dehak, Laureano Moro-Velazquez, Pedro Irazoqui
The network simultaneously learns features at many time scales for sequence classification with significantly reduced parameters and operations.
no code implementations • 8 Sep 2023 • Saurabhchand Bhati, Jesús Villalba, Laureano Moro-Velazquez, Thomas Thebaud, Najim Dehak
Cascaded SpeechCLIP attempted to generate localized word-level information and utilize both the pretrained image and text encoders.
1 code implementation • 18 Jun 2023 • Helin Wang, Thomas Thebaud, Jesus Villalba, Myra Sydnor, Becky Lammers, Najim Dehak, Laureano Moro-Velazquez
We present a novel typical-to-atypical voice conversion approach (DuTa-VC), which (i) can be trained with nonparallel data, (ii) is the first to introduce a diffusion probabilistic model for this task, (iii) preserves the target speaker identity, and (iv) is aware of the phoneme duration of the target speaker.
no code implementations • 12 Apr 2023 • Saurabhchand Bhati, Jesús Villalba, Piotr Żelasko, Laureano Moro-Velazquez, Najim Dehak
These representations significantly reduce the amount of labeled data needed for downstream task performance, such as automatic speech recognition.
no code implementations • 7 Mar 2023 • Martin Sustek, Samik Sadhu, Lukas Burget, Hynek Hermansky, Jesus Villalba, Laureano Moro-Velazquez, Najim Dehak
The JEM training relies on "positive examples" (i.e., examples from the training data set) as well as on "negative examples", which are samples from the modeled distribution $p(x)$ generated by means of Stochastic Gradient Langevin Dynamics (SGLD).
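The SGLD sampler described above can be sketched in a few lines. This is an illustrative toy version, not the paper's implementation: the energy is a hand-picked 1-D quadratic, $E(x) = x^2/2$ (so $p(x)$ is a standard Gaussian), whereas a real JEM would use the energy implied by a classifier's logits and backpropagate through the network.

```python
import math
import random

def energy_grad(x):
    # dE/dx for the toy energy E(x) = x^2 / 2; a JEM would compute this
    # by differentiating -logsumexp of the classifier logits w.r.t. x.
    return x

def sgld_sample(x0, step_size=0.01, n_steps=1000, seed=0):
    """SGLD update: x <- x - (a/2) * dE/dx + sqrt(a) * N(0, 1)."""
    rng = random.Random(seed)
    x = x0
    for _ in range(n_steps):
        x = (x
             - 0.5 * step_size * energy_grad(x)
             + math.sqrt(step_size) * rng.gauss(0.0, 1.0))
    return x

# Draw a batch of "negative examples"; for long chains their spread
# should roughly match the unit variance of the target p(x).
negatives = [sgld_sample(0.0, seed=s) for s in range(200)]
print(len(negatives))
```

In JEM training these samples stand in for draws from the model distribution when estimating the gradient of the log-likelihood.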
no code implementations • 10 Aug 2022 • Jaejin Cho, Jesús Villalba, Laureano Moro-Velazquez, Najim Dehak
In recent studies, self-supervised pre-trained models tend to outperform supervised pre-trained models in transfer learning.
1 code implementation • 10 Aug 2022 • Jaejin Cho, Raghavendra Pappagari, Piotr Żelasko, Laureano Moro-Velazquez, Jesús Villalba, Najim Dehak
This paper applies a non-contrastive self-supervised learning method on an unlabeled speech corpus to learn utterance-level embeddings.
no code implementations • 5 Oct 2021 • Saurabhchand Bhati, Jesús Villalba, Piotr Żelasko, Laureano Moro-Velazquez, Najim Dehak
We overcome this limitation with a segmental contrastive predictive coding (SCPC) framework to model the signal structure at a higher level, e.g., the phone level.
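The contrastive objective underlying predictive coding can be illustrated with a minimal InfoNCE-style loss. This is a hypothetical toy sketch, not the SCPC implementation: vectors are plain Python lists, the encoder is the identity, and a single positive is scored against a handful of negatives.

```python
import math

def dot(a, b):
    # Similarity score between two embedding vectors.
    return sum(x * y for x, y in zip(a, b))

def info_nce(anchor, positive, negatives, temperature=0.1):
    """InfoNCE loss: -log softmax of the positive's score among all scores."""
    scores = [dot(anchor, positive) / temperature]
    scores += [dot(anchor, n) / temperature for n in negatives]
    m = max(scores)  # stabilize logsumexp
    log_denom = m + math.log(sum(math.exp(s - m) for s in scores))
    return -(scores[0] - log_denom)

anchor = [1.0, 0.0]
positive = [0.9, 0.1]                      # similar to the anchor -> low loss
negatives = [[-1.0, 0.0], [0.0, 1.0]]      # dissimilar frames
loss = info_nce(anchor, positive, negatives)
print(loss > 0.0)
```

Minimizing this loss pulls representations of a frame and its true future together while pushing apart representations of unrelated frames; SCPC applies the same idea at the segment level as well.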
no code implementations • 13 Sep 2021 • Raghavendra Pappagari, Piotr Żelasko, Jesús Villalba, Laureano Moro-Velazquez, Najim Dehak
While most of the current approaches focus on inferring emotion from isolated utterances, we argue that this is not sufficient to achieve conversational emotion recognition (CER) which deals with recognizing emotions in conversations.
no code implementations • 15 Jun 2021 • Marc Illa, Bence Mark Halpern, Rob van Son, Laureano Moro-Velazquez, Odette Scharenborg
This approach alleviates the evaluation problem that normally arises when converting typical speech to pathological speech: in our approach, the voice conversion (VC) model does not need to be optimised for speech degradation, only for the speaker change.
no code implementations • 3 Jun 2021 • Saurabhchand Bhati, Jesús Villalba, Piotr Żelasko, Laureano Moro-Velazquez, Najim Dehak
We overcome this limitation with a segmental contrastive predictive coding (SCPC) framework that can model the signal structure at a higher level, e.g., at the phoneme level.
1 code implementation • 29 Nov 2020 • Julian D. Arias-Londoño, Jorge A. Gomez-Garcia, Laureano Moro-Velazquez, Juan I. Godino-Llorente
Current standard protocols used in the clinic for diagnosing COVID-19 include molecular or antigen tests, generally complemented by a plain chest X-ray.
no code implementations • 27 Oct 2020 • Raghavendra Pappagari, Jesús Villalba, Piotr Żelasko, Laureano Moro-Velazquez, Najim Dehak
Data augmentation is a widely used strategy for training robust machine learning models.