Search Results for author: Pablo Riera

Found 9 papers, 5 papers with code

EnCodecMAE: Leveraging neural codecs for universal audio representation learning

1 code implementation · 14 Sep 2023 · Leonardo Pepino, Pablo Riera, Luciana Ferrer

The goal of universal audio representation learning is to obtain foundational models that can be used for a variety of downstream tasks involving speech, music or environmental sounds.

Representation Learning

Mispronunciation detection using self-supervised speech representations

1 code implementation · 30 Jul 2023 · Jazmin Vidal, Pablo Riera, Luciana Ferrer

We compare two downstream approaches: 1) training the model for phone recognition (PR) using native English data, and 2) training a model directly for the target task using non-native English data.

Self-Supervised Learning speech-recognition +1

Phone and speaker spatial organization in self-supervised speech representations

no code implementations · 24 Feb 2023 · Pablo Riera, Manuela Cerdeiro, Leonardo Pepino, Luciana Ferrer

In this work, we analyze the spatial organization of phone and speaker information in several state-of-the-art speech representations using methods that do not require a downstream model.

Study of positional encoding approaches for Audio Spectrogram Transformers

1 code implementation · 13 Oct 2021 · Leonardo Pepino, Pablo Riera, Luciana Ferrer

Transformers have revolutionized the world of deep learning, especially in the field of natural language processing.

Audio Classification

Simple and Cheap Setup for Timing Tapping Responses Synchronized to Auditory Stimuli

4 code implementations · 30 Apr 2021 · Martin Miguel, Pablo Riera, Diego Fernandez Slezak

It is intended for presenting any auditory stimuli and recording tapping response times with a precision within 2 milliseconds (up to -2ms lag).

Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings

2 code implementations · 8 Apr 2021 · Leonardo Pepino, Pablo Riera, Luciana Ferrer

Emotion recognition datasets are relatively small, making the use of the more sophisticated deep learning approaches challenging.

Speech Emotion Recognition speech-recognition +2

A Study on the Manifestation of Trust in Speech

no code implementations · 9 Feb 2021 · Lara Gauder, Leonardo Pepino, Pablo Riera, Silvina Brussino, Jazmín Vidal, Agustín Gravano, Luciana Ferrer

An automatic prediction of the level of trust that a user has in a certain system could be used to attempt to correct potential distrust by having the system take relevant actions such as apologizing or explaining its decisions.

Detecting Distrust Towards the Skills of a Virtual Assistant Using Speech

no code implementations · 30 Jul 2020 · Leonardo Pepino, Pablo Riera, Lara Gauder, Agustín Gravano, Luciana Ferrer

Research has shown that trust is an essential aspect of human-computer interaction, directly determining the degree to which a person is willing to use the system.
