Search Results for author: Mireia Farrús

Found 10 papers, 1 papers with code

Recycle Your Wav2Vec2 Codebook: A Speech Perceiver for Keyword Spotting

no code implementations • COLING 2022 • Guillermo Cámbara, Jordi Luque, Mireia Farrús

We do so by transferring the codebook as weights for the latent bottleneck of a Keyword Spotting Perceiver, thus initializing such model with phonetic embeddings already.

Small-Footprint Keyword Spotting

Paper
Add Code

Data Augmentation for Low-Resource Quechua ASR Improvement

no code implementations • 14 Jul 2022 • Rodolfo Zevallos, Nuria Bel, Guillermo Cámbara, Mireia Farrús, Jordi Luque

In this paper we describe our data augmentation approach to improve the results of ASR models for low-resource and agglutinative languages.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

Voice Quality and Pitch Features in Transformer-Based Speech Recognition

no code implementations • 21 Dec 2021 • Guillermo Cámbara, Jordi Luque, Mireia Farrús

Jitter and shimmer measurements have shown to be carriers of voice quality and prosodic information which enhance the performance of tasks like speaker recognition, diarization or automatic speech recognition (ASR).

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

Influence of ASR and Language Model on Alzheimer's Disease Detection

no code implementations • 20 Sep 2021 • Joan Codina-Filbà, Guillermo Cámbara, Jordi Luque, Mireia Farrús

Furthermore, we study the influence of a language model -- which tends to correct non-standard sequences of words -- with the lack of language model to decode the hypothesis from the ASR.

Alzheimer's Disease Detection Language Modelling

Paper
Add Code

English Accent Accuracy Analysis in a State-of-the-Art Automatic Speech Recognition System

no code implementations • 9 May 2021 • Guillermo Cámbara, Alex Peiró-Lilja, Mireia Farrús, Jordi Luque

Nowadays, research in speech technologies has gotten a lot out thanks to recently created public domain corpora that contain thousands of recording hours.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge

no code implementations • 29 Jan 2021 • Martin Kocour, Guillermo Cámbara, Jordi Luque, David Bonet, Mireia Farrús, Martin Karafiát, Karel Veselý, Jan ''Honza'' Ĉernocký

This paper describes joint effort of BUT and Telef\'onica Research on development of Automatic Speech Recognition systems for Albayzin 2020 Challenge.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

Combining Prosodic, Voice Quality and Lexical Features to Automatically Detect Alzheimer's Disease

no code implementations • 18 Nov 2020 • Mireia Farrús, Joan Codina-Filbà

Both tasks have been performed extracting 28 features from speech -- based on prosody and voice quality -- and 51 features from the transcriptions -- based on lexical and turn-taking information.

regression

Paper
Add Code

Convolutional Speech Recognition with Pitch and Voice Quality Features

no code implementations • 2 Sep 2020 • Guillermo Cámbara, Jordi Luque, Mireia Farrús

Up to our knowledge, this is the first work combining such pitch and voice quality features with modern convolutional architectures, showing improvements up to 7% and 3% relative WER points, for the publicly available Spanish Common Voice and LibriSpeech 100h datasets, respectively.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

Detection of speech events and speaker characteristics through photo-plethysmographic signal neural processing

no code implementations • 12 Nov 2019 • Guillermo Cámbara, Jordi Luque, Mireia Farrús

The use of photoplethysmogram signal (PPG) for heart and sleep monitoring is commonly found nowadays in smartphones and wrist wearables.

Paper
Add Code

Prosodic Phrase Alignment for Machine Dubbing

1 code implementation • 20 Aug 2019 • Alp Öktem, Mireia Farrús, Antonio Bonafonte

Dubbing is a type of audiovisual translation where dialogues are translated and enacted so that they give the impression that the media is in the target language.

Machine Translation Translation

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.