Search Results for author: Alberto Abad

Found 20 papers, 3 papers with code

A new European Portuguese corpus for the study of Psychosis through speech analysis

no code implementations LREC 2022 Maria Forjó, Daniel Neto, Alberto Abad, H. Sofia Pinto, Joaquim Gago

Psychosis is a clinical syndrome characterized by the presence of symptoms such as hallucinations, thought disorder and disorganized speech.

Multilingual Transfer Learning for Children Automatic Speech Recognition

no code implementations LREC 2022 Thomas Rolland, Alberto Abad, Catia Cucchiarini, Helmer Strik

Our results provide a positive answer to our research question, by showing that using transfer learning on top of a multilingual model for an unseen language outperforms conventional single language-specific learning.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Privacy-oriented manipulation of speaker representations

no code implementations 10 Oct 2023 Francisco Teixeira, Alberto Abad, Bhiksha Raj, Isabel Trancoso

Speaker embeddings are ubiquitous, with applications ranging from speaker recognition and diarization to speech synthesis and voice anonymisation.

Speaker Recognition Speech Synthesis

Memory-augmented conformer for improved end-to-end long-form ASR

1 code implementation 22 Sep 2023 Carlos Carvalho, Alberto Abad

Conformers have recently been proposed as a promising modelling approach for automatic speech recognition (ASR), outperforming recurrent neural network-based approaches and transformers.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

A Simple Feature Method for Prosody Rhythm Comparison

no code implementations 20 Dec 2022 Mariana Julião, Alberto Abad, Helena Moniz

Of all components of Prosody, Rhythm has been regarded as the hardest to address, as it is utterly linked to Pitch and Intensity.

Clustering Sentence

Privacy-preserving Automatic Speaker Diarization

no code implementations 26 Oct 2022 Francisco Teixeira, Alberto Abad, Bhiksha Raj, Isabel Trancoso

Automatic Speaker Diarization (ASD) is an enabling technology with numerous applications, which deals with recordings of multiple speakers, raising special concerns in terms of privacy.

Privacy Preserving speaker-diarization +1

Towards End-to-End Private Automatic Speaker Recognition

no code implementations 23 Jun 2022 Francisco Teixeira, Alberto Abad, Bhiksha Raj, Isabel Trancoso

This poses two important issues: first, knowledge of the speaker embedding extraction model may create security and robustness liabilities for the authentication system, as this knowledge might help attackers in crafting adversarial examples able to mislead the system; second, from the point of view of a service provider the speaker embedding extraction model is arguably one of the most valuable components in the system and, as such, disclosing it would be highly undesirable.

Privacy Preserving Speaker Recognition +1

Using Self-Supervised Feature Extractors with Attention for Automatic COVID-19 Detection from Speech

no code implementations 30 Jun 2021 John Mendonça, Rubén Solera-Ureña, Alberto Abad, Isabel Trancoso

Experimental results demonstrate that models trained on features extracted from self-supervised models perform similarly to or outperform fully-supervised models and models based on handcrafted features.

Domain Adaptation in Dialogue Systems using Transfer and Meta-Learning

no code implementations 22 Feb 2021 Rui Ribeiro, Alberto Abad, José Lopes

We evaluated our model on the MultiWOZ dataset and outperformed DiKTNet in both BLEU and Entity F1 scores when the same amount of data is available.

Domain Adaptation Meta-Learning

Pathological speech detection using x-vector embeddings

no code implementations 2 Mar 2020 Catarina Botelho, Francisco Teixeira, Thomas Rolland, Alberto Abad, Isabel Trancoso

We test our approach against knowledge-based features and i-vectors, and report results for two European Portuguese corpora, for OSA and PD, as well as for an additional Spanish corpus for PD.

Attentive Filtering Networks for Audio Replay Attack Detection

1 code implementation 31 Oct 2018 Cheng-I Lai, Alberto Abad, Korin Richmond, Junichi Yamagishi, Najim Dehak, Simon King

In this work, we propose our replay attacks detection system - Attentive Filtering Network, which is composed of an attention-based filtering mechanism that enhances feature representations in both the frequency and time domains, and a ResNet-based classifier.

Speaker Verification

The DIRHA Portuguese Corpus: A Comparison of Home Automation Command Detection and Recognition in Simulated and Real Data.

no code implementations LREC 2016 Miguel Matos, Alberto Abad, António Serralheiro

In this paper, we describe a new corpus, named DIRHA-L2F RealCorpus, composed of typical home automation speech interactions in European Portuguese that has been recorded by the INESC-ID's Spoken Language Systems Laboratory (L2F) to support the activities of the Distant-speech Interaction for Robust Home Applications (DIRHA) EU-funded project.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

The DIRHA simulated corpus

no code implementations LREC 2014 Luca Cristoforetti, Mirco Ravanelli, Maurizio Omologo, Alessandro Sosi, Alberto Abad, Martin Hagmueller, Petros Maragos

This paper describes a multi-microphone multi-language acoustic corpus being developed under the EC project Distant-speech Interaction for Robust Home Applications (DIRHA).

Dialogue Management Distant Speech Recognition +2
