Search Results for author: Alfonso Ortega

Found 11 papers, 2 papers with code

Unsupervised Multiple Domain Translation through Controlled Disentanglement in Variational Autoencoder

1 code implementation17 Jan 2024 Antonio Almudévar, Théo Mariotte, Alfonso Ortega, Marie Tahon

One of this latent variables is imposed to depend exclusively on the domain, while the other one must depend on the rest of the variability factors of the data.

Disentanglement Translation

Improved Vocal Effort Transfer Vector Estimation for Vocal Effort-Robust Speaker Verification

no code implementations3 May 2023 Iván López-Espejo, Santi Prieto, Alfonso Ortega, Eduardo Lleida

Despite the maturity of modern speaker verification technology, its performance still significantly degrades when facing non-neutrally-phonated (e. g., shouted and whispered) speech.

Speaker Verification

Human-Centric Multimodal Machine Learning: Recent Advances and Testbed on AI-based Recruitment

1 code implementation13 Feb 2023 Alejandro Peña, Ignacio Serna, Aythami Morales, Julian Fierrez, Alfonso Ortega, Ainhoa Herrarte, Manuel Alcantara, Javier Ortega-Garcia

With the aim of studying how current multimodal algorithms based on heterogeneous sources of information are affected by sensitive elements and inner biases in the data, we propose a fictitious case study focused on automated recruitment: FairCVtest.

Decision Making Fairness

Class Token and Knowledge Distillation for Multi-head Self-Attention Speaker Verification Systems

no code implementations6 Nov 2021 Victoria Mingote, Antonio Miguel, Alfonso Ortega, Eduardo Lleida

This paper explores three novel approaches to improve the performance of speaker verification (SV) systems based on deep neural networks (DNN) using Multi-head Self-Attention (MSA) mechanisms and memory layers.

Knowledge Distillation Philosophy +1

Generalizing AUC Optimization to Multiclass Classification for Audio Segmentation With Limited Training Data

no code implementations27 Oct 2021 Pablo Gimeno, Victoria Mingote, Alfonso Ortega, Antonio Miguel, Eduardo Lleida

Area under the ROC curve (AUC) optimisation techniques developed for neural networks have recently demonstrated their capabilities in different audio and speech related tasks.

Segmentation

Symbolic AI for XAI: Evaluating LFIT Inductive Programming for Fair and Explainable Automatic Recruitment

no code implementations1 Dec 2020 Alfonso Ortega, Julian Fierrez, Aythami Morales, Zilong Wang, Tony Ribeiro

Machine learning methods are growing in relevance for biometrics and personal information processing in domains such as forensics, e-health, recruitment, and e-learning.

BIG-bench Machine Learning Explainable Artificial Intelligence (XAI) +1

Shouted Speech Compensation for Speaker Verification Robust to Vocal Effort Conditions

no code implementations6 Aug 2020 Santi Prieto, Alfonso Ortega, Iván López-Espejo, Eduardo Lleida

These compensation techniques are borrowed from the area of robustness for automatic speech recognition and, in this work, we apply them to compensate the mismatch between shouted and normal conditions in speaker verification.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Tied Hidden Factors in Neural Networks for End-to-End Speaker Recognition

no code implementations27 Dec 2018 Antonio Miguel, Jorge Llombart, Alfonso Ortega, Eduardo Lleida

As in Joint Factor Analysis, the model uses tied hidden variables to model speaker and session variability and a MAP adaptation of some of the parameters of the model.

Speaker Recognition Speaker Verification

Differentiable Supervector Extraction for Encoding Speaker and Phrase Information in Text Dependent Speaker Verification

no code implementations22 Dec 2018 Victoria Mingote, Antonio Miguel, Alfonso Ortega, Eduardo Lleida

Moreover, we can apply a convolutional neural network as front-end, and thanks to the alignment process being differentiable, we can train the whole network to produce a supervector for each utterance which will be discriminative with respect to the speaker and the phrase simultaneously.

Text-Dependent Speaker Verification

Cannot find the paper you are looking for? You can Submit a new open access paper.