Search Results for author: Victoria Mingote

Found 6 papers, 0 papers with code

Direct Text to Speech Translation System using Acoustic Units

no code implementations • 14 Sep 2023 • Victoria Mingote, Pablo Gimeno, Luis Vicente, Sameer Khurana, Antoine Laurent, Jarod Duret

This framework takes text in different source languages as input and generates speech in the target language without requiring text transcriptions in that language.

Speech-to-Speech Translation, Text-to-Speech Translation, +1

Improved Cross-Lingual Transfer Learning For Automatic Speech Translation

no code implementations • 1 Jun 2023 • Sameer Khurana, Nauman Dawalatabad, Antoine Laurent, Luis Vicente, Pablo Gimeno, Victoria Mingote, James Glass

Having a single model that supports multiple translation tasks is desirable.

Cross-Lingual Transfer, Knowledge Distillation, +4

Class Token and Knowledge Distillation for Multi-head Self-Attention Speaker Verification Systems

no code implementations • 6 Nov 2021 • Victoria Mingote, Antonio Miguel, Alfonso Ortega, Eduardo Lleida

This paper explores three novel approaches to improve the performance of speaker verification (SV) systems based on deep neural networks (DNN) using Multi-head Self-Attention (MSA) mechanisms and memory layers.

Knowledge Distillation, Philosophy, +1

Generalizing AUC Optimization to Multiclass Classification for Audio Segmentation With Limited Training Data

no code implementations • 27 Oct 2021 • Pablo Gimeno, Victoria Mingote, Alfonso Ortega, Antonio Miguel, Eduardo Lleida

Area under the ROC curve (AUC) optimisation techniques developed for neural networks have recently demonstrated their capabilities in different audio and speech related tasks.

Segmentation

Optimization of the Area Under the ROC Curve using Neural Network Supervectors for Text-Dependent Speaker Verification

no code implementations • 31 Jan 2019 • Victoria Mingote, Antonio Miguel, Alfonso Ortega, Eduardo Lleida

This paper explores two techniques to improve the performance of text-dependent speaker verification systems based on deep neural networks.

Text-Dependent Speaker Verification

Differentiable Supervector Extraction for Encoding Speaker and Phrase Information in Text Dependent Speaker Verification

no code implementations • 22 Dec 2018 • Victoria Mingote, Antonio Miguel, Alfonso Ortega, Eduardo Lleida

A convolutional neural network can be applied as the front-end and, because the alignment process is differentiable, the whole network can be trained to produce a supervector for each utterance that is discriminative with respect to both the speaker and the phrase.

Text-Dependent Speaker Verification
