Search Results for author: Natalia Tomashenko

Found 27 papers, 11 papers with code

ON-TRAC’ systems for the IWSLT 2021 low-resource speech translation and multilingual speech translation shared tasks

no code implementations • ACL (IWSLT) 2021 • Hang Le, Florentin Barbier, Ha Nguyen, Natalia Tomashenko, Salima Mdhaffar, Souhir Gabiche Gahbiche, Benjamin Lecouteux, Didier Schwab, Yannick Estève

This paper describes the ON-TRAC Consortium translation systems developed for two challenge tracks featured in the Evaluation Campaign of IWSLT 2021, low-resource speech translation and multilingual speech translation.

Automatic Speech Recognition (ASR) +3


The VoicePrivacy 2024 Challenge Evaluation Plan

1 code implementation • 3 Apr 2024 • Natalia Tomashenko, Xiaoxiao Miao, Pierre Champion, Sarina Meyer, Xin Wang, Emmanuel Vincent, Michele Panariello, Nicholas Evans, Junichi Yamagishi, Massimiliano Todisco

The task of the challenge is to develop a voice anonymization system for speech data which conceals the speaker's voice identity while protecting linguistic content and emotional states.


LeBenchmark 2.0: a Standardized, Replicable and Enhanced Framework for Self-supervised Representations of French Speech

no code implementations • 11 Sep 2023 • Titouan Parcollet, Ha Nguyen, Solene Evain, Marcely Zanon Boito, Adrien Pupier, Salima Mdhaffar, Hang Le, Sina Alisamir, Natalia Tomashenko, Marco Dinarelli, Shucong Zhang, Alexandre Allauzen, Maximin Coavoux, Yannick Esteve, Mickael Rouvier, Jerome Goulian, Benjamin Lecouteux, Francois Portet, Solange Rossato, Fabien Ringeval, Didier Schwab, Laurent Besacier

Self-supervised learning (SSL) is at the origin of unprecedented improvements in many different domains including computer vision and natural language processing.

Self-Supervised Learning


Federated Learning for ASR based on Wav2vec 2.0

2 code implementations • 20 Feb 2023 • Tuan Nguyen, Salima Mdhaffar, Natalia Tomashenko, Jean-François Bonastre, Yannick Estève

This paper presents a study on the use of federated learning to train an ASR model based on a wav2vec 2.0 model pre-trained by self-supervision.

Federated Learning Language Modelling


The VoicePrivacy 2020 Challenge Evaluation Plan

1 code implementation • 14 May 2022 • Natalia Tomashenko, Brij Mohan Lal Srivastava, Xin Wang, Emmanuel Vincent, Andreas Nautsch, Junichi Yamagishi, Nicholas Evans, Jose Patino, Jean-François Bonastre, Paul-Gauthier Noé, Massimiliano Todisco

The VoicePrivacy Challenge aims to promote the development of privacy preservation tools for speech technology by gathering a new community to define the tasks of interest and the evaluation methodology, and benchmarking solutions through a series of challenges.

Benchmarking


A Study of Gender Impact in Self-supervised Models for Speech-to-Text Systems

no code implementations • 4 Apr 2022 • Marcely Zanon Boito, Laurent Besacier, Natalia Tomashenko, Yannick Estève

These models are pre-trained on unlabeled audio data and then used in speech processing downstream tasks such as automatic speech recognition (ASR) or speech translation (ST).

Automatic Speech Recognition (ASR) +2


The VoicePrivacy 2022 Challenge Evaluation Plan

1 code implementation • 23 Mar 2022 • Natalia Tomashenko, Xin Wang, Xiaoxiao Miao, Hubert Nourtel, Pierre Champion, Massimiliano Todisco, Emmanuel Vincent, Nicholas Evans, Junichi Yamagishi, Jean-François Bonastre

Participants apply their developed anonymization systems, run evaluation scripts and submit objective evaluation results and anonymized speech data to the organizers.

Speaker Verification


Retrieving Speaker Information from Personalized Acoustic Models for Speech Recognition

no code implementations • 7 Nov 2021 • Salima Mdhaffar, Jean-François Bonastre, Marc Tommasi, Natalia Tomashenko, Yannick Estève

The widespread availability of powerful personal devices capable of collecting their users' voices has opened the opportunity to build speaker-adapted automatic speech recognition (ASR) systems or to participate in collaborative learning of ASR.

Speaker Verification speech-recognition +1


Privacy attacks for automatic speech recognition acoustic models in a federated learning framework

no code implementations • 6 Nov 2021 • Natalia Tomashenko, Salima Mdhaffar, Marc Tommasi, Yannick Estève, Jean-François Bonastre

This paper investigates methods to effectively retrieve speaker information from the personalized speaker adapted neural network acoustic models (AMs) in automatic speech recognition (ASR).

Automatic Speech Recognition (ASR) +2


The VoicePrivacy 2020 Challenge: Results and findings

1 code implementation • 1 Sep 2021 • Natalia Tomashenko, Xin Wang, Emmanuel Vincent, Jose Patino, Brij Mohan Lal Srivastava, Paul-Gauthier Noé, Andreas Nautsch, Nicholas Evans, Junichi Yamagishi, Benjamin O'Brien, Anaïs Chanclu, Jean-François Bonastre, Massimiliano Todisco, Mohamed Maouche

We provide a systematic overview of the challenge design with an analysis of submitted systems and evaluation results.


LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech

1 code implementation • 23 Apr 2021 • Solene Evain, Ha Nguyen, Hang Le, Marcely Zanon Boito, Salima Mdhaffar, Sina Alisamir, Ziyi Tong, Natalia Tomashenko, Marco Dinarelli, Titouan Parcollet, Alexandre Allauzen, Yannick Esteve, Benjamin Lecouteux, Francois Portet, Solange Rossato, Fabien Ringeval, Didier Schwab, Laurent Besacier

In this paper, we propose LeBenchmark: a reproducible framework for assessing SSL from speech.

Automatic Speech Recognition (ASR) +6


Speaker anonymisation using the McAdams coefficient

2 code implementations • 2 Nov 2020 • Jose Patino, Natalia Tomashenko, Massimiliano Todisco, Andreas Nautsch, Nicholas Evans

Anonymisation has the goal of manipulating speech signals in order to degrade the reliability of automatic approaches to speaker recognition, while preserving other aspects of speech, such as those relating to intelligibility and naturalness.

Speaker Recognition


Speech Pseudonymisation Assessment Using Voice Similarity Matrices

2 code implementations • 30 Aug 2020 • Paul-Gauthier Noé, Jean-François Bonastre, Driss Matrouf, Natalia Tomashenko, Andreas Nautsch, Nicholas Evans

The proliferation of speech technologies and rising privacy legislation call for the development of privacy preservation solutions for speech applications.

De-identification Voice Similarity


ON-TRAC Consortium for End-to-End and Simultaneous Speech Translation Challenge Tasks at IWSLT 2020

no code implementations • WS 2020 • Maha Elbayad, Ha Nguyen, Fethi Bougares, Natalia Tomashenko, Antoine Caubrière, Benjamin Lecouteux, Yannick Estève, Laurent Besacier

This paper describes the ON-TRAC Consortium translation systems developed for two challenge tracks featured in the Evaluation Campaign of IWSLT 2020, offline speech translation and simultaneous speech translation.

Data Augmentation Translation


The Privacy ZEBRA: Zero Evidence Biometric Recognition Assessment

2 code implementations • 19 May 2020 • Andreas Nautsch, Jose Patino, Natalia Tomashenko, Junichi Yamagishi, Paul-Gauthier Noe, Jean-Francois Bonastre, Massimiliano Todisco, Nicholas Evans

Mounting privacy legislation calls for the preservation of privacy in speech technology, though solutions are gravely lacking.

Cryptography and Security Audio and Speech Processing


Design Choices for X-vector Based Speaker Anonymization

no code implementations • 18 May 2020 • Brij Mohan Lal Srivastava, Natalia Tomashenko, Xin Wang, Emmanuel Vincent, Junichi Yamagishi, Mohamed Maouche, Aurélien Bellet, Marc Tommasi

The recently proposed x-vector based anonymization scheme converts any input voice into that of a random pseudo-speaker.

Speaker Verification


Introducing the VoicePrivacy Initiative

3 code implementations • 4 May 2020 • Natalia Tomashenko, Brij Mohan Lal Srivastava, Xin Wang, Emmanuel Vincent, Andreas Nautsch, Junichi Yamagishi, Nicholas Evans, Jose Patino, Jean-François Bonastre, Paul-Gauthier Noé, Massimiliano Todisco

The VoicePrivacy initiative aims to promote the development of privacy preservation tools for speech technology by gathering a new community to define the tasks of interest and the evaluation methodology, and benchmarking solutions through a series of challenges.

Benchmarking


Exploring Gaussian mixture model framework for speaker adaptation of deep neural network acoustic models

no code implementations • 15 Mar 2020 • Natalia Tomashenko, Yuri Khokhlov, Yannick Esteve

Experimental results on the TED-LIUM corpus show that the proposed adaptation technique can be effectively integrated into DNN and TDNN setups at different levels and provide additional gain in recognition performance: up to 6% of relative word error rate reduction (WERR) over the strong feature-space adaptation techniques based on maximum likelihood linear regression (fMLLR) speaker adapted DNN baseline, and up to 18% of relative WERR in comparison with a speaker independent (SI) DNN baseline model, trained on conventional features.

regression


Dialogue history integration into end-to-end signal-to-concept spoken language understanding systems

no code implementations • 14 Feb 2020 • Natalia Tomashenko, Christian Raymond, Antoine Caubriere, Renato de Mori, Yannick Esteve

The dialog history is represented in the form of dialog history embedding vectors (so-called h-vectors) and is provided as additional information to end-to-end SLU models in order to improve system performance.

Slot Filling +1


ON-TRAC Consortium End-to-End Speech Translation Systems for the IWSLT 2019 Shared Task

no code implementations • EMNLP (IWSLT) 2019 • Ha Nguyen, Natalia Tomashenko, Marcely Zanon Boito, Antoine Caubriere, Fethi Bougares, Mickael Rouvier, Laurent Besacier, Yannick Esteve

This paper describes the ON-TRAC Consortium translation systems developed for the end-to-end model task of IWSLT Evaluation 2019 for the English-to-Portuguese language pair.

Translation


Recent Advances in End-to-End Spoken Language Understanding

no code implementations • 29 Sep 2019 • Natalia Tomashenko, Antoine Caubriere, Yannick Esteve, Antoine Laurent, Emmanuel Morin

This work investigates spoken language understanding (SLU) systems in the scenario when the semantic information is extracted directly from the speech signal by means of a single end-to-end neural network model.

General Classification named-entity-recognition +5


Curriculum d'apprentissage : reconnaissance d'entités nommées pour l'extraction de concepts sémantiques (Curriculum learning: named entity recognition for semantic concept extraction)

no code implementations • JEPTALNRECITAL 2019 • Antoine Caubrière, Natalia Tomashenko, Yannick Estève, Antoine Laurent, Emmanuel Morin

The results show the benefit of using named entity data, which yields a relative gain of up to 6.5%.

Named Entity Recognition +1


Curriculum-based transfer learning for an effective end-to-end spoken language understanding and domain portability

no code implementations • 18 Jun 2019 • Antoine Caubrière, Natalia Tomashenko, Antoine Laurent, Emmanuel Morin, Nathalie Camelin, Yannick Estève

We present an end-to-end approach to extract semantic concepts directly from the speech audio signal.

POS POS Tagging +2


TED-LIUM 3: twice as much data and corpus repartition for experiments on speaker adaptation

3 code implementations • 12 May 2018 • François Hernandez, Vincent Nguyen, Sahar Ghannay, Natalia Tomashenko, Yannick Estève

We present the recent development on Automatic Speech Recognition (ASR) systems in comparison with the two previous releases of the TED-LIUM Corpus from 2012 and 2014.

Automatic Speech Recognition (ASR) +1


Evaluation of Feature-Space Speaker Adaptation for End-to-End Acoustic Models

no code implementations • LREC 2018 • Natalia Tomashenko, Yannick Estève

Automatic Speech Recognition (ASR) Data Augmentation +1


Fast and Accurate OOV Decoder on High-Level Features

no code implementations • 19 Jul 2017 • Yuri Khokhlov, Natalia Tomashenko, Ivan Medennikov, Alexei Romanenko

The proposed approach is based on using high-level features from an automatic speech recognition (ASR) system, so-called phoneme posterior based (PPB) features, for decoding.

Automatic Speech Recognition (ASR) +2


Exploration de paramètres acoustiques dérivés de GMM pour l'adaptation non supervisée de modèles acoustiques à base de réseaux de neurones profonds (Exploring GMM-derived features for unsupervised adaptation of deep neural network acoustic models)

no code implementations • JEPTALNRECITAL 2016 • Natalia Tomashenko, Yuri Khokhlov, Anthony Larcher, Yannick Estève

The study presented in this article improves a recently proposed method for adapting Markov acoustic models coupled with a deep neural network (DNN-HMM).

