Search Results for author: Yannick Estève

Found 32 papers, 5 papers with code

Impact Analysis of the Use of Speech and Language Models Pretrained by Self-Supervision for Spoken Language Understanding

no code implementations • LREC 2022 • Salima Mdhaffar, Valentin Pelloin, Antoine Caubrière, Gaëlle Laperrière, Sahar Ghannay, Bassam Jabaian, Nathalie Camelin, Yannick Estève

Pretrained models through self-supervised learning have been recently introduced for both acoustic and language modeling.

Language Modelling Self-Supervised Learning +3

ON-TRAC’ systems for the IWSLT 2021 low-resource speech translation and multilingual speech translation shared tasks

no code implementations • ACL (IWSLT) 2021 • Hang Le, Florentin Barbier, Ha Nguyen, Natalia Tomashenko, Salima Mdhaffar, Souhir Gabiche Gahbiche, Benjamin Lecouteux, Didier Schwab, Yannick Estève

This paper describes the ON-TRAC Consortium translation systems developed for two challenge tracks featured in the Evaluation Campaign of IWSLT 2021, low-resource speech translation and multilingual speech translation.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

The Spoken Language Understanding MEDIA Benchmark Dataset in the Era of Deep Learning: data updates, training and evaluation tools

no code implementations • LREC 2022 • Gaëlle Laperrière, Valentin Pelloin, Antoine Caubrière, Salima Mdhaffar, Nathalie Camelin, Sahar Ghannay, Bassam Jabaian, Yannick Estève

In this paper, we focus on the French MEDIA SLU dataset, distributed since 2005 and used as a benchmark dataset for a large number of research works.

Intent Detection Spoken Language Understanding

Findings of the IWSLT 2022 Evaluation Campaign

no code implementations • IWSLT (ACL) 2022 • Antonios Anastasopoulos, Loïc Barrault, Luisa Bentivogli, Marcely Zanon Boito, Ondřej Bojar, Roldano Cattoni, Anna Currey, Georgiana Dinu, Kevin Duh, Maha Elbayad, Clara Emmanuel, Yannick Estève, Marcello Federico, Christian Federmann, Souhir Gahbiche, Hongyu Gong, Roman Grundkiewicz, Barry Haddow, Benjamin Hsu, Dávid Javorský, Věra Kloudová, Surafel Lakew, Xutai Ma, Prashant Mathur, Paul McNamee, Kenton Murray, Maria Nădejde, Satoshi Nakamura, Matteo Negri, Jan Niehues, Xing Niu, John Ortega, Juan Pino, Elizabeth Salesky, Jiatong Shi, Matthias Sperber, Sebastian Stüker, Katsuhito Sudoh, Marco Turchi, Yogesh Virkar, Alexander Waibel, Changhan Wang, Shinji Watanabe

The evaluation campaign of the 19th International Conference on Spoken Language Translation featured eight shared tasks: (i) Simultaneous speech translation, (ii) Offline speech translation, (iii) Speech to speech translation, (iv) Low-resource speech translation, (v) Multilingual speech translation, (vi) Dialect speech translation, (vii) Formality control for speech translation, (viii) Isometric speech translation.

Speech-to-Speech Translation Translation

Is one brick enough to break the wall of spoken dialogue state tracking?

no code implementations • 3 Nov 2023 • Lucas Druart, Valentin Vielzeuf, Yannick Estève

In Task-Oriented Dialogue (TOD) systems, correctly updating the system's understanding of the user's needs (a.k.a. dialogue state tracking) is key to a smooth interaction.

Dialogue State Tracking

Enhancing expressivity transfer in textless speech-to-speech translation

no code implementations • 11 Oct 2023 • Jarod Duret, Benjamin O'Brien, Yannick Estève, Titouan Parcollet

Textless speech-to-speech translation systems are rapidly advancing, thanks to the integration of self-supervised learning techniques.

Self-Supervised Learning Speech-to-Speech Translation +1

Acoustic and linguistic representations for speech continuous emotion recognition in call center conversations

no code implementations • 6 Oct 2023 • Manon Macary, Marie Tahon, Yannick Estève, Daniel Luzzati

In the context of telephone conversations, we can break down the audio information into acoustic and linguistic by using the speech signal and its transcription.

Emotion Recognition Transfer Learning

Semantic enrichment towards efficient speech representations

no code implementations • 3 Jul 2023 • Gaëlle Laperrière, Ha Nguyen, Sahar Ghannay, Bassam Jabaian, Yannick Estève

Over the past few years, self-supervised learned speech representations have emerged as fruitful replacements for conventional surface representations when solving Spoken Language Understanding (SLU) tasks.

Spoken Language Understanding

Learning Multilingual Expressive Speech Representation for Prosody Prediction without Parallel Data

no code implementations • 29 Jun 2023 • Jarod Duret, Titouan Parcollet, Yannick Estève

We propose a method for speech-to-speech emotion-preserving translation that operates at the level of discrete speech units.

Machine Translation Prosody Prediction +2

Some voices are too common: Building fair speech recognition systems using the Common Voice dataset

no code implementations • 1 Jun 2023 • Lucas Maison, Yannick Estève

Automatic speech recognition (ASR) systems become increasingly efficient thanks to new advances in neural network training like self-supervised learning.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

OLISIA: a Cascade System for Spoken Dialogue State Tracking

1 code implementation • 20 Apr 2023 • Léo Jacqmin, Lucas Druart, Yannick Estève, Benoît Favre, Lina Maria Rojas-Barahona, Valentin Vielzeuf

Though Dialogue State Tracking (DST) is a core component of spoken dialogue systems, recent work on this task mostly deals with chat corpora, disregarding the discrepancies between spoken and written language. In this paper, we propose OLISIA, a cascade system which integrates an Automatic Speech Recognition (ASR) model and a DST model.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

Improving Accented Speech Recognition with Multi-Domain Training

no code implementations • 14 Mar 2023 • Lucas Maison, Yannick Estève

Thanks to the rise of self-supervised learning, automatic speech recognition (ASR) systems now achieve near-human performance on a wide variety of datasets.

Accented Speech Recognition Automatic Speech Recognition +3

Federated Learning for ASR based on Wav2vec 2.0

2 code implementations • 20 Feb 2023 • Tuan Nguyen, Salima Mdhaffar, Natalia Tomashenko, Jean-François Bonastre, Yannick Estève

This paper presents a study on the use of federated learning to train an ASR model based on a wav2vec 2.0 model pre-trained by self-supervision.

Federated Learning Language Modelling

On the Use of Semantically-Aligned Speech Representations for Spoken Language Understanding

no code implementations • 11 Oct 2022 • Gaëlle Laperrière, Valentin Pelloin, Mickaël Rouvier, Themos Stafylakis, Yannick Estève

In this paper we examine the use of semantically-aligned speech representations for end-to-end spoken language understanding (SLU).

Sentence Sentence Embedding +2

ON-TRAC Consortium Systems for the IWSLT 2022 Dialect and Low-resource Speech Translation Tasks

no code implementations • IWSLT (ACL) 2022 • Marcely Zanon Boito, John Ortega, Hugo Riguidel, Antoine Laurent, Loïc Barrault, Fethi Bougares, Firas Chaabani, Ha Nguyen, Florentin Barbier, Souhir Gahbiche, Yannick Estève

This paper describes the ON-TRAC Consortium translation systems developed for two challenge tracks featured in the Evaluation Campaign of IWSLT 2022: low-resource and dialect speech translation.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

A Study of Gender Impact in Self-supervised Models for Speech-to-Text Systems

no code implementations • 4 Apr 2022 • Marcely Zanon Boito, Laurent Besacier, Natalia Tomashenko, Yannick Estève

Self-supervised models are pre-trained on unlabeled audio data and then used in speech processing downstream tasks such as automatic speech recognition (ASR) or speech translation (ST).

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

End-to-end model for named entity recognition from speech without paired training data

no code implementations • 2 Apr 2022 • Salima Mdhaffar, Jarod Duret, Titouan Parcollet, Yannick Estève

Our approach is based on the use of an external model trained to generate a sequence of vectorial representations from text.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +5

Speech Resources in the Tamasheq Language

1 code implementation • LREC 2022 • Marcely Zanon Boito, Fethi Bougares, Florentin Barbier, Souhir Gahbiche, Loïc Barrault, Mickaël Rouvier, Yannick Estève

In this paper we present two datasets for Tamasheq, a developing language mainly spoken in Mali and Niger.

Translation

Retrieving Speaker Information from Personalized Acoustic Models for Speech Recognition

no code implementations • 7 Nov 2021 • Salima Mdhaffar, Jean-François Bonastre, Marc Tommasi, Natalia Tomashenko, Yannick Estève

The widespread availability of powerful personal devices capable of collecting their users' voices has opened the opportunity to build speaker-adapted automatic speech recognition (ASR) systems or to participate in collaborative learning of ASR.

Speaker Verification speech-recognition +1

Privacy attacks for automatic speech recognition acoustic models in a federated learning framework

no code implementations • 6 Nov 2021 • Natalia Tomashenko, Salima Mdhaffar, Marc Tommasi, Yannick Estève, Jean-François Bonastre

This paper investigates methods to effectively retrieve speaker information from the personalized speaker adapted neural network acoustic models (AMs) in automatic speech recognition (ASR).

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Where are we in semantic concept extraction for Spoken Language Understanding?

no code implementations • 24 Jun 2021 • Sahar Ghannay, Antoine Caubrière, Salima Mdhaffar, Gaëlle Laperrière, Bassam Jabaian, Yannick Estève

More recent works on self-supervised training with unlabeled data open new perspectives in terms of performance for automatic speech recognition and natural language processing.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +7

Impact of Encoding and Segmentation Strategies on End-to-End Simultaneous Speech Translation

no code implementations • 29 Apr 2021 • Ha Nguyen, Yannick Estève, Laurent Besacier

Boosted by the simultaneous translation shared task at IWSLT 2020, promising end-to-end online speech translation approaches were recently proposed.

Translation

An Empirical Study of End-to-end Simultaneous Speech Translation Decoding Strategies

no code implementations • 4 Mar 2021 • Ha Nguyen, Yannick Estève, Laurent Besacier

This paper proposes a decoding strategy for end-to-end simultaneous speech translation.

Translation

End2End Acoustic to Semantic Transduction

no code implementations • 1 Feb 2021 • Valentin Pelloin, Nathalie Camelin, Antoine Laurent, Renato de Mori, Antoine Caubrière, Yannick Estève, Sylvain Meignier

In this paper, we propose a novel end-to-end sequence-to-sequence spoken language understanding model using an attention mechanism.

Language Modelling Spoken Language Understanding

On the use of Self-supervised Pre-trained Acoustic and Linguistic Features for Continuous Speech Emotion Recognition

no code implementations • 18 Nov 2020 • Manon Macary, Marie Tahon, Yannick Estève, Anthony Rousseau

Pre-training for feature extraction is an increasingly studied approach to get better continuous representations of audio and text content.

Speech Emotion Recognition

Leverage Unlabeled Data for Abstractive Speech Summarization with Self-Supervised Learning and Back-Summarization

no code implementations • 30 Jul 2020 • Paul Tardy, Louis de Seynes, François Hernandez, Vincent Nguyen, David Janiszek, Yannick Estève

In order to build a corpus for this task, it is necessary to obtain the (automatic or manual) transcription of each meeting, and then to segment and align it with the corresponding manual report to produce training examples suitable for training.

Abstractive Text Summarization Denoising +2

Align then Summarize: Automatic Alignment Methods for Summarization Corpus Creation

2 code implementations • LREC 2020 • Paul Tardy, David Janiszek, Yannick Estève, Vincent Nguyen

We report automatic alignment and summarization performances on this corpus and show that automatic alignment is relevant for data annotation since it leads to large improvement of almost +4 on all ROUGE scores on the summarization task.

Meeting Summarization Text Summarization

ON-TRAC Consortium for End-to-End and Simultaneous Speech Translation Challenge Tasks at IWSLT 2020

no code implementations • WS 2020 • Maha Elbayad, Ha Nguyen, Fethi Bougares, Natalia Tomashenko, Antoine Caubrière, Benjamin Lecouteux, Yannick Estève, Laurent Besacier

This paper describes the ON-TRAC Consortium translation systems developed for two challenge tracks featured in the Evaluation Campaign of IWSLT 2020, offline speech translation and simultaneous speech translation.

Data Augmentation Translation

Curriculum-based transfer learning for an effective end-to-end spoken language understanding and domain portability

no code implementations • 18 Jun 2019 • Antoine Caubrière, Natalia Tomashenko, Antoine Laurent, Emmanuel Morin, Nathalie Camelin, Yannick Estève

We present an end-to-end approach to extract semantic concepts directly from the speech audio signal.

POS POS Tagging +2

End-to-end named entity extraction from speech

no code implementations • 30 May 2018 • Sahar Ghannay, Antoine Caubrière, Yannick Estève, Antoine Laurent, Emmanuel Morin

Until now, named entity recognition (NER) from speech has been performed through a pipeline that first runs automatic speech recognition (ASR) on the audio and then applies NER to the ASR outputs.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +5

TED-LIUM 3: twice as much data and corpus repartition for experiments on speaker adaptation

3 code implementations • 12 May 2018 • François Hernandez, Vincent Nguyen, Sahar Ghannay, Natalia Tomashenko, Yannick Estève

We present the recent development on Automatic Speech Recognition (ASR) systems in comparison with the two previous releases of the TED-LIUM Corpus from 2012 and 2014.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

ASR error management for improving spoken language understanding

no code implementations • 26 May 2017 • Edwin Simonnet, Sahar Ghannay, Nathalie Camelin, Yannick Estève, Renato de Mori

This paper addresses the problem of automatic speech recognition (ASR) error detection and their use for improving spoken language understanding (SLU) systems.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4
