Search Results for author: Titouan Parcollet

Found 35 papers, 19 papers with code

Enhancing expressivity transfer in textless speech-to-speech translation

no code implementations11 Oct 2023 Jarod Duret, Benjamin O'Brien, Yannick Estève, Titouan Parcollet

Textless speech-to-speech translation systems are rapidly advancing, thanks to the integration of self-supervised learning techniques.

Self-Supervised Learning Speech-to-Speech Translation +1

Speech Self-Supervised Representations Benchmarking: a Case for Larger Probing Heads

no code implementations28 Aug 2023 Salah Zaiem, Youcef Kemiche, Titouan Parcollet, Slim Essid, Mirco Ravanelli

Self-supervised learning (SSL) leverages large datasets of unlabeled speech to reach impressive performance with reduced amounts of annotated data.

Benchmarking Self-Supervised Learning

SummaryMixing: A Linear-Complexity Alternative to Self-Attention for Speech Recognition and Understanding

1 code implementation12 Jul 2023 Titouan Parcollet, Rogier Van Dalen, Shucong Zhang, Sourav Bhattacharya

Unfortunately, token mixing with self-attention takes quadratic time in the length of the speech utterance, slowing down inference as well as training and increasing memory consumption.

speech-recognition Speech Recognition

Speech Self-Supervised Representation Benchmarking: Are We Doing it Right?

1 code implementation1 Jun 2023 Salah Zaiem, Youcef Kemiche, Titouan Parcollet, Slim Essid, Mirco Ravanelli

Self-supervised learning (SSL) has recently allowed leveraging large datasets of unlabeled speech signals to reach impressive performance on speech tasks using only small amounts of annotated data.

Benchmarking Self-Supervised Learning

Automatic Data Augmentation for Domain Adapted Fine-Tuning of Self-Supervised Speech Representations

no code implementations1 Jun 2023 Salah Zaiem, Titouan Parcollet, Slim Essid

Self-Supervised Learning (SSL) has allowed leveraging large amounts of unlabeled speech data to improve the performance of speech recognition models even with small annotated datasets.

Data Augmentation Domain Adaptation +3

HyperConformer: Multi-head HyperMixer for Efficient Speech Recognition

2 code implementations29 May 2023 Florian Mai, Juan Zuluaga-Gomez, Titouan Parcollet, Petr Motlicek

In particular, multi-head HyperConformer achieves comparable or higher recognition performance while being more efficient than Conformer in terms of inference speed, memory, parameter count, and available training data.

speech-recognition Speech Recognition

Stabilising and accelerating light gated recurrent units for automatic speech recognition

no code implementations16 Feb 2023 Adel Moumen, Titouan Parcollet

The light gated recurrent units (Li-GRU) is well-known for achieving impressive results in automatic speech recognition (ASR) tasks while being lighter and faster to train than a standard gated recurrent units (GRU).

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

ZeroFL: Efficient On-Device Training for Federated Learning with Local Sparsity

no code implementations ICLR 2022 Xinchi Qiu, Javier Fernandez-Marques, Pedro PB Gusmao, Yan Gao, Titouan Parcollet, Nicholas Donald Lane

When the available hardware cannot meet the memory and compute requirements to efficiently train high performing machine learning models, a compromise in either the training quality or the model complexity is needed.

Federated Learning

Automatic Data Augmentation Selection and Parametrization in Contrastive Self-Supervised Speech Representation Learning

1 code implementation8 Apr 2022 Salah Zaiem, Titouan Parcollet, Slim Essid

Thus, this work introduces a conditional independance-based method which allows for automatically selecting a suitable distribution on the choice of augmentations and their parametrization from a set of predefined ones, for contrastive self-supervised pre-training.

Contrastive Learning Data Augmentation +1

Pretext Tasks selection for multitask self-supervised speech representation learning

1 code implementation1 Jul 2021 Salah Zaiem, Titouan Parcollet, Slim Essid, Abdel Heba

Through solving pretext tasks, self-supervised learning leverages unlabeled data to extract useful latent representations replacing traditional input features in the downstream task.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +5

End-to-End Speech Recognition from Federated Acoustic Models

1 code implementation29 Apr 2021 Yan Gao, Titouan Parcollet, Salah Zaiem, Javier Fernandez-Marques, Pedro P. B. de Gusmao, Daniel J. Beutel, Nicholas D. Lane

Training Automatic Speech Recognition (ASR) models under federated learning (FL) settings has attracted a lot of attention recently.

2k 4k +4

Conditional independence for pretext task selection in Self-supervised speech representation learning

1 code implementation15 Apr 2021 Salah Zaiem, Titouan Parcollet, Slim Essid

Through solving pretext tasks, self-supervised learning (SSL) leverages unlabeled data to extract useful latent representations replacing traditional input features in the downstream task.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +5

On-device Federated Learning with Flower

no code implementations7 Apr 2021 Akhil Mathur, Daniel J. Beutel, Pedro Porto Buarque de Gusmão, Javier Fernandez-Marques, Taner Topal, Xinchi Qiu, Titouan Parcollet, Yan Gao, Nicholas D. Lane

Federated Learning (FL) allows edge devices to collaboratively learn a shared prediction model while keeping their training data on the device, thereby decoupling the ability to do machine learning from the need to store data in the cloud.

BIG-bench Machine Learning Federated Learning

Timers and Such: A Practical Benchmark for Spoken Language Understanding with Numbers

2 code implementations4 Apr 2021 Loren Lugosch, Piyush Papreja, Mirco Ravanelli, Abdelwahab Heba, Titouan Parcollet

This paper introduces Timers and Such, a new open source dataset of spoken English commands for common voice control use cases involving numbers.

Ranked #4 on Spoken Language Understanding on Timers and Such (using extra training data)

Spoken Language Understanding

A first look into the carbon footprint of federated learning

no code implementations15 Feb 2021 Xinchi Qiu, Titouan Parcollet, Javier Fernandez-Marques, Pedro Porto Buarque de Gusmao, Yan Gao, Daniel J. Beutel, Taner Topal, Akhil Mathur, Nicholas D. Lane

Despite impressive results, deep learning-based technologies also raise severe privacy and environmental concerns induced by the training procedure often conducted in data centers.

Federated Learning

Adversarial Disentanglement of Speaker Representation for Attribute-Driven Privacy Preservation

1 code implementation8 Dec 2020 Paul-Gauthier Noé, Mohammad Mohammadamini, Driss Matrouf, Titouan Parcollet, Andreas Nautsch, Jean-François Bonastre

In order to allow the user to choose which information to protect, we introduce in this paper the concept of attribute-driven privacy preservation in speaker voice representation.

Attribute Disentanglement +6

Flower: A Friendly Federated Learning Research Framework

1 code implementation28 Jul 2020 Daniel J. Beutel, Taner Topal, Akhil Mathur, Xinchi Qiu, Javier Fernandez-Marques, Yan Gao, Lorenzo Sani, Kwing Hei Li, Titouan Parcollet, Pedro Porto Buarque de Gusmão, Nicholas D. Lane

Federated Learning (FL) has emerged as a promising technique for edge devices to collaboratively learn a shared prediction model, while keeping their training data on the device, thereby decoupling the ability to do machine learning from the need to store the data in the cloud.

Federated Learning

Distilling Knowledge from Ensembles of Acoustic Models for Joint CTC-Attention End-to-End Speech Recognition

3 code implementations19 May 2020 Yan Gao, Titouan Parcollet, Nicholas Lane

In the specific context of Automatic Speech Recognition (ASR), distillation from ensembles of acoustic models has recently shown promising results in increasing recognition performance.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Quaternion Neural Networks for Multi-channel Distant Speech Recognition

1 code implementation18 May 2020 Xinchi Qiu, Titouan Parcollet, Mirco Ravanelli, Nicholas Lane, Mohamed Morchid

In this paper, we propose to capture these inter- and intra- structural dependencies with quaternion neural networks, which can jointly process multiple signals as whole quaternion entities.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Real to H-space Encoder for Speech Recognition

no code implementations17 Jun 2019 Titouan Parcollet, Mohamed Morchid, Georges Linarès, Renato de Mori

Deep neural networks (DNNs) and more precisely recurrent neural networks (RNNs) are at the core of modern automatic speech recognition systems, due to their efficiency to process input sequences.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

The PyTorch-Kaldi Speech Recognition Toolkit

11 code implementations19 Nov 2018 Mirco Ravanelli, Titouan Parcollet, Yoshua Bengio

Experiments, that are conducted on several datasets and tasks, show that PyTorch-Kaldi can effectively be used to develop modern state-of-the-art speech recognizers.

Distant Speech Recognition Noisy Speech Recognition

Quaternion Convolutional Neural Networks for End-to-End Automatic Speech Recognition

1 code implementation20 Jun 2018 Titouan Parcollet, Ying Zhang, Mohamed Morchid, Chiheb Trabelsi, Georges Linarès, Renato De Mori, Yoshua Bengio

Quaternion numbers and quaternion neural networks have shown their efficiency to process multidimensional inputs as entities, to encode internal dependencies, and to solve many tasks with less learning parameters than real-valued models.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Quaternion Recurrent Neural Networks

3 code implementations ICLR 2019 Titouan Parcollet, Mirco Ravanelli, Mohamed Morchid, Georges Linarès, Chiheb Trabelsi, Renato de Mori, Yoshua Bengio

Recurrent neural networks (RNNs) are powerful architectures to model sequential data, due to their capability to learn short and long-term dependencies between the basic elements of a sequence.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Cannot find the paper you are looking for? You can Submit a new open access paper.