Search Results for author: Benoit Sagot

Found 10 papers, 0 papers with code

XLS-R fine-tuning on noisy word boundaries for unsupervised speech segmentation into words

no code implementations8 Oct 2023 Robin Algayres, Pablo Diego-Simon, Benoit Sagot, Emmanuel Dupoux

Due to the absence of explicit word boundaries in the speech stream, the task of segmenting spoken sentences into word units without text supervision is particularly challenging.

Speech Sequence Embeddings using Nearest Neighbors Contrastive Learning

no code implementations11 Apr 2022 Robin Algayres, Adel Nabli, Benoit Sagot, Emmanuel Dupoux

We introduce a simple neural encoder architecture that can be trained using an unsupervised contrastive learning objective which gets its positive samples from data-augmented k-Nearest Neighbors search.

Contrastive Learning

Are discrete units necessary for Spoken Language Modeling?

no code implementations11 Mar 2022 Tu Anh Nguyen, Benoit Sagot, Emmanuel Dupoux

The approach relies first on transforming the audio into a sequence of discrete units (or pseudo-text) and then training a language model directly on such pseudo-text.

Language Modelling

Evaluating the reliability of acoustic speech embeddings

no code implementations27 Jul 2020 Robin Algayres, Mohamed Salah Zaiem, Benoit Sagot, Emmanuel Dupoux

However, there is currently no clear methodology to compare or optimise the quality of these embeddings in a task-neutral way.

Information Retrieval Retrieval

Enhancing BERT for Lexical Normalization

no code implementations WS 2019 Benjamin Muller, Benoit Sagot, Djam{\'e} Seddah

In this article, focusing on User Generated Content (UGC), we study the ability of BERT to perform lexical normalisation.

Language Modelling Lexical Normalization

Cannot find the paper you are looking for? You can Submit a new open access paper.