2 code implementations • 23 Nov 2020 • Tu Anh Nguyen, Maureen de Seyssel, Patricia Rozé, Morgane Rivière, Evgeny Kharitonov, Alexei Baevski, Ewan Dunbar, Emmanuel Dupoux
We introduce a new unsupervised task, spoken language modeling: the learning of linguistic representations from raw audio signals without any labels, along with the Zero Resource Speech Benchmark 2021: a suite of 4 black-box, zero-shot metrics probing for the quality of the learned models at 4 linguistic levels: phonetics, lexicon, syntax and semantics.
1 code implementation • 21 Dec 2023 • Maureen de Seyssel, Antony D'Avirro, Adina Williams, Emmanuel Dupoux
We introduce EmphAssess, a prosodic benchmark designed to evaluate the capability of speech-to-speech models to encode and reproduce prosodic emphasis.
no code implementations • 6 Jul 2019 • Estelle Maudet, Oralie Cattan, Maureen de Seyssel, Christophe Servan
For this task, we propose an approach based on language models and evaluate the impact of different preprocessing and matching techniques on the results.
no code implementations • JEPTALNRECITAL 2019 • Estelle Maudet, Oralie Cattan, Maureen de Seyssel, Christophe Servan
To solve this task, we propose an approach based on language models and evaluate the impact of different preprocessing steps and different matching techniques on the results.
no code implementations • 29 Apr 2021 • Ewan Dunbar, Mathieu Bernard, Nicolas Hamilakis, Tu Anh Nguyen, Maureen de Seyssel, Patricia Rozé, Morgane Rivière, Eugene Kharitonov, Emmanuel Dupoux
We present the Zero Resource Speech Challenge 2021, which asks participants to learn a language model directly from audio, without any text or labels.
no code implementations • 30 Mar 2022 • Maureen de Seyssel, Marvin Lavechin, Yossi Adi, Emmanuel Dupoux, Guillaume Wisniewski
Language information, however, is very salient in the bilingual model only, suggesting CPC models learn to discriminate languages when trained on multiple languages.
no code implementations • 27 Jun 2022 • Maureen de Seyssel, Guillaume Wisniewski, Emmanuel Dupoux
According to the Language Familiarity Effect (LFE), people are better at discriminating between speakers of their native language.
no code implementations • 6 Oct 2022 • Tu Anh Nguyen, Maureen de Seyssel, Robin Algayres, Patricia Roze, Ewan Dunbar, Emmanuel Dupoux
However, word boundary information may be absent or unreliable in the case of speech input (word boundaries are not marked explicitly in the speech stream).
no code implementations • 23 Feb 2023 • Maureen de Seyssel, Marvin Lavechin, Hadrien Titeux, Arthur Thomas, Gwendal Virlet, Andrea Santos Revilla, Guillaume Wisniewski, Bogdan Ludusan, Emmanuel Dupoux
In the lexical task, the model needs to correctly distinguish between pauses inserted between words and within words.