Search Results for author: Maureen de Seyssel

Found 9 papers, 2 papers with code

Qwant Research @DEFT 2019 : appariement de documents et extraction d'informations \`a partir de cas cliniques (Document matching and information retrieval using clinical cases)

no code implementations JEPTALNRECITAL 2019 Estelle Maudet, Oralie Cattan, Maureen de Seyssel, Christophe Servan

Pour r{\'e}soudre cette t{\^a}che, nous proposons une approche reposant sur des mod{\`e}les de langue et {\'e}valuons l{'}impact de diff{\'e}rents pr{\'e}-traitements et de diff{\'e}rentes techniques d{'}appariement sur les r{\'e}sultats.

Information Retrieval Retrieval

Qwant Research @DEFT 2019: Document matching and information retrieval using clinical cases

no code implementations6 Jul 2019 Estelle Maudet, Oralie Cattan, Maureen de Seyssel, Christophe Servan

For this task, we propose an approach based on language models and evaluate the impact on the results of different preprocessings and matching techniques.

Information Retrieval Retrieval +3

The Zero Resource Speech Benchmark 2021: Metrics and baselines for unsupervised spoken language modeling

2 code implementations23 Nov 2020 Tu Anh Nguyen, Maureen de Seyssel, Patricia Rozé, Morgane Rivière, Evgeny Kharitonov, Alexei Baevski, Ewan Dunbar, Emmanuel Dupoux

We introduce a new unsupervised task, spoken language modeling: the learning of linguistic representations from raw audio signals without any labels, along with the Zero Resource Speech Benchmark 2021: a suite of 4 black-box, zero-shot metrics probing for the quality of the learned models at 4 linguistic levels: phonetics, lexicon, syntax and semantics.

Clustering Language Modelling +1

The Zero Resource Speech Challenge 2021: Spoken language modelling

no code implementations29 Apr 2021 Ewan Dunbar, Mathieu Bernard, Nicolas Hamilakis, Tu Anh Nguyen, Maureen de Seyssel, Patricia Rozé, Morgane Rivière, Eugene Kharitonov, Emmanuel Dupoux

We present the Zero Resource Speech Challenge 2021, which asks participants to learn a language model directly from audio, without any text or labels.

Language Modelling

Probing phoneme, language and speaker information in unsupervised speech representations

no code implementations30 Mar 2022 Maureen de Seyssel, Marvin Lavechin, Yossi Adi, Emmanuel Dupoux, Guillaume Wisniewski

Language information, however, is very salient in the bilingual model only, suggesting CPC models learn to discriminate languages when trained on multiple languages.

Language Modelling

Is the Language Familiarity Effect gradual? A computational modelling approach

no code implementations27 Jun 2022 Maureen de Seyssel, Guillaume Wisniewski, Emmanuel Dupoux

According to the Language Familiarity Effect (LFE), people are better at discriminating between speakers of their native language.

Are word boundaries useful for unsupervised language learning?

no code implementations6 Oct 2022 Tu Anh Nguyen, Maureen de Seyssel, Robin Algayres, Patricia Roze, Ewan Dunbar, Emmanuel Dupoux

However, word boundary information may be absent or unreliable in the case of speech input (word boundaries are not marked explicitly in the speech stream).

EmphAssess : a Prosodic Benchmark on Assessing Emphasis Transfer in Speech-to-Speech Models

1 code implementation21 Dec 2023 Maureen de Seyssel, Antony D'Avirro, Adina Williams, Emmanuel Dupoux

We introduce EmphAssess, a prosodic benchmark designed to evaluate the capability of speech-to-speech models to encode and reproduce prosodic emphasis.

Resynthesis Speech-to-Speech Translation +1

Cannot find the paper you are looking for? You can Submit a new open access paper.