Search Results for author: Julien Pinquier

Found 14 papers, 2 papers with code

Killing two birds with one stone: Can an audio captioning system also be used for audio-text retrieval?

no code implementations • 29 Aug 2023 • Etienne Labbé, Thomas Pellegrini, Julien Pinquier

For ATR, we propose using the standard Cross-Entropy loss values obtained for any audio/caption pair.

Paper
Add Code

Is my automatic audio captioning system so bad? spider-max: a metric to consider several caption candidates

1 code implementation • 14 Nov 2022 • Etienne Labbé, Thomas Pellegrini, Julien Pinquier

For this reason, several complementary metrics, such as BLEU, CIDEr, SPICE and SPIDEr, are used to compare a single automatic caption to one or several captions of reference, produced by a human annotator.

AudioCaps Audio captioning +3

Paper
Code

Audio-video fusion strategies for active speaker detection in meetings

no code implementations • 9 Jun 2022 • Lionel Pibre, Francisco Madrigal, Cyrille Equoy, Frédéric Lerasle, Thomas Pellegrini, Julien Pinquier, Isabelle Ferrané

In this paper, we propose two different types of fusion for the detection of the active speaker, combining two visual modalities and an audio modality through neural networks.

Management Optical Flow Estimation +2

Paper
Add Code

End-to-end acoustic modelling for phone recognition of young readers

no code implementations • 4 Mar 2021 • Lucile Gelin, Morgane Daniel, Julien Pinquier, Thomas Pellegrini

Through transfer learning, a Transformer model complemented with a Connectionist Temporal Classification (CTC) objective function, reaches a phone error rate of 28. 1%, outperforming a state-of-the-art DNN-HMM model by 6. 6% relative, as well as other end-to-end architectures by more than 8. 5% relative.

Acoustic Modelling Transfer Learning

Paper
Add Code

Une nouvelle mesure de la r\'everb\'eration pour pr\'edire les performances a priori de la transcription de la parole (A new reverberation measure to predict a priori ASR performance)

no code implementations • JEPTALNRECITAL 2020 • S{\'e}bastien Ferreira, J{\'e}r{\^o}me Farinas, Julien Pinquier, Julie Mauclair, St{\'e}phane Rabant

Dans cette {\'e}tude, nous explorons la pr{\'e}diction a priori de la qualit{\'e} de la transcription automatique de la parole dans le cas de la parole r{\'e}verb{\'e}r{\'e}e enregistr{\'e}e avec un seul microphone.

Paper
Add Code

Reconnaissance de phones fond\'ee sur du Transfer Learning pour des enfants apprenants lecteurs en environnement de classe (Transfer Learning based phone recognition on children learning to read, with speech recorded in a classroom environment)

no code implementations • JEPTALNRECITAL 2020 • Lucile Gelin, Morgane Daniel, Thomas Pellegrini, Julien Pinquier

A conditions {\'e}gales, les performances actuelles de la reconnaissance vocale pour enfants sont inf{\'e}rieures {\`a} celles des syst{\`e}mes pour adultes.

Transfer Learning

Paper
Add Code

\'Etude des facteurs affectant la compr\'ehensibilit\'e de documents multimodaux : une \'etude exp\'erimentale (Factors affecting the comprehensibility of multimodal documents : an experimental study )

no code implementations • JEPTALNRECITAL 2020 • R, Estelle ria, Lionel Fontan, Maxime Le Coz, Isabelle Ferran{\'e}, Julien Pinquier

La compr{\'e}hensibilit{\'e} de documents audiovisuels peut d{\'e}pendre de facteurs propres {\`a} l{'}auditeur/spectateur (ex.

Paper
Add Code

Analyse de l'effet de la r\'everb\'eration sur la reconnaissance automatique de la parole (Analyzing how reverberation affects Automatic Speech Recognition)

no code implementations • JEPTALNRECITAL 2020 • S{\'e}bastien Ferreira, J{\'e}r{\^o}me Farinas, Julien Pinquier, Julie Mauclair, St{\'e}phane Rabant

La Reconnaissance Automatique de la Parole (RAP) est moins performante lorsque le signal de parole est de mauvaise qualit{\'e}.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

Subjective Evaluation of Comprehensibility in Movie Interactions

no code implementations • LREC 2020 • R, Estelle ria, Lionel Fontan, Maxime Le Coz, Isabelle Ferran{\'e}, Julien Pinquier

Various research works have dealt with the comprehensibility of textual, audio, or audiovisual documents, and showed that factors related to text (e. g. linguistic complexity), sound (e. g. speech intelligibility), image (e. g. presence of visual context), or even to cognition and emotion can play a major role in the ability of humans to understand the semantic and pragmatic contents of a given document.

Paper
Add Code

Deep Neural Networks for Automatic Speech Processing: A Survey from Large Corpora to Limited Data

no code implementations • 9 Mar 2020 • Vincent Roger, Jérôme Farinas, Julien Pinquier

In that sense we propose an overview of few-shot techniques and perspectives of using such techniques for the focused speech problems in this survey.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Add Code

Improving Vehicle Re-Identification using CNN Latent Spaces: Metrics Comparison and Track-to-track Extension

1 code implementation • 21 Oct 2019 • Geoffrey Roman-Jimenez, Patrice Guyot, Thierry Malon, Sylvie Chambon, Vincent Charvillat, Alain Crouzil, André Péninou, Julien Pinquier, Florence Sedes, Christine Sénac

We compared T2TP with I2TP using the same CNN models.

Retrieval Vehicle Re-Identification

Paper
Code

Toward a Computational Multidimensional Lexical Similarity Measure for Modeling Word Association Tasks in Psycholinguistics

no code implementations • WS 2019 • Bruno Gaume, Lydia Mai Ho-Dac, Ludovic Tanguy, C{\'e}cile Fabre, B{\'e}n{\'e}dicte Pierrejean, Nabil Hathout, J{\'e}r{\^o}me Farinas, Julien Pinquier, Lola Danet, Patrice P{\'e}ran, Xavier De Boissezon, M{\'e}lanie Jucla

This paper presents the first results of a multidisciplinary project, the {``}Evolex{''} project, gathering researchers in Psycholinguistics, Neuropsychology, Computer Science, Natural Language Processing and Linguistics.

General Classification Semantic Similarity +1

Paper
Add Code

Carcinologic Speech Severity Index Project: A Database of Speech Disorder Productions to Assess Quality of Life Related to Speech After Cancer

no code implementations • LREC 2018 • Corine Ast{\'e}sano, Mathieu Balaguer, J{\'e}r{\^o}me Farinas, Corinne Fredouille, Pascal Gaillard, Alain Ghio, Imed Laaridh, Muriel Lalain, Beno{\^\i}t Lepage, Julie Mauclair, Olivier Nocaudie, Julien Pinquier, Oriol Pont, Gilles Pouchoulin, Mich{\`e}le Puech, Dani{\`e}le Robert, Etienne Sicard, Virginie Woisard

Paper
Add Code

Influence de la quantit\'e de donn\'ees sur une t\^ache de segmentation de phones fond\'ee sur les r\'eseaux de neurones (Phone-level speech segmentation with neural networks : influence of the amount of data )

no code implementations • JEPTALNRECITAL 2016 • C{\'e}line Manenti, Thomas Pellegrini, Julien Pinquier

Dans cet article, nous d{\'e}crivons une {\'e}tude exp{\'e}rimentale de segmentation de parole en unit{\'e}s acoustiques sous-lexicales (phones) {\`a} l{'}aide de r{\'e}seaux de neurones.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.