Search Results for author: Valentin Vielzeuf

Found 14 papers, 3 papers with code

Efficiency-oriented approaches for self-supervised speech representation learning

no code implementations18 Dec 2023 Luis Lugo, Valentin Vielzeuf

Self-supervised learning enables the training of large neural models without the need for large, labeled datasets.

Automatic Speech Recognition Representation Learning +4

Is one brick enough to break the wall of spoken dialogue state tracking?

no code implementations3 Nov 2023 Lucas Druart, Valentin Vielzeuf, Yannick Estève

In Task-Oriented Dialogue (TOD) systems, correctly updating the system's understanding of the user's needs (a. k. a dialogue state tracking) is key to a smooth interaction.

Dialogue State Tracking

Are cascade dialogue state tracking models speaking out of turn in spoken dialogues?

no code implementations3 Nov 2023 Lucas Druart, Léo Jacqmin, Benoît Favre, Lina Maria Rojas-Barahona, Valentin Vielzeuf

In Task-Oriented Dialogue (TOD) systems, correctly updating the system's understanding of the user's needs is key to a smooth interaction.

Dialogue State Tracking

OLISIA: a Cascade System for Spoken Dialogue State Tracking

1 code implementation20 Apr 2023 Léo Jacqmin, Lucas Druart, Yannick Estève, Benoît Favre, Lina Maria Rojas-Barahona, Valentin Vielzeuf

Though Dialogue State Tracking (DST) is a core component of spoken dialogue systems, recent work on this task mostly deals with chat corpora, disregarding the discrepancies between spoken and written language. In this paper, we propose OLISIA, a cascade system which integrates an Automatic Speech Recognition (ASR) model and a DST model.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

Are E2E ASR models ready for an industrial usage?

no code implementations9 Dec 2021 Valentin Vielzeuf, Grigory Antipov

The Automated Speech Recognition (ASR) community experiences a major turning point with the rise of the fully-neural (End-to-End, E2E) approaches.

speech-recognition Speech Recognition

Efficient conformer: Progressive downsampling and grouped attention for automatic speech recognition

1 code implementation31 Aug 2021 Maxime Burchi, Valentin Vielzeuf

The recently proposed Conformer architecture has shown state-of-the-art performances in Automatic Speech Recognition by combining convolution with attention to model both local and global dependencies.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Towards a General Model of Knowledge for Facial Analysis by Multi-Source Transfer Learning

no code implementations8 Nov 2019 Valentin Vielzeuf, Alexis Lechervy, Stéphane Pateux, Frédéric Jurie

This model outperforms its teacher on novel tasks, achieving results on par with state-of-the-art methods on 15 facial analysis tasks (and domains), at an affordable training cost.

Transfer Learning

Multi-Level Sensor Fusion with Deep Learning

no code implementations5 Nov 2018 Valentin Vielzeuf, Alexis Lechervy, Stéphane Pateux, Frédéric Jurie

In the context of deep learning, this article presents an original deep network, namely CentralNet, for the fusion of information coming from different sensors.

Sensor Fusion

CentralNet: a Multilayer Approach for Multimodal Fusion

2 code implementations22 Aug 2018 Valentin Vielzeuf, Alexis Lechervy, Stéphane Pateux, Frédéric Jurie

This paper proposes a novel multimodal fusion approach, aiming to produce best possible decisions by integrating information coming from multiple media.

Multi-Task Learning

CAKE: Compact and Accurate K-dimensional representation of Emotion

no code implementations30 Jul 2018 Corentin Kervadec, Valentin Vielzeuf, Stéphane Pateux, Alexis Lechervy, Frédéric Jurie

Alongside, Deep Neural Networks (DNN) are reaching excellent performances and are becoming interesting features extraction tools in many computer vision tasks. Inspired by works from the psychology community, we first study the link between the compact two-dimensional representation of the emotion known as arousal-valence, and discrete emotion classes (e. g. anger, happiness, sadness, etc.)

Ranked #22 on Facial Expression Recognition (FER) on AffectNet (Accuracy (7 emotion) metric)

Emotion Recognition Facial Expression Recognition (FER)

Cannot find the paper you are looking for? You can Submit a new open access paper.