Search Results for author: Leyuan Qu

Found 6 papers, 0 papers with code

Emphasizing Unseen Words: New Vocabulary Acquisition for End-to-End Speech Recognition

no code implementations · 20 Feb 2023 · Leyuan Qu, Cornelius Weber, Stefan Wermter

Furthermore, our proposed combined loss rescaling and weight consolidation methods can support continual learning of an ASR system.

Automatic Speech Recognition, Automatic Speech Recognition (ASR), +5

Improving Speech Emotion Recognition with Unsupervised Speaking Style Transfer

no code implementations · 16 Nov 2022 · Leyuan Qu, Wei Wang, Cornelius Weber, Pengcheng Yue, Taihao Li, Stefan Wermter

Once training is completed, EmoAug enriches expressions of emotional speech with different prosodic attributes, such as stress, rhythm and intensity, by feeding different styles into the paralinguistic encoder.

Automatic Speech Recognition, Automatic Speech Recognition (ASR), +4

LipSound2: Self-Supervised Pre-Training for Lip-to-Speech Reconstruction and Lip Reading

no code implementations · 9 Dec 2021 · Leyuan Qu, Cornelius Weber, Stefan Wermter

The aim of this work is to investigate the impact of crossmodal self-supervised pre-training for speech reconstruction (video-to-audio) by leveraging the natural co-occurrence of audio and visual streams in videos.

Lip Reading, speech-recognition, +1

Multimodal Target Speech Separation with Voice and Face References

no code implementations · 17 May 2020 · Leyuan Qu, Cornelius Weber, Stefan Wermter

Target speech separation refers to isolating target speech from a multi-speaker mixture signal by conditioning on auxiliary information about the target speaker.

Audio and Speech Processing, Sound
