no code implementations • 24 Jan 2024 • Jiajun He, Xiaohan Shi, Xingfeng Li, Tomoki Toda
Therefore, in this paper, we incorporate two auxiliary tasks, ASR error detection (AED) and ASR error correction (AEC), to enhance the semantic coherence of ASR text, and further introduce a novel multi-modal fusion (MF) method to learn shared representations across modalities.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 13 Nov 2023 • Xiaohan Shi, Jiajun He, Xingfeng Li, Tomoki Toda
This paper proposes an efficient attempt to noisy speech emotion recognition (NSER).
Automatic Speech Recognition Automatic Speech Recognition (ASR) +3