Search Results for author: Rongfeng Su

Found 3 papers, 0 papers with code

An Audio-textual Diffusion Model For Converting Speech Signals Into Ultrasound Tongue Imaging Data

no code implementations • 9 Mar 2024 • Yudong Yang, Rongfeng Su, Xiaokang Liu, Nan Yan, Lan Wang

In this model, the inherent acoustic characteristics of individuals related to the tongue motion details are encoded by using wav2vec 2. 0, while the ASR transcriptions related to the universality of tongue motions are encoded by using BERT.

Paper
Add Code

On-the-Fly Feature Based Rapid Speaker Adaptation for Dysarthric and Elderly Speech Recognition

no code implementations • 28 Mar 2022 • Mengzhe Geng, Xurong Xie, Rongfeng Su, Jianwei Yu, Zengrui Jin, Tianzi Wang, Shujie Hu, Zi Ye, Helen Meng, Xunying Liu

Accurate recognition of dysarthric and elderly speech remain challenging tasks to date.

speech-recognition Speech Recognition

Paper
Add Code

A Multi-level Acoustic Feature Extraction Framework for Transformer Based End-to-End Speech Recognition

no code implementations • 18 Aug 2021 • Jin Li, Rongfeng Su, Xurong Xie, Nan Yan, Lan Wang

The shallow stream is used to acquire traditional shallow features that is beneficial for the classification of phones or words while the deep stream is used to obtain utterance-level speaker-invariant deep features for improving the feature diversity.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.