Search Results for author: Yudong Yang

Found 1 paper, 0 papers with code

An Audio-textual Diffusion Model For Converting Speech Signals Into Ultrasound Tongue Imaging Data

no code implementations • 9 Mar 2024 • Yudong Yang, Rongfeng Su, Xiaokang Liu, Nan Yan, Lan Wang

In this model, the speaker-specific acoustic characteristics related to tongue motion details are encoded using wav2vec 2.0, while the ASR transcriptions, which relate to the universality of tongue motions, are encoded using BERT.
