no code implementations • 29 Sep 2023 • Hongfei Xue, Qijie Shao, Kaixun Huang, Peikun Chen, Lei Xie, Jie Liu
We first analyze the different layers of the SSL model for language-related and content-related information, uncovering layers that show a stronger correlation.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 23 May 2023 • Hongfei Xue, Qijie Shao, Peikun Chen, Pengcheng Guo, Lei Xie, Jie Liu
Different from UniSpeech, UniData2vec replaces the quantized discrete representations with continuous and contextual representations from a teacher model for phonetically-aware pre-training.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +3
1 code implementation • 26 Mar 2022 • Yan Ju, Shan Jia, Lipeng Ke, Hongfei Xue, Koki Nagano, Siwei Lyu
Specifically, we design a two-branch model to combine global spatial information from the whole image and local informative features from multiple patches selected by a novel patch selection module.