1 code implementation • 14 Nov 2022 • Dexin Liao, Tao Jiang, Feng Wang, Lin Li, Qingyang Hong
Transformer has achieved extraordinary performance in Natural Language Processing and Computer Vision tasks thanks to its powerful self-attention mechanism, and its variant Conformer has become a state-of-the-art architecture in the field of Automatic Speech Recognition (ASR).
Automatic Speech Recognition Automatic Speech Recognition (ASR) +3
no code implementations • 30 Jun 2021 • Dexin Liao, Jing Li, Yiming Zhi, Song Li, Qingyang Hong, Lin Li
For the SV system, we proposed a multi-task learning network, where phonetic branch is trained with the character label of the utterance, and speaker branch is trained with the label of the speaker.