no code implementations • 14 Dec 2022 • Leyuan Qu, Taihao Li, Cornelius Weber, Theresa Pekarek-Rosin, Fuji Ren, Stefan Wermter
Human speech can be characterized by different components, including semantic content, speaker identity and prosodic information.
no code implementations • 16 Nov 2022 • Wang Qi, Yu-Ping Ruan, Yuan Zuo, Taihao Li
Conventional fine-tuning becomes increasingly difficult as Pre-trained Language Models grow in size, which has made parameter-efficient tuning a focal point of frontier research.
no code implementations • 16 Nov 2022 • Leyuan Qu, Wei Wang, Taihao Li, Cornelius Weber, Stefan Wermter, Fuji Ren
Once training is completed, EmoAug enriches expressions of emotional speech in different prosodic attributes, such as stress, rhythm and intensity, by feeding different styles into the paralinguistic encoder.
Automatic Speech Recognition (ASR)
2 code implementations • 21 Oct 2022 • Yunfan Li, Mouxing Yang, Dezhong Peng, Taihao Li, Jiantao Huang, Xi Peng
Specifically, we find that when the data is projected into a feature space whose dimensionality equals the target cluster number, the rows and columns of its feature matrix correspond to the instance and cluster representations, respectively.
Ranked #1 on Short Text Clustering on Biomedical
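The row/column observation in the abstract above can be illustrated with a toy matrix. This is only a minimal sketch, not the authors' implementation: the values are made up, and names such as `features`, `instance_reps`, and `cluster_reps` are hypothetical.

```python
# Hypothetical N x K feature matrix: N = 4 instances, K = 2 target clusters.
# The values are illustrative soft assignments, not results from the paper.
features = [
    [0.9, 0.1],
    [0.8, 0.2],
    [0.3, 0.7],
    [0.1, 0.9],
]

# Each row is one instance's representation over the K clusters.
instance_reps = [list(row) for row in features]

# Each column is one cluster's representation over the N instances.
cluster_reps = [list(col) for col in zip(*features)]

# A hard label can be read off a row as the index of its largest entry.
labels = [row.index(max(row)) for row in instance_reps]
```

Transposing the same matrix gives the two views, so no separate cluster-level encoder is needed in this toy setting.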
1 code implementation • CVPR 2022 • Mouxing Yang, Zhenyu Huang, Peng Hu, Taihao Li, Jiancheng Lv, Xi Peng
To solve the twin noisy label (TNL) problem, we propose a novel method for robust visible-infrared person re-identification (VI-ReID), termed DuAlly Robust Training (DART).
no code implementations • 6 Oct 2021 • Fen Wang, Gene Cheung, Taihao Li, Ying Du, Yu-Ping Ruan
Sensor placement for linear inverse problems is the selection of locations to assign sensors so that the entire physical signal can be well recovered from partial observations.
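The problem statement above can be made concrete with a tiny, noise-free example. This is a sketch under stated assumptions and not the method of the paper: it assumes the signal lies in the span of two known basis vectors, and it picks sensors by exhaustively maximizing the determinant of the sampled Gram matrix, a standard D-optimality-style criterion swapped in here for illustration. All names (`phi`, `place_sensors`, `recover`) are hypothetical.

```python
from itertools import combinations

# Assumed model: a signal on 5 locations is x = phi @ a with 2 coefficients.
# Row i of phi is what a sensor placed at location i would observe.
phi = [
    [1.0, 0.0],
    [1.0, 1.0],
    [1.0, 2.0],
    [1.0, 3.0],
    [1.0, 4.0],
]

def gram(rows):
    """Entries (g00, g01, g11) of the symmetric 2x2 matrix rows^T rows."""
    g00 = sum(r[0] * r[0] for r in rows)
    g01 = sum(r[0] * r[1] for r in rows)
    g11 = sum(r[1] * r[1] for r in rows)
    return g00, g01, g11

def det_gram(rows):
    g00, g01, g11 = gram(rows)
    return g00 * g11 - g01 * g01

def place_sensors(phi, budget):
    """Exhaustive search; only feasible for very small problems."""
    return max(combinations(range(len(phi)), budget),
               key=lambda s: det_gram([phi[i] for i in s]))

def recover(phi, sensors, readings):
    """Least-squares fit of the coefficients, then rebuild the full signal."""
    rows = [phi[i] for i in sensors]
    g00, g01, g11 = gram(rows)
    b0 = sum(r[0] * y for r, y in zip(rows, readings))
    b1 = sum(r[1] * y for r, y in zip(rows, readings))
    det = g00 * g11 - g01 * g01
    a0 = (g11 * b0 - g01 * b1) / det  # closed-form 2x2 solve
    a1 = (g00 * b1 - g01 * b0) / det
    return [r[0] * a0 + r[1] * a1 for r in phi]

true_signal = [2.0 + 0.5 * i for i in range(5)]  # coefficients (2.0, 0.5)
sensors = place_sensors(phi, budget=2)
readings = [true_signal[i] for i in sensors]     # partial observations
estimate = recover(phi, sensors, readings)
```

With two sensors, the search selects the two endpoint locations, and the full five-point signal is rebuilt from those two readings. With noisy readings the choice of locations would also affect the reconstruction error, which is what makes placement a nontrivial selection problem.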