no code implementations • 23 Jan 2024 • Chenyang Gao, Brecht Desplanques, Chelsea J.-T. Ju, Aman Chadha, Andreas Stolcke
Automated speaker identification (SID) is a crucial step for the personalization of a wide range of speech-enabled services.
no code implementations • 20 Nov 2023 • Chenyang Gao, Yue Gu, Ivan Marsic
Despite its success, previous studies have shown that PIT suffers from excessive label-assignment switching between adjacent epochs, which prevents the model from learning better label assignments.
no code implementations • 7 Mar 2023 • Chenyang Gao, Yue Gu, Francesco Caliva, Yuzong Liu
Self-supervised speech representation learning (S3RL) is revolutionizing the way we leverage the ever-growing availability of data.
no code implementations • 20 Oct 2021 • Chenyang Gao, Yue Gu, Ivan Marsic
We investigate the use of the mapping-based method in the time domain and show that it can perform better on a large training set than the masking-based method.
2 code implementations • 8 Jan 2021 • Chenyang Gao, Guanyu Cai, Xinyang Jiang, Feng Zheng, Jun Zhang, Yifei Gong, Pai Peng, Xiaowei Guo, Xing Sun
Secondly, a BERT with locality-constrained attention is proposed to obtain representations of descriptions at different scales.
Ranked #14 on Text based Person Retrieval on CUHK-PEDES