no code implementations • 7 Oct 2023 • Theodor Nguyen, Guangzhi Sun, Xianrui Zheng, Chao Zhang, Philip C Woodland
For the reverse-time process, a parametrised score function is conditioned on a target speaker embedding to extract the target speaker from the mixture of sources.
no code implementations • 18 May 2022 • Guangzhi Sun, Chao Zhang, Philip C Woodland
MBWE and BLMD further improved the effectiveness of TCPGen and achieved more significant WER reductions on the biasing words.