no code implementations • 11 Dec 2024 • Xingchen Song, Mengtao Xing, Changwei Ma, Shengqiang Li, Di Wu, BinBin Zhang, Fuping Pan, Dinghao Zhou, Yuekai Zhang, Shun Lei, Zhendong Peng, Zhiyong Wu
Finally, we explore the feasibility of unifying TTS and ASR tasks using the same data for training, thanks to the simplified pipeline and the S3Tokenizer that reduces the quality requirements for TTS training data.
no code implementations • 16 Nov 2021 • Shubo Lv, Yihui Fu, Mengtao Xing, Jiayao Sun, Lei Xie, Jun Huang, Yannan Wang, Tao Yu
In speech enhancement, complex neural networks have shown promising performance due to their effectiveness in processing the complex-valued spectrum.
7 code implementations • Interspeech 2020 • Yanxin Hu, Yun Liu, Shubo Lv, Mengtao Xing, Shimin Zhang, Yihui Fu, Jian Wu, Bihong Zhang, Lei Xie
Speech enhancement has benefited from the success of deep learning in terms of intelligibility and perceptual quality.
Ranked #14 on Speech Enhancement on Deep Noise Suppression (DNS) Challenge (PESQ-NB metric)
Speech Enhancement
Audio and Speech Processing
Sound