no code implementations • 16 Feb 2023 • Xiao-Ying Zhao, Qiu-Shi Zhu, Jie Zhang
With advances in deep learning, neural network based speech enhancement (SE) has developed rapidly in the last decade.
no code implementations • 28 Sep 2022 • Xiao-Ying Zhao, Qiu-Shi Zhu, Jie Zhang
Specifically, the encoder and bottleneck layer of the DEMUCS model are initialized using the self-supervised pretrained WavLM model, the convolution in the encoder is replaced by causal convolution, and the transformer encoder in the bottleneck layer is based on causal attention mask.