1 code implementation • 19 Jan 2024 • Yuewei Zhang, Huanbin Zou, Jie Zhu
Two-stage pipeline is popular in speech enhancement tasks due to its superiority over traditional single-stage methods.
no code implementations • 11 Oct 2023 • Yuewei Zhang, Huanbin Zou, Jie Zhu
In speech enhancement (SE), phase estimation is important for perceptual quality, so many methods take clean speech's complex short-time Fourier transform (STFT) spectrum or the complex ideal ratio mask (cIRM) as the learning target.
no code implementations • 11 Oct 2023 • Yuewei Zhang, Huanbin Zou, Jie Zhu
The deep learning-based speech enhancement (SE) methods always take the clean speech's waveform or time-frequency spectrum feature as the learning target, and train the deep neural network (DNN) by reducing the error loss between the DNN's output and the target.