no code implementations • 12 Jan 2024 • Ye-Xin Lu, Yang Ai, Hui-Peng Du, Zhen-Hua Ling
Speech bandwidth extension (BWE) refers to widening the frequency bandwidth of speech signals, which makes the speech sound brighter and fuller.
1 code implementation • 20 Nov 2023 • Hui-Peng Du, Ye-Xin Lu, Yang Ai, Zhen-Hua Ling
APNet can generate synthesized speech of comparable quality to the HiFi-GAN vocoder, with considerably faster inference.
1 code implementation • 17 Aug 2023 • Ye-Xin Lu, Yang Ai, Zhen-Hua Ling
Compared to existing phase-aware speech enhancement methods, it further mitigates the compensation effect between the magnitude and phase by explicit phase estimation, elevating the perceptual quality of enhanced speech.
1 code implementation • 23 May 2023 • Ye-Xin Lu, Yang Ai, Zhen-Hua Ling
This paper proposes MP-SENet, a novel Speech Enhancement Network which directly denoises Magnitude and Phase spectra in parallel.
1 code implementation • 26 Apr 2023 • Ye-Xin Lu, Yang Ai, Zhen-Hua Ling
This paper proposes a source-filter-based generative adversarial neural vocoder named SF-GAN, which achieves high-fidelity waveform generation from input acoustic features by introducing F0-based source excitation signals to a neural filter framework.
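The idea of an F0-based source excitation signal can be illustrated with a minimal sketch. This is not SF-GAN's actual excitation module, which the summary above does not describe; it is a generic sine-plus-noise scheme of the kind commonly used in source-filter vocoders. The function name `f0_to_excitation` and all parameter values (sample rate, hop size, amplitudes) are assumptions chosen for illustration.

```python
import math
import random


def f0_to_excitation(f0_frames, sample_rate=16000, hop_size=80,
                     sine_amp=0.1, noise_std=0.003, seed=0):
    """Build a sample-level excitation from a frame-level F0 contour.

    Hypothetical helper, not taken from the SF-GAN paper.
    Voiced frames (F0 > 0) yield a sine wave whose frequency follows F0;
    unvoiced frames (F0 == 0) yield low-level Gaussian noise.
    """
    rng = random.Random(seed)
    excitation = []
    phase = 0.0
    for f0 in f0_frames:
        # Each frame is held constant for hop_size samples (no interpolation).
        for _ in range(hop_size):
            if f0 > 0:
                # Accumulate phase so the sine stays continuous across frames.
                phase = math.fmod(phase + 2.0 * math.pi * f0 / sample_rate,
                                  2.0 * math.pi)
                excitation.append(sine_amp * math.sin(phase))
            else:
                excitation.append(rng.gauss(0.0, noise_std))
    return excitation


# Five unvoiced frames followed by ten voiced frames at 200 Hz.
f0 = [0.0] * 5 + [200.0] * 10
signal = f0_to_excitation(f0)
```

In a source-filter vocoder, a signal like this would be passed to the neural filter together with the acoustic features; how SF-GAN combines them is left to the paper itself.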