Search Results for author: Huanbin Zou

Found 3 papers, 1 papers with code

A Two-Stage Framework in Cross-Spectrum Domain for Real-Time Speech Enhancement

1 code implementation • 19 Jan 2024 • Yuewei Zhang, Huanbin Zou, Jie Zhu

Two-stage pipeline is popular in speech enhancement tasks due to its superiority over traditional single-stage methods.

Paper
Code

Magnitude-and-phase-aware Speech Enhancement with Parallel Sequence Modeling

no code implementations • 11 Oct 2023 • Yuewei Zhang, Huanbin Zou, Jie Zhu

In speech enhancement (SE), phase estimation is important for perceptual quality, so many methods take clean speech's complex short-time Fourier transform (STFT) spectrum or the complex ideal ratio mask (cIRM) as the learning target.

Speech Enhancement

Paper
Add Code

VSANet: Real-time Speech Enhancement Based on Voice Activity Detection and Causal Spatial Attention

no code implementations • 11 Oct 2023 • Yuewei Zhang, Huanbin Zou, Jie Zhu

The deep learning-based speech enhancement (SE) methods always take the clean speech's waveform or time-frequency spectrum feature as the learning target, and train the deep neural network (DNN) by reducing the error loss between the DNN's output and the target.

Action Detection Activity Detection +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.