no code implementations • 5 Feb 2024 • Marvin Tammen, Tsubasa Ochiai, Marc Delcroix, Tomohiro Nakatani, Shoko Araki, Simon Doclo
Although mask-based beamforming is a powerful speech enhancement approach, it often requires manual parameter tuning to handle moving speakers.
no code implementations • 27 May 2022 • Ragini Sinha, Marvin Tammen, Christian Rollwage, Simon Doclo
Target speaker extraction aims at extracting the target speaker from a mixture of multiple speakers exploiting auxiliary information about the target speaker.
no code implementations • 18 May 2022 • Marvin Tammen, XiLin Li, Simon Doclo, Lalin Theverapperuma
In mobile speech communication applications, wind noise can lead to a severe reduction of speech quality and intelligibility.
no code implementations • 18 May 2022 • Marvin Tammen, Simon Doclo
To improve speech intelligibility and speech quality in noisy environments, binaural noise reduction algorithms for head-mounted assistive listening devices are of crucial importance.
no code implementations • 3 Jun 2021 • Henri Gode, Marvin Tammen, Simon Doclo
To optimize the convolutional filter, the desired speech component is modeled with a time-varying Gaussian model, which promotes the sparsity of the desired speech component in the short-time Fourier transform domain compared to the noisy microphone signals.
no code implementations • 9 Apr 2021 • Ragini Sinha, Marvin Tammen, Christian Rollwage, Simon Doclo
In this paper, we focus on a single-channel target speaker extraction system based on a CNN-LSTM separator network and a speaker embedder network requiring reference speech of the target speaker.
1 code implementation • 20 Nov 2020 • Marvin Tammen, Simon Doclo
Multi-frame algorithms for single-microphone speech enhancement, e. g., the multi-frame minimum variance distortionless response (MFMVDR) filter, are able to exploit speech correlation across adjacent time frames in the short-time Fourier transform (STFT) domain.
no code implementations • 21 May 2019 • Marvin Tammen, Dörte Fischer, Bernd T. Meyer, Simon Doclo
In contrast to single-frame approaches such as the Wiener gain, it has been shown that multi-frame approaches achieve a substantial noise reduction with hardly any speech distortion, provided that an accurate estimate of the correlation matrices and especially the speech interframe correlation (IFC) vector is available.