Search Results for author: Marvin Tammen

Found 8 papers, 1 papers with code

Array Geometry-Robust Attention-Based Neural Beamformer for Moving Speakers

no code implementations • 5 Feb 2024 • Marvin Tammen, Tsubasa Ochiai, Marc Delcroix, Tomohiro Nakatani, Shoko Araki, Simon Doclo

Recently, a mask-based beamformer with attention-based spatial covariance matrix aggregator (ASA) was proposed, which was demonstrated to track moving sources accurately.

Paper
Add Code

Speaker-conditioning Single-channel Target Speaker Extraction using Conformer-based Architectures

no code implementations • 27 May 2022 • Ragini Sinha, Marvin Tammen, Christian Rollwage, Simon Doclo

Target speaker extraction aims at extracting the target speaker from a mixture of multiple speakers exploiting auxiliary information about the target speaker.

Target Speaker Extraction

Paper
Add Code

Dictionary-Based Fusion of Contact and Acoustic Microphones for Wind Noise Reduction

no code implementations • 18 May 2022 • Marvin Tammen, XiLin Li, Simon Doclo, Lalin Theverapperuma

In mobile speech communication applications, wind noise can lead to a severe reduction of speech quality and intelligibility.

Speech Enhancement

Paper
Add Code

Deep Multi-Frame MVDR Filtering for Binaural Noise Reduction

no code implementations • 18 May 2022 • Marvin Tammen, Simon Doclo

To improve speech intelligibility and speech quality in noisy environments, binaural noise reduction algorithms for head-mounted assistive listening devices are of crucial importance.

Paper
Add Code

Joint Multi-Channel Dereverberation and Noise Reduction Using a Unified Convolutional Beamformer With Sparse Priors

no code implementations • 3 Jun 2021 • Henri Gode, Marvin Tammen, Simon Doclo

To optimize the convolutional filter, the desired speech component is modeled with a time-varying Gaussian model, which promotes the sparsity of the desired speech component in the short-time Fourier transform domain compared to the noisy microphone signals.

Paper
Add Code

Speaker-conditioned Target Speaker Extraction based on Customized LSTM Cells

no code implementations • 9 Apr 2021 • Ragini Sinha, Marvin Tammen, Christian Rollwage, Simon Doclo

In this paper, we focus on a single-channel target speaker extraction system based on a CNN-LSTM separator network and a speaker embedder network requiring reference speech of the target speaker.

Target Speaker Extraction

Paper
Add Code

Deep Multi-Frame MVDR Filtering for Single-Microphone Speech Enhancement

1 code implementation • 20 Nov 2020 • Marvin Tammen, Simon Doclo

Multi-frame algorithms for single-microphone speech enhancement, e. g., the multi-frame minimum variance distortionless response (MFMVDR) filter, are able to exploit speech correlation across adjacent time frames in the short-time Fourier transform (STFT) domain.

Speech Enhancement

Paper
Code

DNN-Based Speech Presence Probability Estimation for Multi-Frame Single-Microphone Speech Enhancement

no code implementations • 21 May 2019 • Marvin Tammen, Dörte Fischer, Bernd T. Meyer, Simon Doclo

In contrast to single-frame approaches such as the Wiener gain, it has been shown that multi-frame approaches achieve a substantial noise reduction with hardly any speech distortion, provided that an accurate estimate of the correlation matrices and especially the speech interframe correlation (IFC) vector is available.

Speech Enhancement

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.