Search Results for author: Simon Doclo

Found 40 papers, 2 papers with code

Binaural Speech Enhancement Using Deep Complex Convolutional Transformer Networks

1 code implementation • 8 Mar 2024 • Vikas Tokala, Eric Grinstein, Mike Brookes, Simon Doclo, Jesper Jensen, Patrick A. Naylor

Studies have shown that in noisy acoustic environments, providing binaural signals to the user of an assistive listening device may improve speech intelligibility and spatial awareness.

Speech Enhancement

Speech Dereverberation Using Nonnegative Convolutive Transfer Function and Spectro-temporal Modeling

no code implementations • 16 Sep 2017 • Nasser Mohammadiha, Simon Doclo

This paper presents two single channel speech dereverberation methods to enhance the quality of speech signals that have been recorded in an enclosed space.

Speech Dereverberation

A State-Space Approach to Dynamic Nonnegative Matrix Factorization

no code implementations • 31 Aug 2017 • Nasser Mohammadiha, Paris Smaragdis, Ghazaleh Panahandeh, Simon Doclo

Nonnegative matrix factorization (NMF) has been actively investigated and used in a wide range of problems in the past decade.

Time Series, Time Series Analysis

Improving auditory attention decoding performance of linear and non-linear methods using state-space model

no code implementations • 2 Apr 2020 • Ali Aroudi, Tobias de Taillez, Simon Doclo

In this paper, we investigate a state-space model using correlation coefficients obtained with a small correlation window to improve the decoding performance of the linear and the non-linear AAD methods.

EEG, Electroencephalogram (EEG)

Binaural LCMV Beamforming with Partial Noise Estimation

no code implementations • 10 May 2019 • Nico Gößling, Elior Hadad, Sharon Gannot, Simon Doclo

While the binaural minimum variance distortionless response (BMVDR) beamformer provides good noise reduction performance and preserves the binaural cues of the desired source, it does not allow controlling the reduction of the interfering sources and distorts the binaural cues of the interfering sources and the background noise.

Noise Estimation

Deep Multi-Frame MVDR Filtering for Single-Microphone Speech Enhancement

1 code implementation • 20 Nov 2020 • Marvin Tammen, Simon Doclo

Multi-frame algorithms for single-microphone speech enhancement, e.g., the multi-frame minimum variance distortionless response (MFMVDR) filter, are able to exploit speech correlation across adjacent time frames in the short-time Fourier transform (STFT) domain.

Speech Enhancement

DNN-Based Speech Presence Probability Estimation for Multi-Frame Single-Microphone Speech Enhancement

no code implementations • 21 May 2019 • Marvin Tammen, Dörte Fischer, Bernd T. Meyer, Simon Doclo

In contrast to single-frame approaches such as the Wiener gain, it has been shown that multi-frame approaches achieve a substantial noise reduction with hardly any speech distortion, provided that an accurate estimate of the correlation matrices and especially the speech interframe correlation (IFC) vector is available.

Speech Enhancement

Speaker-conditioned Target Speaker Extraction based on Customized LSTM Cells

no code implementations • 9 Apr 2021 • Ragini Sinha, Marvin Tammen, Christian Rollwage, Simon Doclo

In this paper, we focus on a single-channel target speaker extraction system based on a CNN-LSTM separator network and a speaker embedder network requiring reference speech of the target speaker.

Target Speaker Extraction

Comparison of Binaural RTF-Vector-Based Direction of Arrival Estimation Methods Exploiting an External Microphone

no code implementations • 11 Apr 2021 • Daniel Fejgin, Simon Doclo

In this paper we consider a binaural hearing aid setup, where in addition to the head-mounted microphones an external microphone is available.

Direction of Arrival Estimation

Sound Pressure Minimization at the Ear Drum for In-ear ANC Headphones using a Fixed Feedforward Remote Microphone Technique

no code implementations • 14 May 2021 • Piero Rivera Benois, Reinhild Roden, Matthias Blau, Simon Doclo

Based on measured acoustic paths to predict the sound pressure generated by external sources and the headphone at the ear drum, the FIR filter coefficients of the ANC controller are optimized for different sound fields.

Joint Multi-Channel Dereverberation and Noise Reduction Using a Unified Convolutional Beamformer With Sparse Priors

no code implementations • 3 Jun 2021 • Henri Gode, Marvin Tammen, Simon Doclo

To optimize the convolutional filter, the desired speech component is modeled with a time-varying Gaussian model, which promotes the sparsity of the desired speech component in the short-time Fourier transform domain compared to the noisy microphone signals.

Robust single- and multi-loudspeaker least-squares-based equalization for hearing devices

no code implementations • 9 Sep 2021 • Henning Schepker, Florian Denk, Birger Kollmeier, Simon Doclo

To improve the sound quality of hearing devices, equalization filters can be used that aim at achieving acoustic transparency, i.e., listening with the device in the ear is perceptually similar to the open ear.

Management

Individualized sound pressure equalization in hearing devices exploiting an electro-acoustic model

no code implementations • 4 Oct 2021 • Henning Schepker, Reinhild Rohden, Florian Denk, Birger Kollmeier, Matthias Blau, Simon Doclo

Achieving optimal individualized equalization typically requires knowledge of all transfer functions between the source, the hearing device, and the individual eardrum.

Optimization of a Fixed Virtual Sensing Feedback ANC Controller for In-Ear Headphones with Multiple Loudspeakers

no code implementations • 7 Oct 2021 • Piero Rivera Benois, Reinhild Roden, Matthias Blau, Simon Doclo

In this paper we consider an in-ear headphone equipped with an inner microphone and multiple loudspeakers and we propose an optimization procedure with a convex objective function to derive a fixed multi-loudspeaker ANC controller aiming at minimizing the sound pressure at the ear drum.

Training Strategies for Own Voice Reconstruction in Hearing Protection Devices using an In-ear Microphone

no code implementations • 12 May 2022 • Mattes Ohlenbusch, Christian Rollwage, Simon Doclo

In this paper, we apply a deep learning-based bandwidth-extension system to the own voice reconstruction task and investigate different training strategies in order to overcome the limited availability of training data.

Bandwidth Extension

Dictionary-Based Fusion of Contact and Acoustic Microphones for Wind Noise Reduction

no code implementations • 18 May 2022 • Marvin Tammen, XiLin Li, Simon Doclo, Lalin Theverapperuma

In mobile speech communication applications, wind noise can lead to a severe reduction of speech quality and intelligibility.

Speech Enhancement

Deep Multi-Frame MVDR Filtering for Binaural Noise Reduction

no code implementations • 18 May 2022 • Marvin Tammen, Simon Doclo

To improve speech intelligibility and speech quality in noisy environments, binaural noise reduction algorithms for head-mounted assistive listening devices are of crucial importance.

Coherence-Based Frequency Subset Selection For Binaural RTF-Vector-Based Direction of Arrival Estimation for Multiple Speakers

no code implementations • 18 May 2022 • Daniel Fejgin, Simon Doclo

Recently, a method has been proposed to estimate the direction of arrival (DOA) of a single speaker by minimizing the frequency-averaged Hermitian angle between an estimated relative transfer function (RTF) vector and a database of prototype anechoic RTF vectors.

Direction of Arrival Estimation

3D Single Source Localization Based on Euclidean Distance Matrices

no code implementations • 18 May 2022 • Klaus Brümann, Simon Doclo

A popular approach for 3D source localization using multiple microphones is the steered-response power method, where the source position is directly estimated by maximizing a function of three continuous position variables.

Position

Bias Analysis of Spatial Coherence-Based RTF Vector Estimation for Acoustic Sensor Networks in a Diffuse Sound Field

no code implementations • 19 May 2022 • Wiebke Middelberg, Simon Doclo

In this paper, we perform a theoretical bias analysis for the SC-based RTF vector estimation method with multiple external microphones.

Speaker-conditioning Single-channel Target Speaker Extraction using Conformer-based Architectures

no code implementations • 27 May 2022 • Ragini Sinha, Marvin Tammen, Christian Rollwage, Simon Doclo

Target speaker extraction aims at extracting the target speaker from a mixture of multiple speakers exploiting auxiliary information about the target speaker.

Target Speaker Extraction

Signal-informed DNN-based DOA Estimation combining an External Microphone and GCC-PHAT Features

no code implementations • 11 Jun 2022 • Ulrik Kowalk, Simon Doclo, Joerg Bitzer

Aiming at estimating the direction of arrival (DOA) of a desired speaker in a multi-talker environment using a microphone array, in this paper we propose a signal-informed method exploiting the availability of an external microphone attached to the desired speaker.

Sampling Rate Offset Estimation and Compensation for Distributed Adaptive Node-Specific Signal Estimation in Wireless Acoustic Sensor Networks

no code implementations • 4 Nov 2022 • Paul Didier, Toon van Waterschoot, Simon Doclo, Marc Moonen

Sampling rate offsets (SROs) between devices in a heterogeneous wireless acoustic sensor network (WASN) can hinder the ability of distributed adaptive algorithms to perform as intended when they rely on coherent signal processing.

Assisted RTF-Vector-Based Binaural Direction of Arrival Estimation Exploiting a Calibrated External Microphone Array

no code implementations • 30 Nov 2022 • Daniel Fejgin, Simon Doclo

This method exploits the external microphones to estimate the RTF vector corresponding to the binaural hearing aid and constructs a one-dimensional spatial spectrum by comparing the estimated RTF vector against a database of anechoic prototype RTF vectors for several directions.

Direction of Arrival Estimation

Geometry-aware DoA Estimation using a Deep Neural Network with mixed-data input features

no code implementations • 9 Dec 2022 • Ulrik Kowalk, Simon Doclo, Joerg Bitzer

Aiming at designing a supervised learning-based DoA estimation algorithm that generalizes well to different array geometries, in this paper we propose a geometry-aware DoA estimation algorithm.

Dereverberation in Acoustic Sensor Networks Using Weighted Prediction Error With Microphone-dependent Prediction Delays

no code implementations • 18 Jan 2023 • Anselm Lohmann, Toon van Waterschoot, Joerg Bitzer, Simon Doclo

In the WPE algorithm, a prediction delay is required to reduce the correlation between the prediction signals and the direct component in the reference microphone signal.

Speech Dereverberation

Adaptive Dereverberation, Noise and Interferer Reduction Using Sparse Weighted Linearly Constrained Minimum Power Beamforming

no code implementations • 13 Mar 2023 • Henri Gode, Simon Doclo

Interfering sources, background noise and reverberation degrade speech quality and intelligibility in hearing aid applications.

Speech Enhancement

BRUDEX Database: Binaural Room Impulse Responses with Uniformly Distributed External Microphones

no code implementations • 14 Jun 2023 • Daniel Fejgin, Wiebke Middelberg, Simon Doclo

There is an emerging need for comparable data for multi-microphone processing, particularly in acoustic sensor networks.

Exploiting an External Microphone for Binaural RTF-Vector-Based Direction of Arrival Estimation for Multiple Speakers

no code implementations • 10 Jul 2023 • Daniel Fejgin, Simon Doclo

In hearing aid applications, an important objective is to accurately estimate the direction of arrival (DOA) of multiple speakers in noisy and reverberant environments.

Direction of Arrival Estimation

Speech-dependent Modeling of Own Voice Transfer Characteristics for In-ear Microphones in Hearables

no code implementations • 15 Sep 2023 • Mattes Ohlenbusch, Christian Rollwage, Simon Doclo

To enhance the quality of the in-ear microphone signal using algorithms aiming at joint bandwidth extension, equalization, and noise reduction, it is desirable to have an accurate model of the own voice transfer characteristics between the entrance of the ear canal and the in-ear microphone.

Bandwidth Extension

Modeling of Speech-dependent Own Voice Transfer Characteristics for Hearables with In-ear Microphones

no code implementations • 10 Oct 2023 • Mattes Ohlenbusch, Christian Rollwage, Simon Doclo

In this paper, we propose a speech-dependent model of the own voice transfer characteristics based on phoneme recognition, assuming a linear time-invariant relative transfer function for each phoneme.

Covariance Blocking and Whitening Method for Successive Relative Transfer Function Vector Estimation in Multi-Speaker Scenarios

no code implementations • 25 Oct 2023 • Henri Gode, Simon Doclo

Instead of blocking the second speaker, in this paper we propose a covariance blocking and whitening (CBW) method, which first blocks the first speaker and applies whitening using the estimated noise covariance matrix and then estimates the RTF vector of the second speaker based on a singular value decomposition.

Blocking

Relative Transfer Function Vector Estimation for Acoustic Sensor Networks Exploiting Covariance Matrix Structure

no code implementations • 27 Oct 2023 • Wiebke Middelberg, Henri Gode, Simon Doclo

In many multi-microphone algorithms for noise reduction, an estimate of the relative transfer function (RTF) vector of the target speaker is required.

Head Orientation Estimation with Distributed Microphones Using Speech Radiation Patterns

no code implementations • 4 Dec 2023 • Kaspar Müller, Bilgesu Çakmak, Paul Didier, Simon Doclo, Jan Østergaard, Tobias Wolff

Determining the head orientation of a talker is not only beneficial for various speech signal processing applications, such as source localization or speech enhancement, but also facilitates intuitive voice control and interaction with smart environments or modern car assistants.

Speech Enhancement

Multi-Microphone Noise Data Augmentation for DNN-based Own Voice Reconstruction for Hearables in Noisy Environments

no code implementations • 14 Dec 2023 • Mattes Ohlenbusch, Christian Rollwage, Simon Doclo

Recording a sufficient amount of noise required for training such a system is costly since noise transmission between outer and inner microphones varies individually.

Data Augmentation

Comparison of Frequency-Fusion Mechanisms for Binaural Direction-of-Arrival Estimation for Multiple Speakers

no code implementations • 15 Jan 2024 • Daniel Fejgin, Elior Hadad, Sharon Gannot, Zbyněk Koldovský, Simon Doclo

According to how the SPS are combined, frequency fusion mechanisms are categorized into narrowband, broadband, or speaker-grouped, where the latter mechanism requires a speaker-wise grouping of frequencies.

Direction of Arrival Estimation

Effect of target signals and delays on spatially selective active noise control for open-fitting hearables

no code implementations • 15 Jan 2024 • Tong Xiao, Simon Doclo

Spatially selective active noise control (ANC) hearables are designed to reduce unwanted noise from certain directions while preserving desired sounds from other directions.

Microphone Subset Selection for the Weighted Prediction Error Algorithm using a Group Sparsity Penalty

no code implementations • 16 Jan 2024 • Anselm Lohmann, Toon van Waterschoot, Joerg Bitzer, Simon Doclo

Reverberation can severely degrade the quality of speech signals recorded using microphones in an enclosure.

Array Geometry-Robust Attention-Based Neural Beamformer for Moving Speakers

no code implementations • 5 Feb 2024 • Marvin Tammen, Tsubasa Ochiai, Marc Delcroix, Tomohiro Nakatani, Shoko Araki, Simon Doclo

Recently, a mask-based beamformer with attention-based spatial covariance matrix aggregator (ASA) was proposed, which was demonstrated to track moving sources accurately.
