Search Results for author: Simon Doclo

Found 44 papers, 2 papers with code

Low-Complexity Own Voice Reconstruction for Hearables with an In-Ear Microphone

no code implementations6 Sep 2024 Mattes Ohlenbusch, Christian Rollwage, Simon Doclo

Hearable devices, equipped with one or more microphones, are commonly used for speech communication.

Data Augmentation

Steered Response Power-Based Direction-of-Arrival Estimation Exploiting an Auxiliary Microphone

no code implementations3 Sep 2024 Klaus Brümann, Simon Doclo

Assuming the availability of an auxiliary microphone at an unknown position which is spatially separated from the CMA, in this paper we propose to compute the SRP-PHAT spectra between the microphones of the CMA based on the SRP-PHAT spectra between the auxiliary microphone and the microphones of the CMA.

Direction of Arrival Estimation

Speech-dependent Data Augmentation for Own Voice Reconstruction with Hearable Microphones in Noisy Environments

no code implementations19 May 2024 Mattes Ohlenbusch, Christian Rollwage, Simon Doclo

The proposed techniques use few recorded own voice signals to estimate transfer characteristics and can then be used to simulate a large amount of own voice signals based on single-channel speech signals.

Data Augmentation

Deep low-latency joint speech transmission and enhancement over a gaussian channel

no code implementations30 Apr 2024 Mohammad Bokaei, Jesper Jensen, Simon Doclo, Jan Østergaard

Ensuring intelligible speech communication for hearing assistive devices in low-latency scenarios presents significant challenges in terms of speech enhancement, coding and transmission.

Decoder Speech Enhancement

Binaural Speech Enhancement Using Deep Complex Convolutional Transformer Networks

1 code implementation8 Mar 2024 Vikas Tokala, Eric Grinstein, Mike Brookes, Simon Doclo, Jesper Jensen, Patrick A. Naylor

Studies have shown that in noisy acoustic environments, providing binaural signals to the user of an assistive listening device may improve speech intelligibility and spatial awareness.

Decoder Speech Enhancement

Array Geometry-Robust Attention-Based Neural Beamformer for Moving Speakers

no code implementations5 Feb 2024 Marvin Tammen, Tsubasa Ochiai, Marc Delcroix, Tomohiro Nakatani, Shoko Araki, Simon Doclo

Although mask-based beamforming is a powerful speech enhancement approach, it often requires manual parameter tuning to handle moving speakers.

Speech Enhancement

Microphone Subset Selection for the Weighted Prediction Error Algorithm using a Group Sparsity Penalty

no code implementations16 Jan 2024 Anselm Lohmann, Toon van Waterschoot, Joerg Bitzer, Simon Doclo

Reverberation can severely degrade the quality of speech signals recorded using microphones in an enclosure.

Effect of target signals and delays on spatially selective active noise control for open-fitting hearables

no code implementations15 Jan 2024 Tong Xiao, Simon Doclo

Spatially selective active noise control (ANC) hearables are designed to reduce unwanted noise from certain directions while preserving desired sounds from other directions.

Comparison of Frequency-Fusion Mechanisms for Binaural Direction-of-Arrival Estimation for Multiple Speakers

no code implementations15 Jan 2024 Daniel Fejgin, Elior Hadad, Sharon Gannot, Zbyněk Koldovský, Simon Doclo

According to how the SPS are combined, frequency fusion mechanisms are categorized into narrowband, broadband, or speaker-grouped, where the latter mechanism requires a speaker-wise grouping of frequencies.

Direction of Arrival Estimation

Multi-Microphone Noise Data Augmentation for DNN-based Own Voice Reconstruction for Hearables in Noisy Environments

no code implementations14 Dec 2023 Mattes Ohlenbusch, Christian Rollwage, Simon Doclo

Recording a sufficient amount of noise required for training such a system is costly since noise transmission between outer and inner microphones varies individually.

Data Augmentation

Head Orientation Estimation with Distributed Microphones Using Speech Radiation Patterns

no code implementations4 Dec 2023 Kaspar Müller, Bilgesu Çakmak, Paul Didier, Simon Doclo, Jan Østergaard, Tobias Wolff

Determining the head orientation of a talker is not only beneficial for various speech signal processing applications, such as source localization or speech enhancement, but also facilitates intuitive voice control and interaction with smart environments or modern car assistants.

Speech Enhancement

Relative Transfer Function Vector Estimation for Acoustic Sensor Networks Exploiting Covariance Matrix Structure

no code implementations27 Oct 2023 Wiebke Middelberg, Henri Gode, Simon Doclo

In many multi-microphone algorithms for noise reduction, an estimate of the relative transfer function (RTF) vector of the target speaker is required.

Covariance Blocking and Whitening Method for Successive Relative Transfer Function Vector Estimation in Multi-Speaker Scenarios

no code implementations25 Oct 2023 Henri Gode, Simon Doclo

Instead of blocking the second speaker, in this paper we propose a covariance blocking and whitening (CBW) method, which first blocks the first speaker and applies whitening using the estimated noise covariance matrix and then estimates the RTF vector of the second speaker based on a singular value decomposition.

Blocking

Modeling of Speech-dependent Own Voice Transfer Characteristics for Hearables with In-ear Microphones

no code implementations10 Oct 2023 Mattes Ohlenbusch, Christian Rollwage, Simon Doclo

In this paper, we propose a speech-dependent model of the own voice transfer characteristics based on phoneme recognition, assuming a linear time-invariant relative transfer function for each phoneme.

Speech-dependent Modeling of Own Voice Transfer Characteristics for In-ear Microphones in Hearables

no code implementations15 Sep 2023 Mattes Ohlenbusch, Christian Rollwage, Simon Doclo

To enhance the quality of the in-ear microphone signal using algorithms aiming at joint bandwidth extension, equalization, and noise reduction, it is desirable to have an accurate model of the own voice transfer characteristics between the entrance of the ear canal and the in-ear microphone.

Bandwidth Extension

Exploiting an External Microphone for Binaural RTF-Vector-Based Direction of Arrival Estimation for Multiple Speakers

no code implementations10 Jul 2023 Daniel Fejgin, Simon Doclo

In hearing aid applications, an important objective is to accurately estimate the direction of arrival (DOA) of multiple speakers in noisy and reverberant environments.

Direction of Arrival Estimation

BRUDEX Database: Binaural Room Impulse Responses with Uniformly Distributed External Microphones

no code implementations14 Jun 2023 Daniel Fejgin, Wiebke Middelberg, Simon Doclo

There is an emerging need for comparable data for multi-microphone processing, particularly in acoustic sensor networks.

Diversity

Adaptive Dereverberation, Noise and Interferer Reduction Using Sparse Weighted Linearly Constrained Minimum Power Beamforming

no code implementations13 Mar 2023 Henri Gode, Simon Doclo

Interfering sources, background noise and reverberation degrade speech quality and intelligibility in hearing aid applications.

Speech Enhancement

Dereverberation in Acoustic Sensor Networks Using Weighted Prediction Error With Microphone-dependent Prediction Delays

no code implementations18 Jan 2023 Anselm Lohmann, Toon van Waterschoot, Joerg Bitzer, Simon Doclo

In the WPE algorithm, a prediction delay is required to reduce the correlation between the prediction signals and the direct component in the reference microphone signal.

Speech Dereverberation

Geometry-aware DoA Estimation using a Deep Neural Network with mixed-data input features

no code implementations9 Dec 2022 Ulrik Kowalk, Simon Doclo, Joerg Bitzer

Aiming at designing a supervised learning-based DoA estimation algorithm that generalizes well to different array geometries, in this paper we propose a geometry-aware DoA estimation algorithm.

Assisted RTF-Vector-Based Binaural Direction of Arrival Estimation Exploiting a Calibrated External Microphone Array

no code implementations30 Nov 2022 Daniel Fejgin, Simon Doclo

This method exploits the external microphones to estimate the RTF vector corresponding to the binaural hearing aid and constructs a one-dimensional spatial spectrum by comparing the estimated RTF vector against a database of anechoic prototype RTF vectors for several directions.

Direction of Arrival Estimation

Sampling Rate Offset Estimation and Compensation for Distributed Adaptive Node-Specific Signal Estimation in Wireless Acoustic Sensor Networks

no code implementations4 Nov 2022 Paul Didier, Toon van Waterschoot, Simon Doclo, Marc Moonen

Sampling rate offsets (SROs) between devices in a heterogeneous wireless acoustic sensor network (WASN) can hinder the ability of distributed adaptive algorithms to perform as intended when they rely on coherent signal processing.

Signal-informed DNN-based DOA Estimation combining an External Microphone and GCC-PHAT Features

no code implementations11 Jun 2022 Ulrik Kowalk, Simon Doclo, Joerg Bitzer

Aiming at estimating the direction of arrival (DOA) of a desired speaker in a multi-talker environment using a microphone array, in this paper we propose a signal-informed method exploiting the availability of an external microphone attached to the desired speaker.

Speaker-conditioning Single-channel Target Speaker Extraction using Conformer-based Architectures

no code implementations27 May 2022 Ragini Sinha, Marvin Tammen, Christian Rollwage, Simon Doclo

Target speaker extraction aims at extracting the target speaker from a mixture of multiple speakers exploiting auxiliary information about the target speaker.

Target Speaker Extraction

Bias Analysis of Spatial Coherence-Based RTF Vector Estimation for Acoustic Sensor Networks in a Diffuse Sound Field

no code implementations19 May 2022 Wiebke Middelberg, Simon Doclo

In this paper, we perform a theoretical bias analysis for the SC-based RTF vector estimation method with multiple external microphones.

3D Single Source Localization Based on Euclidean Distance Matrices

no code implementations18 May 2022 Klaus Brümann, Simon Doclo

A popular approach for 3D source localization using multiple microphones is the steered-response power method, where the source position is directly estimated by maximizing a function of three continuous position variables.

Position

Coherence-Based Frequency Subset Selection For Binaural RTF-Vector-Based Direction of Arrival Estimation for Multiple Speakers

no code implementations18 May 2022 Daniel Fejgin, Simon Doclo

Recently, a method has been proposed to estimate the direction of arrival (DOA) of a single speaker by minimizing the frequency-averaged Hermitian angle between an estimated relative transfer function (RTF) vector and a database of prototype anechoic RTF vectors.

Direction of Arrival Estimation

Deep Multi-Frame MVDR Filtering for Binaural Noise Reduction

no code implementations18 May 2022 Marvin Tammen, Simon Doclo

To improve speech intelligibility and speech quality in noisy environments, binaural noise reduction algorithms for head-mounted assistive listening devices are of crucial importance.

Dictionary-Based Fusion of Contact and Acoustic Microphones for Wind Noise Reduction

no code implementations18 May 2022 Marvin Tammen, XiLin Li, Simon Doclo, Lalin Theverapperuma

In mobile speech communication applications, wind noise can lead to a severe reduction of speech quality and intelligibility.

Speech Enhancement

Training Strategies for Own Voice Reconstruction in Hearing Protection Devices using an In-ear Microphone

no code implementations12 May 2022 Mattes Ohlenbusch, Christian Rollwage, Simon Doclo

In this paper, we apply a deep learning-based bandwidth-extension system to the own voice reconstruction task and investigate different training strategies in order to overcome the limited availability of training data.

Bandwidth Extension

Optimization of a Fixed Virtual Sensing Feedback ANC Controller for In-Ear Headphones with Multiple Loudspeakers

no code implementations7 Oct 2021 Piero Rivera Benois, Reinhild Roden, Matthias Blau, Simon Doclo

In this paper we consider an in-ear headphone equipped with an inner microphone and multiple loudspeakers and we propose an optimization procedure with a convex objective function to derive a fixed multi-loudspeaker ANC controller aiming at minimizing the sound pressure at the ear drum.

Individualized sound pressure equalization in hearing devices exploiting an electro-acoustic model

no code implementations4 Oct 2021 Henning Schepker, Reinhild Rohden, Florian Denk, Birger Kollmeier, Matthias Blau, Simon Doclo

To achieve optimal individualized equalization typically requires knowledge of all transfer functions between the source, the hearing device, and the individual eardrum.

Robust single- and multi-loudspeaker least-squares-based equalization for hearing devices

no code implementations9 Sep 2021 Henning Schepker, Florian Denk, Birger Kollmeier, Simon Doclo

To improve the sound quality of hearing devices, equalization filters can be used that aim at achieving acoustic transparency, i. e., listening with the device in the ear is perceptually similar to the open ear.

Management

Joint Multi-Channel Dereverberation and Noise Reduction Using a Unified Convolutional Beamformer With Sparse Priors

no code implementations3 Jun 2021 Henri Gode, Marvin Tammen, Simon Doclo

To optimize the convolutional filter, the desired speech component is modeled with a time-varying Gaussian model, which promotes the sparsity of the desired speech component in the short-time Fourier transform domain compared to the noisy microphone signals.

Sound Pressure Minimization at the Ear Drum for In-ear ANC Headphones using a Fixed Feedforward Remote Microphone Technique

no code implementations14 May 2021 Piero Rivera Benois, Reinhild Roden, Matthias Blau, Simon Doclo

Based on measured acoustic paths to predict the sound pressure generated by external sources and the headphone at the ear drum, the FIR filter coefficients of the ANC controller are optimized for different sound fields.

Comparison of Binaural RTF-Vector-Based Direction of Arrival Estimation Methods Exploiting an External Microphone

no code implementations11 Apr 2021 Daniel Fejgin, Simon Doclo

In this paper we consider a binaural hearing aid setup, where in addition to the head-mounted microphones an external microphone is available.

Direction of Arrival Estimation

Speaker-conditioned Target Speaker Extraction based on Customized LSTM Cells

no code implementations9 Apr 2021 Ragini Sinha, Marvin Tammen, Christian Rollwage, Simon Doclo

In this paper, we focus on a single-channel target speaker extraction system based on a CNN-LSTM separator network and a speaker embedder network requiring reference speech of the target speaker.

Target Speaker Extraction

Deep Multi-Frame MVDR Filtering for Single-Microphone Speech Enhancement

1 code implementation20 Nov 2020 Marvin Tammen, Simon Doclo

Multi-frame algorithms for single-microphone speech enhancement, e. g., the multi-frame minimum variance distortionless response (MFMVDR) filter, are able to exploit speech correlation across adjacent time frames in the short-time Fourier transform (STFT) domain.

Speech Enhancement

Improving auditory attention decoding performance of linear and non-linear methods using state-space model

no code implementations2 Apr 2020 Ali Aroudi, Tobias de Taillez, Simon Doclo

In this paper, we investigate a state-space model using correlation coefficients obtained with a small correlation window to improve the decoding performance of the linear and the non-linear AAD methods.

EEG

DNN-Based Speech Presence Probability Estimation for Multi-Frame Single-Microphone Speech Enhancement

no code implementations21 May 2019 Marvin Tammen, Dörte Fischer, Bernd T. Meyer, Simon Doclo

In contrast to single-frame approaches such as the Wiener gain, it has been shown that multi-frame approaches achieve a substantial noise reduction with hardly any speech distortion, provided that an accurate estimate of the correlation matrices and especially the speech interframe correlation (IFC) vector is available.

Speech Enhancement

Binaural LCMV Beamforming with Partial Noise Estimation

no code implementations10 May 2019 Nico Gößling, Elior Hadad, Sharon Gannot, Simon Doclo

While the binaural minimum variance distortionless response (BMVDR) beamformer provides a good noise reduction performance and preserves the binaural cues of the desired source, it does not allow to control the reduction of the interfering sources and distorts the binaural cues of the interfering sources and the background noise.

Noise Estimation

Speech Dereverberation Using Nonnegative Convolutive Transfer Function and Spectro temporal Modeling

no code implementations16 Sep 2017 Nasser Mohammadiha, Simon Doclo

This paper presents two single channel speech dereverberation methods to enhance the quality of speech signals that have been recorded in an enclosed space.

Speech Dereverberation

A State-Space Approach to Dynamic Nonnegative Matrix Factorization

no code implementations31 Aug 2017 Nasser Mohammadiha, Paris Smaragdis, Ghazaleh Panahandeh, Simon Doclo

Nonnegative matrix factorization (NMF) has been actively investigated and used in a wide range of problems in the past decade.

Time Series Time Series Analysis

Cannot find the paper you are looking for? You can Submit a new open access paper.