no code implementations • 6 Sep 2024 • Mattes Ohlenbusch, Christian Rollwage, Simon Doclo
Hearable devices, equipped with one or more microphones, are commonly used for speech communication.
no code implementations • 3 Sep 2024 • Klaus Brümann, Simon Doclo
Assuming the availability of an auxiliary microphone at an unknown position which is spatially separated from the CMA, in this paper we propose to compute the SRP-PHAT spectra between the microphones of the CMA based on the SRP-PHAT spectra between the auxiliary microphone and the microphones of the CMA.
no code implementations • 19 May 2024 • Mattes Ohlenbusch, Christian Rollwage, Simon Doclo
The proposed techniques use few recorded own voice signals to estimate transfer characteristics and can then be used to simulate a large amount of own voice signals based on single-channel speech signals.
no code implementations • 30 Apr 2024 • Mohammad Bokaei, Jesper Jensen, Simon Doclo, Jan Østergaard
Ensuring intelligible speech communication for hearing assistive devices in low-latency scenarios presents significant challenges in terms of speech enhancement, coding and transmission.
1 code implementation • 8 Mar 2024 • Vikas Tokala, Eric Grinstein, Mike Brookes, Simon Doclo, Jesper Jensen, Patrick A. Naylor
Studies have shown that in noisy acoustic environments, providing binaural signals to the user of an assistive listening device may improve speech intelligibility and spatial awareness.
no code implementations • 5 Feb 2024 • Marvin Tammen, Tsubasa Ochiai, Marc Delcroix, Tomohiro Nakatani, Shoko Araki, Simon Doclo
Although mask-based beamforming is a powerful speech enhancement approach, it often requires manual parameter tuning to handle moving speakers.
no code implementations • 16 Jan 2024 • Anselm Lohmann, Toon van Waterschoot, Joerg Bitzer, Simon Doclo
Reverberation can severely degrade the quality of speech signals recorded using microphones in an enclosure.
no code implementations • 15 Jan 2024 • Tong Xiao, Simon Doclo
Spatially selective active noise control (ANC) hearables are designed to reduce unwanted noise from certain directions while preserving desired sounds from other directions.
no code implementations • 15 Jan 2024 • Daniel Fejgin, Elior Hadad, Sharon Gannot, Zbyněk Koldovský, Simon Doclo
According to how the SPS are combined, frequency fusion mechanisms are categorized into narrowband, broadband, or speaker-grouped, where the latter mechanism requires a speaker-wise grouping of frequencies.
no code implementations • 14 Dec 2023 • Mattes Ohlenbusch, Christian Rollwage, Simon Doclo
Recording a sufficient amount of noise required for training such a system is costly since noise transmission between outer and inner microphones varies individually.
no code implementations • 4 Dec 2023 • Kaspar Müller, Bilgesu Çakmak, Paul Didier, Simon Doclo, Jan Østergaard, Tobias Wolff
Determining the head orientation of a talker is not only beneficial for various speech signal processing applications, such as source localization or speech enhancement, but also facilitates intuitive voice control and interaction with smart environments or modern car assistants.
no code implementations • 27 Oct 2023 • Wiebke Middelberg, Henri Gode, Simon Doclo
In many multi-microphone algorithms for noise reduction, an estimate of the relative transfer function (RTF) vector of the target speaker is required.
no code implementations • 25 Oct 2023 • Henri Gode, Simon Doclo
Instead of blocking the second speaker, in this paper we propose a covariance blocking and whitening (CBW) method, which first blocks the first speaker and applies whitening using the estimated noise covariance matrix and then estimates the RTF vector of the second speaker based on a singular value decomposition.
no code implementations • 10 Oct 2023 • Mattes Ohlenbusch, Christian Rollwage, Simon Doclo
In this paper, we propose a speech-dependent model of the own voice transfer characteristics based on phoneme recognition, assuming a linear time-invariant relative transfer function for each phoneme.
no code implementations • 15 Sep 2023 • Mattes Ohlenbusch, Christian Rollwage, Simon Doclo
To enhance the quality of the in-ear microphone signal using algorithms aiming at joint bandwidth extension, equalization, and noise reduction, it is desirable to have an accurate model of the own voice transfer characteristics between the entrance of the ear canal and the in-ear microphone.
no code implementations • 10 Jul 2023 • Daniel Fejgin, Simon Doclo
In hearing aid applications, an important objective is to accurately estimate the direction of arrival (DOA) of multiple speakers in noisy and reverberant environments.
no code implementations • 14 Jun 2023 • Daniel Fejgin, Wiebke Middelberg, Simon Doclo
There is an emerging need for comparable data for multi-microphone processing, particularly in acoustic sensor networks.
no code implementations • 13 Mar 2023 • Henri Gode, Simon Doclo
Interfering sources, background noise and reverberation degrade speech quality and intelligibility in hearing aid applications.
no code implementations • 18 Jan 2023 • Anselm Lohmann, Toon van Waterschoot, Joerg Bitzer, Simon Doclo
In the WPE algorithm, a prediction delay is required to reduce the correlation between the prediction signals and the direct component in the reference microphone signal.
no code implementations • 9 Dec 2022 • Ulrik Kowalk, Simon Doclo, Joerg Bitzer
Aiming at designing a supervised learning-based DoA estimation algorithm that generalizes well to different array geometries, in this paper we propose a geometry-aware DoA estimation algorithm.
no code implementations • 30 Nov 2022 • Daniel Fejgin, Simon Doclo
This method exploits the external microphones to estimate the RTF vector corresponding to the binaural hearing aid and constructs a one-dimensional spatial spectrum by comparing the estimated RTF vector against a database of anechoic prototype RTF vectors for several directions.
no code implementations • 4 Nov 2022 • Paul Didier, Toon van Waterschoot, Simon Doclo, Marc Moonen
Sampling rate offsets (SROs) between devices in a heterogeneous wireless acoustic sensor network (WASN) can hinder the ability of distributed adaptive algorithms to perform as intended when they rely on coherent signal processing.
no code implementations • 11 Jun 2022 • Ulrik Kowalk, Simon Doclo, Joerg Bitzer
Aiming at estimating the direction of arrival (DOA) of a desired speaker in a multi-talker environment using a microphone array, in this paper we propose a signal-informed method exploiting the availability of an external microphone attached to the desired speaker.
no code implementations • 27 May 2022 • Ragini Sinha, Marvin Tammen, Christian Rollwage, Simon Doclo
Target speaker extraction aims at extracting the target speaker from a mixture of multiple speakers exploiting auxiliary information about the target speaker.
no code implementations • 19 May 2022 • Wiebke Middelberg, Simon Doclo
In this paper, we perform a theoretical bias analysis for the SC-based RTF vector estimation method with multiple external microphones.
no code implementations • 18 May 2022 • Klaus Brümann, Simon Doclo
A popular approach for 3D source localization using multiple microphones is the steered-response power method, where the source position is directly estimated by maximizing a function of three continuous position variables.
no code implementations • 18 May 2022 • Daniel Fejgin, Simon Doclo
Recently, a method has been proposed to estimate the direction of arrival (DOA) of a single speaker by minimizing the frequency-averaged Hermitian angle between an estimated relative transfer function (RTF) vector and a database of prototype anechoic RTF vectors.
no code implementations • 18 May 2022 • Marvin Tammen, Simon Doclo
To improve speech intelligibility and speech quality in noisy environments, binaural noise reduction algorithms for head-mounted assistive listening devices are of crucial importance.
no code implementations • 18 May 2022 • Marvin Tammen, XiLin Li, Simon Doclo, Lalin Theverapperuma
In mobile speech communication applications, wind noise can lead to a severe reduction of speech quality and intelligibility.
no code implementations • 12 May 2022 • Mattes Ohlenbusch, Christian Rollwage, Simon Doclo
In this paper, we apply a deep learning-based bandwidth-extension system to the own voice reconstruction task and investigate different training strategies in order to overcome the limited availability of training data.
no code implementations • 7 Oct 2021 • Piero Rivera Benois, Reinhild Roden, Matthias Blau, Simon Doclo
In this paper we consider an in-ear headphone equipped with an inner microphone and multiple loudspeakers and we propose an optimization procedure with a convex objective function to derive a fixed multi-loudspeaker ANC controller aiming at minimizing the sound pressure at the ear drum.
no code implementations • 4 Oct 2021 • Henning Schepker, Reinhild Rohden, Florian Denk, Birger Kollmeier, Matthias Blau, Simon Doclo
To achieve optimal individualized equalization typically requires knowledge of all transfer functions between the source, the hearing device, and the individual eardrum.
no code implementations • 9 Sep 2021 • Henning Schepker, Florian Denk, Birger Kollmeier, Simon Doclo
To improve the sound quality of hearing devices, equalization filters can be used that aim at achieving acoustic transparency, i. e., listening with the device in the ear is perceptually similar to the open ear.
no code implementations • 3 Jun 2021 • Henri Gode, Marvin Tammen, Simon Doclo
To optimize the convolutional filter, the desired speech component is modeled with a time-varying Gaussian model, which promotes the sparsity of the desired speech component in the short-time Fourier transform domain compared to the noisy microphone signals.
no code implementations • 14 May 2021 • Piero Rivera Benois, Reinhild Roden, Matthias Blau, Simon Doclo
Based on measured acoustic paths to predict the sound pressure generated by external sources and the headphone at the ear drum, the FIR filter coefficients of the ANC controller are optimized for different sound fields.
no code implementations • 11 Apr 2021 • Daniel Fejgin, Simon Doclo
In this paper we consider a binaural hearing aid setup, where in addition to the head-mounted microphones an external microphone is available.
no code implementations • 9 Apr 2021 • Ragini Sinha, Marvin Tammen, Christian Rollwage, Simon Doclo
In this paper, we focus on a single-channel target speaker extraction system based on a CNN-LSTM separator network and a speaker embedder network requiring reference speech of the target speaker.
1 code implementation • 20 Nov 2020 • Marvin Tammen, Simon Doclo
Multi-frame algorithms for single-microphone speech enhancement, e. g., the multi-frame minimum variance distortionless response (MFMVDR) filter, are able to exploit speech correlation across adjacent time frames in the short-time Fourier transform (STFT) domain.
no code implementations • 2 Apr 2020 • Ali Aroudi, Tobias de Taillez, Simon Doclo
In this paper, we investigate a state-space model using correlation coefficients obtained with a small correlation window to improve the decoding performance of the linear and the non-linear AAD methods.
no code implementations • 21 May 2019 • Marvin Tammen, Dörte Fischer, Bernd T. Meyer, Simon Doclo
In contrast to single-frame approaches such as the Wiener gain, it has been shown that multi-frame approaches achieve a substantial noise reduction with hardly any speech distortion, provided that an accurate estimate of the correlation matrices and especially the speech interframe correlation (IFC) vector is available.
no code implementations • 10 May 2019 • Nico Gößling, Elior Hadad, Sharon Gannot, Simon Doclo
While the binaural minimum variance distortionless response (BMVDR) beamformer provides a good noise reduction performance and preserves the binaural cues of the desired source, it does not allow to control the reduction of the interfering sources and distorts the binaural cues of the interfering sources and the background noise.
no code implementations • 16 Sep 2017 • Nasser Mohammadiha, Simon Doclo
This paper presents two single channel speech dereverberation methods to enhance the quality of speech signals that have been recorded in an enclosed space.
no code implementations • 31 Aug 2017 • Nasser Mohammadiha, Paris Smaragdis, Ghazaleh Panahandeh, Simon Doclo
Nonnegative matrix factorization (NMF) has been actively investigated and used in a wide range of problems in the past decade.