1 code implementation • 8 Mar 2024 • Vikas Tokala, Eric Grinstein, Mike Brookes, Simon Doclo, Jesper Jensen, Patrick A. Naylor
Studies have shown that in noisy acoustic environments, providing binaural signals to the user of an assistive listening device may improve speech intelligibility and spatial awareness.
no code implementations • 28 Dec 2023 • Simon W. McKnight, Aidan O. T. Hogg, Vincent W. Neo, Patrick A. Naylor
Experiment 1 also investigates aleatoric uncertainties and shows the model on both $\Phi$ and $\Psi$ has mean entropy 0. 927~bits (out of 4~bits) for correct predictions compared to 1. 896~bits for incorrect predictions which, along with entropy histogram shapes, shows the model helpfully indicates where it is uncertain.
no code implementations • 30 Nov 2023 • Sina Hafezi, Alastair H. Moore, Pierre H. Guiraud, Patrick A. Naylor, Jacob Donley, Vladimir Tourbabin, Thomas Lunner
(i) The first stage is a hybrid beamformer based on a dictionary of weights corresponding to a set of noise field models.
1 code implementation • 8 Aug 2023 • Eric Grinstein, Vincent W. Neo, Patrick A. Naylor
In many signal processing applications, metadata may be advantageously used in conjunction with a high dimensional signal to produce a desired output.
1 code implementation • 28 Jun 2023 • Francesco Nespoli, Jule Pohlhausen, Patrick A. Naylor, Joerg Bitzer
The analysis of conversations recorded in everyday life requires privacy protection.
no code implementations • 28 Jun 2023 • Francesco Nespoli, Daniel Barreda, Joerg Bitzer, Patrick A. Naylor
In recent years, the need for privacy preservation when manipulating or storing personal data, including speech , has become a major issue.
no code implementations • 15 Mar 2023 • Sina Hafezi, Alastair H. Moore, Pierre Guiraud, Patrick A. Naylor, Jacob Donley, Vladimir Tourbabin, Thomas Lunner
A two-stage multi-channel speech enhancement method is proposed which consists of a novel adaptive beamformer, Hybrid Minimum Variance Distortionless Response (MVDR), Isotropic-MVDR (Iso), and a novel multi-channel spectral Principal Components Analysis (PCA) denoising.
no code implementations • 2 Dec 2022 • Francesco Nespoli, Daniel Barreda, Patrick A. Naylor
Any audio recording encapsulates the unique fingerprint of the associated acoustic environment, namely the background noise and reverberation.
no code implementations • 30 Sep 2022 • Vikas Tokala, Mike Brookes, Patrick A. Naylor
STOI-optimal masking has been previously proposed and developed for single-channel speech enhancement.
no code implementations • 25 Mar 2022 • Dushyant Sharma, Rong Gong, James Fosburgh, Stanislav Yu. Kruchinin, Patrick A. Naylor, Ljubomir Milanovic
We present a novel multi-channel front-end based on channel shortening with theWeighted Prediction Error (WPE) method followed by a fixed MVDR beamformer used in combination with a recently proposed self-attention-based channel combination (SACC) scheme, for tackling the distant ASR problem.
1 code implementation • 10 Jan 2019 • Constantinos Papayiannis, Christine Evers, Patrick A. Naylor
A representation of acoustic environments is proposed, which is used to train the GANs.
Audio and Speech Processing Sound