Search Results for author: Patrick A. Naylor

Found 11 papers, 4 papers with code

Binaural Speech Enhancement Using Deep Complex Convolutional Transformer Networks

1 code implementation8 Mar 2024 Vikas Tokala, Eric Grinstein, Mike Brookes, Simon Doclo, Jesper Jensen, Patrick A. Naylor

Studies have shown that in noisy acoustic environments, providing binaural signals to the user of an assistive listening device may improve speech intelligibility and spatial awareness.

Speech Enhancement

Uncertainty Quantification in Machine Learning for Joint Speaker Diarization and Identification

no code implementations28 Dec 2023 Simon W. McKnight, Aidan O. T. Hogg, Vincent W. Neo, Patrick A. Naylor

Experiment 1 also investigates aleatoric uncertainties and shows the model on both $\Phi$ and $\Psi$ has mean entropy 0. 927~bits (out of 4~bits) for correct predictions compared to 1. 896~bits for incorrect predictions which, along with entropy histogram shapes, shows the model helpfully indicates where it is uncertain.

speaker-diarization Speaker Diarization +1

Dual input neural networks for positional sound source localization

1 code implementation8 Aug 2023 Eric Grinstein, Vincent W. Neo, Patrick A. Naylor

In many signal processing applications, metadata may be advantageously used in conjunction with a high dimensional signal to produce a desired output.

Two-Stage Voice Anonymization for Enhanced Privacy

no code implementations28 Jun 2023 Francesco Nespoli, Daniel Barreda, Joerg Bitzer, Patrick A. Naylor

In recent years, the need for privacy preservation when manipulating or storing personal data, including speech , has become a major issue.

Voice Conversion

Subspace Hybrid Beamforming for Head-worn Microphone Arrays

no code implementations15 Mar 2023 Sina Hafezi, Alastair H. Moore, Pierre Guiraud, Patrick A. Naylor, Jacob Donley, Vladimir Tourbabin, Thomas Lunner

A two-stage multi-channel speech enhancement method is proposed which consists of a novel adaptive beamformer, Hybrid Minimum Variance Distortionless Response (MVDR), Isotropic-MVDR (Iso), and a novel multi-channel spectral Principal Components Analysis (PCA) denoising.

Denoising Speech Enhancement

Relative Acoustic Features for Distance Estimation in Smart-Homes

no code implementations2 Dec 2022 Francesco Nespoli, Daniel Barreda, Patrick A. Naylor

Any audio recording encapsulates the unique fingerprint of the associated acoustic environment, namely the background noise and reverberation.

Room Impulse Response (RIR)

Binaural Speech Enhancement Using STOI-Optimal Masks

no code implementations30 Sep 2022 Vikas Tokala, Mike Brookes, Patrick A. Naylor

STOI-optimal masking has been previously proposed and developed for single-channel speech enhancement.

Speech Enhancement

Spatial Processing Front-End For Distant ASR Exploiting Self-Attention Channel Combinator

no code implementations25 Mar 2022 Dushyant Sharma, Rong Gong, James Fosburgh, Stanislav Yu. Kruchinin, Patrick A. Naylor, Ljubomir Milanovic

We present a novel multi-channel front-end based on channel shortening with theWeighted Prediction Error (WPE) method followed by a fixed MVDR beamformer used in combination with a recently proposed self-attention-based channel combination (SACC) scheme, for tackling the distant ASR problem.

Data Augmentation of Room Classifiers using Generative Adversarial Networks

1 code implementation10 Jan 2019 Constantinos Papayiannis, Christine Evers, Patrick A. Naylor

A representation of acoustic environments is proposed, which is used to train the GANs.

Audio and Speech Processing Sound

Cannot find the paper you are looking for? You can Submit a new open access paper.