Search Results for author: Patrick A. Naylor

Found 11 papers, 4 papers with code

Binaural Speech Enhancement Using Deep Complex Convolutional Transformer Networks

1 code implementation • 8 Mar 2024 • Vikas Tokala, Eric Grinstein, Mike Brookes, Simon Doclo, Jesper Jensen, Patrick A. Naylor

Studies have shown that in noisy acoustic environments, providing binaural signals to the user of an assistive listening device may improve speech intelligibility and spatial awareness.

Speech Enhancement

Paper
Code

Uncertainty Quantification in Machine Learning for Joint Speaker Diarization and Identification

no code implementations • 28 Dec 2023 • Simon W. McKnight, Aidan O. T. Hogg, Vincent W. Neo, Patrick A. Naylor

Experiment 1 also investigates aleatoric uncertainties and shows the model on both $\Phi$ and $\Psi$ has mean entropy 0. 927~bits (out of 4~bits) for correct predictions compared to 1. 896~bits for incorrect predictions which, along with entropy histogram shapes, shows the model helpfully indicates where it is uncertain.

speaker-diarization Speaker Diarization +1

Paper
Add Code

Subspace Hybrid MVDR Beamforming for Augmented Hearing

no code implementations • 30 Nov 2023 • Sina Hafezi, Alastair H. Moore, Pierre H. Guiraud, Patrick A. Naylor, Jacob Donley, Vladimir Tourbabin, Thomas Lunner

(i) The first stage is a hybrid beamformer based on a dictionary of weights corresponding to a set of noise field models.

Computational Efficiency Speech Enhancement

Paper
Add Code

Dual input neural networks for positional sound source localization

1 code implementation • 8 Aug 2023 • Eric Grinstein, Vincent W. Neo, Patrick A. Naylor

In many signal processing applications, metadata may be advantageously used in conjunction with a high dimensional signal to produce a desired output.

Paper
Code

Long-term Conversation Analysis: Exploring Utility and Privacy

1 code implementation • 28 Jun 2023 • Francesco Nespoli, Jule Pohlhausen, Patrick A. Naylor, Joerg Bitzer

The analysis of conversations recorded in everyday life requires privacy protection.

Action Detection Activity Detection +7

Paper
Code

Two-Stage Voice Anonymization for Enhanced Privacy

no code implementations • 28 Jun 2023 • Francesco Nespoli, Daniel Barreda, Joerg Bitzer, Patrick A. Naylor

In recent years, the need for privacy preservation when manipulating or storing personal data, including speech , has become a major issue.

Voice Conversion

Paper
Add Code

Subspace Hybrid Beamforming for Head-worn Microphone Arrays

no code implementations • 15 Mar 2023 • Sina Hafezi, Alastair H. Moore, Pierre Guiraud, Patrick A. Naylor, Jacob Donley, Vladimir Tourbabin, Thomas Lunner

A two-stage multi-channel speech enhancement method is proposed which consists of a novel adaptive beamformer, Hybrid Minimum Variance Distortionless Response (MVDR), Isotropic-MVDR (Iso), and a novel multi-channel spectral Principal Components Analysis (PCA) denoising.

Denoising Speech Enhancement

Paper
Add Code

Relative Acoustic Features for Distance Estimation in Smart-Homes

no code implementations • 2 Dec 2022 • Francesco Nespoli, Daniel Barreda, Patrick A. Naylor

Any audio recording encapsulates the unique fingerprint of the associated acoustic environment, namely the background noise and reverberation.

Room Impulse Response (RIR)

Paper
Add Code

Binaural Speech Enhancement Using STOI-Optimal Masks

no code implementations • 30 Sep 2022 • Vikas Tokala, Mike Brookes, Patrick A. Naylor

STOI-optimal masking has been previously proposed and developed for single-channel speech enhancement.

Speech Enhancement

Paper
Add Code

Spatial Processing Front-End For Distant ASR Exploiting Self-Attention Channel Combinator

no code implementations • 25 Mar 2022 • Dushyant Sharma, Rong Gong, James Fosburgh, Stanislav Yu. Kruchinin, Patrick A. Naylor, Ljubomir Milanovic

We present a novel multi-channel front-end based on channel shortening with theWeighted Prediction Error (WPE) method followed by a fixed MVDR beamformer used in combination with a recently proposed self-attention-based channel combination (SACC) scheme, for tackling the distant ASR problem.

Paper
Add Code

Data Augmentation of Room Classifiers using Generative Adversarial Networks

1 code implementation • 10 Jan 2019 • Constantinos Papayiannis, Christine Evers, Patrick A. Naylor

A representation of acoustic environments is proposed, which is used to train the GANs.

Audio and Speech Processing Sound

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.