1 code implementation • Speech communication 2019 • Aaron Nicolson, Kuldip K. Paliwal
MMSE approaches utilising the proposed a priori SNR estimator are able to achieve higher enhanced speech quality and intelligibility scores than recent masking- and mapping-based deep learning approaches.
no code implementations • 27 Dec 2019 • Qiquan Zhang, Aaron Nicolson, Mingjiang Wang, Kuldip K. Paliwal, Chenxu Wang
Deep learning has achieved substantial improvement on single-channel speech enhancement tasks.
2 code implementations • 27 Feb 2020 • Mohammad Nikzad, Aaron Nicolson, Yongsheng Gao, Jun Zhou, Kuldip K. Paliwal, Fanhua Shang
Motivated by this, we propose the residual-dense lattice network (RDL-Net), which is a new CNN for speech enhancement that employs both residual and dense aggregations without over-allocating parameters for feature re-usage.
Ranked #16 on Speech Enhancement on VoiceBank + DEMAND
1 code implementation • 7 Sep 2020 • Timothy Roberts, Aaron Nicolson, Kuldip K. Paliwal
This DE measure was an extension of Perceptual Evaluation of Audio Quality, and required reference and test signals.
1 code implementation • 24 Jan 2022 • Aaron Nicolson, Jason Dowling, Bevan Koopman
Our experimental investigation demonstrates that the Convolutional vision Transformer (CvT) ImageNet-21K and the Distilled Generative Pre-trained Transformer 2 (DistilGPT2) checkpoints are best for warm starting the encoder and decoder, respectively.
1 code implementation • 19 Jul 2023 • Aaron Nicolson, Jason Dowling, Bevan Koopman
To improve diagnostic accuracy, we propose a CXR report generator that integrates aspects of the radiologist workflow and is trained with our proposed reward for reinforcement learning.