no code implementations • 24 Nov 2023 • Sarah J. Gascoigne, Nathan Evans, Gerard Hall, Csaba Kozma, Mariella Panagiotopoulou, Gabrielle M. Schroeder, Callum Simpson, Christopher Thornton, Frances Turner, Heather Woodhouse, Jess Blickwedel, Fahmida Chowdhury, Beate Diehl, John S. Duncan, Ryan Faulder, Rhys H. Thomas, Kevin Wilson, Peter N. Taylor, Yujiang Wang
Retrospective analysis of icEEG recordings from 63 subjects (735 seizures) identified seizure onset regions through visual inspection and algorithmic delineation.
1 code implementation • 11 May 2023 • Mike Diessner, Kevin Wilson, Richard D. Whalley
NUBO, short for Newcastle University Bayesian Optimisation, is a Bayesian optimisation framework for the optimisation of expensive-to-evaluate black-box functions, such as physical experiments and computer simulators.
1 code implementation • 19 Jul 2022 • Mike Diessner, Joseph O'Connor, Andrew Wynn, Sylvain Laizet, Yu Guan, Kevin Wilson, Richard D. Whalley
To illustrate how these findings can be used to inform a Bayesian optimization setup tailored to a specific problem, two simulations in the area of computational fluid dynamics are optimized, giving evidence that suitable solutions can be found in a small number of evaluations of the objective function for complex, real problems.
1 code implementation • 30 Jun 2022 • Sarah J. Gascoigne, Leonard Waldmann, Mariella Panagiotopoulou, Fahmida Chowdhury, Alison Cronie, Beate Diehl, John S. Duncan, Jennifer Falconer, Yu Guan, Veronica Leach, Shona Livingstone, Christoforos Papasavvas, Ryan Faulder, Jess Blickwedel, Gabrielle M. Schroeder, Rhys H. Thomas, Kevin Wilson, Peter N. Taylor, Yujiang Wang
In individual patients, 71% had a moderate to large difference (ranksum r > 0. 3) between focal and subclinical seizures in three or more markers.
no code implementations • 5 May 2021 • Soumi Maiti, Hakan Erdogan, Kevin Wilson, Scott Wisdom, Shinji Watanabe, John R. Hershey
We present an end-to-end deep network model that performs meeting diarization from single-channel audio recordings.
1 code implementation • 9 Sep 2020 • Quan Wang, Ignacio Lopez Moreno, Mert Saglam, Kevin Wilson, Alan Chiao, Renjie Liu, Yanzhang He, Wei Li, Jason Pelecanos, Marily Nika, Alexander Gruenstein
We introduce VoiceFilter-Lite, a single-channel source separation model that runs on the device to preserve only the speech signals from a target user, as part of a streaming speech recognition system.
no code implementations • NeurIPS 2020 • Scott Wisdom, Efthymios Tzinis, Hakan Erdogan, Ron J. Weiss, Kevin Wilson, John R. Hershey
In such supervised approaches, a model is trained to predict the component sources from synthetic mixtures created by adding up isolated ground-truth sources.
no code implementations • 18 Nov 2019 • Zhong-Qiu Wang, Hakan Erdogan, Scott Wisdom, Kevin Wilson, Desh Raj, Shinji Watanabe, Zhuo Chen, John R. Hershey
This work introduces sequential neural beamforming, which alternates between neural network based spectral separation and beamforming based spatial separation.
no code implementations • 8 May 2019 • Ilya Kavalerov, Scott Wisdom, Hakan Erdogan, Brian Patton, Kevin Wilson, Jonathan Le Roux, John R. Hershey
For learnable bases, shorter windows (2. 5 ms) work best on all tasks.
no code implementations • 20 Nov 2018 • Scott Wisdom, John R. Hershey, Kevin Wilson, Jeremy Thorpe, Michael Chinen, Brian Patton, Rif A. Saurous
Furthermore, the only previous approaches that apply mixture consistency use real-valued masks; mixture consistency has been ignored for complex-valued masks.
Sound Audio and Speech Processing
4 code implementations • 11 Oct 2018 • Quan Wang, Hannah Muckenhirn, Kevin Wilson, Prashant Sridhar, Zelin Wu, John Hershey, Rif A. Saurous, Ron J. Weiss, Ye Jia, Ignacio Lopez Moreno
In this paper, we present a novel system that separates the voice of a target speaker from multi-speaker signals, by making use of a reference signal from the target speaker.
1 code implementation • 2 Aug 2018 • Sourish Chaudhuri, Joseph Roth, Daniel P. W. Ellis, Andrew Gallagher, Liat Kaver, Radhika Marvin, Caroline Pantofaru, Nathan Reale, Loretta Guarino Reid, Kevin Wilson, Zhonghua Xi
Speech activity detection (or endpointing) is an important processing step for applications such as speech recognition, language identification and speaker diarization.
Sound Audio and Speech Processing
5 code implementations • 10 Apr 2018 • Ariel Ephrat, Inbar Mosseri, Oran Lang, Tali Dekel, Kevin Wilson, Avinatan Hassidim, William T. Freeman, Michael Rubinstein
Solving this task using only audio as input is extremely challenging and does not provide an association of the separated speech signals with speakers in the video.
no code implementations • 28 Nov 2016 • Brian Patton, Yannis Agiomyrgiannakis, Michael Terry, Kevin Wilson, Rif A. Saurous, D. Sculley
Developers of text-to-speech synthesizers (TTS) often make use of human raters to assess the quality of synthesized speech.
16 code implementations • 29 Sep 2016 • Shawn Hershey, Sourish Chaudhuri, Daniel P. W. Ellis, Jort F. Gemmeke, Aren Jansen, R. Channing Moore, Manoj Plakal, Devin Platt, Rif A. Saurous, Bryan Seybold, Malcolm Slaney, Ron J. Weiss, Kevin Wilson
Convolutional Neural Networks (CNNs) have proven very effective in image classification and show promise for audio.