Search Results for author: Kevin Kilgour

Found 8 papers, 0 papers with code

The 2016 KIT IWSLT Speech-to-Text Systems for English and German

no code implementations IWSLT 2016 Thai-Son Nguyen, Markus Müller, Matthias Sperber, Thomas Zenkel, Kevin Kilgour, Sebastian Stüker, Alex Waibel

For the English TED task, our best combination system has a WER of 7. 8% on the development set while our other combinations gained 21. 8% and 28. 7% WERs for the English and German MSLT tasks.

Text-Driven Separation of Arbitrary Sounds

no code implementations12 Apr 2022 Kevin Kilgour, Beat Gfeller, Qingqing Huang, Aren Jansen, Scott Wisdom, Marco Tagliasacchi

The second model, SoundFilter, takes a mixed source audio clip as an input and separates it based on a conditioning vector from the shared text-audio representation defined by SoundWords, making the model agnostic to the conditioning modality.

Teaching keyword spotters to spot new keywords with limited examples

no code implementations4 Jun 2021 Abhijeet Awasthi, Kevin Kilgour, Hassan Rom

Towards easily customizable KWS models, we present KeySEM (Keyword Speech EMbedding), a speech embedding model pre-trained on the task of recognizing a large number of keywords.

Keyword Spotting

Training Keyword Spotters with Limited and Synthesized Speech Data

no code implementations31 Jan 2020 James Lin, Kevin Kilgour, Dominik Roblek, Matthew Sharifi

With the rise of low power speech-enabled devices, there is a growing demand to quickly produce models for recognizing arbitrary sets of keywords.

Ranked #10 on Keyword Spotting on Google Speech Commands (Google Speech Commands V2 12 metric)

Keyword Spotting

Now Playing: Continuous low-power music recognition

no code implementations29 Nov 2017 Blaise Agüera y Arcas, Beat Gfeller, Ruiqi Guo, Kevin Kilgour, Sanjiv Kumar, James Lyon, Julian Odell, Marvin Ritter, Dominik Roblek, Matthew Sharifi, Mihajlo Velimirović

To reduce battery consumption, a small music detector runs continuously on the mobile device's DSP chip and wakes up the main application processor only when it is confident that music is present.

Cannot find the paper you are looking for? You can Submit a new open access paper.