Search Results for author: Kevin Kilgour

Found 8 papers, 0 papers with code

The 2016 KIT IWSLT Speech-to-Text Systems for English and German

no code implementations • IWSLT 2016 • Thai-Son Nguyen, Markus Müller, Matthias Sperber, Thomas Zenkel, Kevin Kilgour, Sebastian Stüker, Alex Waibel

For the English TED task, our best combination system has a WER of 7.8% on the development set, while our other combinations achieved WERs of 21.8% and 28.7% on the English and German MSLT tasks, respectively.

Text-Driven Separation of Arbitrary Sounds

no code implementations • 12 Apr 2022 • Kevin Kilgour, Beat Gfeller, Qingqing Huang, Aren Jansen, Scott Wisdom, Marco Tagliasacchi

The second model, SoundFilter, takes a mixed source audio clip as an input and separates it based on a conditioning vector from the shared text-audio representation defined by SoundWords, making the model agnostic to the conditioning modality.

Teaching keyword spotters to spot new keywords with limited examples

no code implementations • 4 Jun 2021 • Abhijeet Awasthi, Kevin Kilgour, Hassan Rom

Towards easily customizable KWS models, we present KeySEM (Keyword Speech EMbedding), a speech embedding model pre-trained on the task of recognizing a large number of keywords.

Keyword Spotting

Low Latency ASR for Simultaneous Speech Translation

no code implementations • 22 Mar 2020 • Thai Son Nguyen, Jan Niehues, Eunah Cho, Thanh-Le Ha, Kevin Kilgour, Markus Muller, Matthias Sperber, Sebastian Stueker, Alex Waibel

User studies have shown that reducing the latency of our simultaneous lecture translation system should be the most important goal.

Automatic Speech Recognition (ASR) +2

Training Keyword Spotters with Limited and Synthesized Speech Data

no code implementations • 31 Jan 2020 • James Lin, Kevin Kilgour, Dominik Roblek, Matthew Sharifi

With the rise of low power speech-enabled devices, there is a growing demand to quickly produce models for recognizing arbitrary sets of keywords.

Ranked #10 on Keyword Spotting on Google Speech Commands (Google Speech Commands V2 12 metric)

Keyword Spotting

Low-Dimensional Bottleneck Features for On-Device Continuous Speech Recognition

no code implementations • 31 Oct 2018 • David B. Ramsay, Kevin Kilgour, Dominik Roblek, Matthew Sharifi

Low power digital signal processors (DSPs) typically have a very limited amount of memory in which to cache data.

Speech Recognition

Now Playing: Continuous low-power music recognition

no code implementations • 29 Nov 2017 • Blaise Agüera y Arcas, Beat Gfeller, Ruiqi Guo, Kevin Kilgour, Sanjiv Kumar, James Lyon, Julian Odell, Marvin Ritter, Dominik Roblek, Matthew Sharifi, Mihajlo Velimirović

To reduce battery consumption, a small music detector runs continuously on the mobile device's DSP chip and wakes up the main application processor only when it is confident that music is present.

Lecture Translator - Speech translation framework for simultaneous lecture translation

no code implementations • NAACL 2016 • Markus Müller, Thai Son Nguyen, Jan Niehues, Eunah Cho, Bastian Krüger, Thanh-Le Ha, Kevin Kilgour, Matthias Sperber, Mohammed Mediani, Sebastian Stüker, Alex Waibel

Automatic Speech Recognition (ASR) Machine Translation +1
