no code implementations • IWSLT 2016 • Thai-Son Nguyen, Markus Müller, Matthias Sperber, Thomas Zenkel, Kevin Kilgour, Sebastian Stüker, Alex Waibel
For the English TED task, our best combination system has a WER of 7. 8% on the development set while our other combinations gained 21. 8% and 28. 7% WERs for the English and German MSLT tasks.
no code implementations • 12 Apr 2022 • Kevin Kilgour, Beat Gfeller, Qingqing Huang, Aren Jansen, Scott Wisdom, Marco Tagliasacchi
The second model, SoundFilter, takes a mixed source audio clip as an input and separates it based on a conditioning vector from the shared text-audio representation defined by SoundWords, making the model agnostic to the conditioning modality.
no code implementations • 4 Jun 2021 • Abhijeet Awasthi, Kevin Kilgour, Hassan Rom
Towards easily customizable KWS models, we present KeySEM (Keyword Speech EMbedding), a speech embedding model pre-trained on the task of recognizing a large number of keywords.
no code implementations • 22 Mar 2020 • Thai Son Nguyen, Jan Niehues, Eunah Cho, Thanh-Le Ha, Kevin Kilgour, Markus Muller, Matthias Sperber, Sebastian Stueker, Alex Waibel
User studies have shown that reducing the latency of our simultaneous lecture translation system should be the most important goal.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 31 Jan 2020 • James Lin, Kevin Kilgour, Dominik Roblek, Matthew Sharifi
With the rise of low power speech-enabled devices, there is a growing demand to quickly produce models for recognizing arbitrary sets of keywords.
Ranked #10 on Keyword Spotting on Google Speech Commands (Google Speech Commands V2 12 metric)
no code implementations • 31 Oct 2018 • David B. Ramsay, Kevin Kilgour, Dominik Roblek, Matthew Sharifi
Low power digital signal processors (DSPs) typically have a very limited amount of memory in which to cache data.
no code implementations • 29 Nov 2017 • Blaise Agüera y Arcas, Beat Gfeller, Ruiqi Guo, Kevin Kilgour, Sanjiv Kumar, James Lyon, Julian Odell, Marvin Ritter, Dominik Roblek, Matthew Sharifi, Mihajlo Velimirović
To reduce battery consumption, a small music detector runs continuously on the mobile device's DSP chip and wakes up the main application processor only when it is confident that music is present.
no code implementations • NAACL 2016 • Markus M{\"u}ller, Thai Son Nguyen, Jan Niehues, Eunah Cho, Bastian Kr{\"u}ger, Thanh-Le Ha, Kevin Kilgour, Matthias Sperber, Mohammed Mediani, Sebastian St{\"u}ker, Alex Waibel