no code implementations • 5 Feb 2021 • Ruizhi Li, Gregory Sell, Hynek Hermansky
Performance degradation of an Automatic Speech Recognition (ASR) system is commonly observed when the test acoustic condition is different from training.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 23 Oct 2019 • Ruizhi Li, Gregory Sell, Xiaofei Wang, Shinji Watanabe, Hynek Hermansky
The multi-stream paradigm of audio processing, in which several sources are simultaneously considered, has been an active research area for information fusion.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
no code implementations • 9 Apr 2019 • Ruizhi Li, Gregory Sell, Hynek Hermansky
Measuring performance of an automatic speech recognition (ASR) system without ground-truth could be beneficial in many scenarios, especially with data from unseen domains, where performance can be highly inconsistent.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
no code implementations • 6 Nov 2018 • Matthew Maciejewski, Gregory Sell, Leibny Paola Garcia-Perera, Shinji Watanabe, Sanjeev Khudanpur
To date, the bulk of research on single-channel speech separation has been conducted using clean, near-field, read speech, which is not representative of many modern applications.
no code implementations • 18 Aug 2015 • Aren Jansen, Gregory Sell, Vince Lyzinski
Several popular graph embedding techniques for representation learning and dimensionality reduction rely on performing computationally expensive eigendecompositions to derive a nonlinear transformation of the input data space.