no code implementations • 17 Sep 2021 • Felix Weninger, Marco Gaudesi, Ralf Leibold, Roberto Gemello, Puming Zhan
We use a single-channel encoder for CT speech and a multi-channel encoder with Spatial Filtering neural beamforming for FT speech, which are jointly trained with the encoder selection.
no code implementations • 27 Jul 2020 • Felix Weninger, Franco Mana, Roberto Gemello, Jesús Andrés-Ferrer, Puming Zhan
In the result, the Noisy Student algorithm with soft labels and consistency regularization achieves 10. 4% word error rate (WER) reduction when adding 475h of unlabeled data, corresponding to a recovery rate of 92%.