End-to-end ASR: from Supervised to Semi-Supervised Learning with Modern Architectures

19 Nov 2019 · Gabriel Synnaeve, Qiantong Xu, Jacob Kahn, Tatiana Likhomanenko, Edouard Grave, Vineel Pratap, Anuroop Sriram, Vitaliy Liptchinsky, Ronan Collobert

We study pseudo-labeling for the semi-supervised training of ResNet, Time-Depth Separable ConvNets, and Transformers for speech recognition, with either CTC or Seq2Seq loss functions. We perform experiments on the standard LibriSpeech dataset, and leverage additional unlabeled data from LibriVox through pseudo-labeling...
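
As a rough illustration of what pseudo-labeling means in general, the sketch below runs one round on toy one-dimensional data: a model trained on labeled examples generates labels for unlabeled examples, and a new model is trained on the union. This is not the paper's pipeline, which trains acoustic models on LibriSpeech and labels LibriVox audio. The nearest-centroid "model", the function names (`train`, `predict`, `pseudo_label`), and the data are all hypothetical stand-ins chosen to keep the example self-contained.

```python
from statistics import mean


def train(examples):
    """Fit a toy nearest-centroid model: one mean feature value per label.

    Hypothetical stand-in for supervised training of an acoustic model.
    """
    by_label = {}
    for x, y in examples:
        by_label.setdefault(y, []).append(x)
    return {y: mean(xs) for y, xs in by_label.items()}


def predict(model, x):
    """Return the label whose centroid is closest to x."""
    return min(model, key=lambda y: abs(model[y] - x))


def pseudo_label(labeled, unlabeled):
    """Run one round of pseudo-labeling and return the retrained model."""
    # 1. Train a first model on the labeled data only.
    teacher = train(labeled)
    # 2. Use it to generate labels for the unlabeled data.
    generated = [(x, predict(teacher, x)) for x in unlabeled]
    # 3. Retrain on the labeled data plus the generated labels.
    student = train(labeled + generated)
    return student, generated


# Toy data (assumed for illustration): two well-separated classes.
labeled = [(0.0, "a"), (1.0, "a"), (9.0, "b"), (10.0, "b")]
unlabeled = [2.0, 3.0, 7.0, 8.0]

student, generated = pseudo_label(labeled, unlabeled)
```

In a speech setting the inputs would be audio and the generated labels would be transcriptions, but the three-step structure is the same.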

