Streaming End-to-end Speech Recognition For Mobile Devices

15 Nov 2018 · Yanzhang He, Tara N. Sainath, Rohit Prabhavalkar, Ian McGraw, Raziel Alvarez, Ding Zhao, David Rybach, Anjuli Kannan, Yonghui Wu, Ruoming Pang, Qiao Liang, Deepti Bhatia, Yuan Shangguan, Bo Li, Golan Pundak, Khe Chai Sim, Tom Bagby, Shuo-yiin Chang, Kanishka Rao, Alexander Gruenstein

End-to-end (E2E) models, which directly predict output character sequences given input speech, are good candidates for on-device speech recognition. E2E models, however, present numerous challenges: In order to be truly useful, such models must decode speech utterances in a streaming fashion, in real time; they must be robust to the long tail of use cases; they must be able to leverage user-specific context (e.g., contact lists); and above all, they must be extremely accurate...

