Jasper: An End-to-End Convolutional Neural Acoustic Model

5 Apr 2019Jason LiVitaly LavrukhinBoris GinsburgRyan LearyOleksii KuchaievJonathan M. CohenHuyen NguyenRavi Teja Gadde

In this paper, we report state-of-the-art results on LibriSpeech among end-to-end speech recognition models without any external training data. Our model, Jasper, uses only 1D convolutions, batch normalization, ReLU, dropout, and residual connections... (read more)

PDF Abstract

Evaluation Results from the Paper


TASK DATASET MODEL METRIC NAME METRIC VALUE GLOBAL RANK COMPARE
Speech Recognition LibriSpeech test-other deep 1d convs + ctc + external lm rescoring Word Error Rate (WER) 8.79 # 5