TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Speech Recognition	LibriSpeech test-clean	Convolutional Speech Recognition	Word Error Rate (WER)	3.26	# 40
Speech Recognition	LibriSpeech test-other	Convolutional Speech Recognition	Word Error Rate (WER)	10.47	# 41
Speech Recognition	WSJ dev93	Convolutional Speech Recognition	Word Error Rate (WER)	6.8	# 4
Speech Recognition	WSJ eval92	Convolutional Speech Recognition	Word Error Rate (WER)	3.5	# 10
Speech Recognition	WSJ eval93	Convolutional Speech Recognition	Word Error Rate (WER)	6.8	# 3

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/fully-convolutional-speech-recognition/speech-recognition-on-wsj-eval93)](https://paperswithcode.com/sota/speech-recognition-on-wsj-eval93?p=fully-convolutional-speech-recognition)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/fully-convolutional-speech-recognition/speech-recognition-on-wsj-dev93)](https://paperswithcode.com/sota/speech-recognition-on-wsj-dev93?p=fully-convolutional-speech-recognition)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/fully-convolutional-speech-recognition/speech-recognition-on-wsj-eval92)](https://paperswithcode.com/sota/speech-recognition-on-wsj-eval92?p=fully-convolutional-speech-recognition)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/fully-convolutional-speech-recognition/speech-recognition-on-librispeech-test-clean)](https://paperswithcode.com/sota/speech-recognition-on-librispeech-test-clean?p=fully-convolutional-speech-recognition)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/fully-convolutional-speech-recognition/speech-recognition-on-librispeech-test-other)](https://paperswithcode.com/sota/speech-recognition-on-librispeech-test-other?p=fully-convolutional-speech-recognition)`

Fully Convolutional Speech Recognition

17 Dec 2018 · Neil Zeghidour, Qiantong Xu, Vitaliy Liptchinsky, Nicolas Usunier, Gabriel Synnaeve, Ronan Collobert ·

Current state-of-the-art speech recognition systems build on recurrent neural networks for acoustic and/or language modeling, and rely on feature extraction pipelines to extract mel-filterbanks or cepstral coefficients. In this paper we present an alternative approach based solely on convolutional neural networks, leveraging recent advances in acoustic models from the raw waveform and language modeling. This fully convolutional approach is trained end-to-end to predict characters from the raw waveform, removing the feature extraction step altogether. An external convolutional language model is used to decode words. On Wall Street Journal, our model matches the current state-of-the-art. On Librispeech, we report state-of-the-art performance among end-to-end models, including Deep Speech 2 trained with 12 times more acoustic data and significantly more linguistic data.

PDF Abstract

Code

Add Remove Mark official

No code implementations yet. Submit your code now

Tasks

Add Remove

Language Modelling

speech-recognition

Speech Recognition

Datasets

LibriSpeech

Results from the Paper

Edit

Ranked #3 on Speech Recognition on WSJ eval93

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Speech Recognition	LibriSpeech test-clean	Convolutional Speech Recognition	Word Error Rate (WER)	3.26	# 40	Compare
Speech Recognition	LibriSpeech test-other	Convolutional Speech Recognition	Word Error Rate (WER)	10.47	# 41	Compare
Speech Recognition	WSJ dev93	Convolutional Speech Recognition	Word Error Rate (WER)	6.8	# 4	Compare
Speech Recognition	WSJ eval92	Convolutional Speech Recognition	Word Error Rate (WER)	3.5	# 10	Compare
Speech Recognition	WSJ eval93	Convolutional Speech Recognition	Word Error Rate (WER)	6.8	# 3	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Fully Convolutional Speech Recognition

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove