TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Speech Recognition	LibriSpeech test-clean	CTC-CRF 4gram-LM	Word Error Rate (WER)	4.09	# 44
Speech Recognition	LibriSpeech test-other	CTC-CRF 4gram-LM	Word Error Rate (WER)	10.65	# 42
Speech Recognition	WSJ dev93	Convolutional Speech Recognition	Word Error Rate (WER)	6.23	# 3
Speech Recognition	WSJ eval92	CTC-CRF 4gram-LM	Word Error Rate (WER)	3.79	# 14
Speech Recognition	WSJ eval93	CTC-CRF 4gram-LM	Word Error Rate (WER)	6.23	# 2
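
Every row above reports Word Error Rate (WER), conventionally defined as the number of word substitutions, deletions and insertions needed to turn the hypothesis into the reference, divided by the number of reference words. The following is a minimal sketch of that standard definition, not the scoring tool used for these benchmarks. The function name `word_error_rate` and the example sentences are illustrative assumptions, and real scoring pipelines usually apply text normalization first.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER in percent using word-level Levenshtein distance.

    Hypothetical helper for illustration; tokenization is plain
    whitespace splitting with no text normalization.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr

    return 100.0 * prev[-1] / len(ref)


if __name__ == "__main__":
    # One substitution ("sat" -> "sit") and one deletion ("the"): 2 errors over 6 words.
    print(word_error_rate("the cat sat on the mat", "the cat sit on mat"))
```

Because insertions count as errors, WER computed this way can exceed 100 when the hypothesis is much longer than the reference.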

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/crf-based-single-stage-acoustic-modeling-with/speech-recognition-on-wsj-eval93)](https://paperswithcode.com/sota/speech-recognition-on-wsj-eval93?p=crf-based-single-stage-acoustic-modeling-with)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/crf-based-single-stage-acoustic-modeling-with/speech-recognition-on-wsj-dev93)](https://paperswithcode.com/sota/speech-recognition-on-wsj-dev93?p=crf-based-single-stage-acoustic-modeling-with)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/crf-based-single-stage-acoustic-modeling-with/speech-recognition-on-wsj-eval92)](https://paperswithcode.com/sota/speech-recognition-on-wsj-eval92?p=crf-based-single-stage-acoustic-modeling-with)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/crf-based-single-stage-acoustic-modeling-with/speech-recognition-on-librispeech-test-other)](https://paperswithcode.com/sota/speech-recognition-on-librispeech-test-other?p=crf-based-single-stage-acoustic-modeling-with)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/crf-based-single-stage-acoustic-modeling-with/speech-recognition-on-librispeech-test-clean)](https://paperswithcode.com/sota/speech-recognition-on-librispeech-test-clean?p=crf-based-single-stage-acoustic-modeling-with)`

CRF-based Single-stage Acoustic Modeling with CTC Topology

16 Apr 2019 · Hongyu Xiang, Zhijian Ou

In this paper, we develop conditional random field (CRF) based single-stage (SS) acoustic modeling with a connectionist temporal classification (CTC) inspired state topology, called CTC-CRF for short. CTC-CRF is conceptually simple: it places a CRF layer, using the special state topology, on top of features generated by the bottom neural network. Like SS-LF-MMI (lattice-free maximum-mutual-information), CTC-CRFs can be trained from scratch (flat-start), eliminating GMM-HMM pre-training and tree-building. Evaluation experiments are conducted on the WSJ, Switchboard and LibriSpeech datasets. In a head-to-head comparison, the CTC-CRF model using simple bidirectional LSTMs consistently outperforms the strong SS-LF-MMI across all three benchmark datasets, for both mono-phones and mono-chars. Additionally, CTC-CRFs avoid some ad-hoc operations in SS-LF-MMI.
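
To make the idea of a CRF over a CTC-style topology concrete, here is a toy sketch in Python. It relies only on two general facts: the CTC mapping merges consecutive repeated symbols and then drops blanks, and a CRF normalizes a path score globally over all paths rather than frame by frame. Everything else is an assumption for illustration: the names `collapse` and `crf_posterior`, the per-frame scores `frame_scores`, and the sequence-level term `log_prior` are hypothetical, and the brute-force enumeration stands in for whatever efficient computation a real implementation would use.

```python
import itertools
import math

BLANK = "-"  # hypothetical blank symbol


def collapse(path):
    """CTC mapping: merge consecutive repeats, then drop blanks."""
    labels = []
    prev = None
    for symbol in path:
        if symbol != prev and symbol != BLANK:
            labels.append(symbol)
        prev = symbol
    return tuple(labels)


def crf_posterior(frame_scores, symbols, log_prior):
    """Brute-force posterior over label sequences for a tiny input.

    frame_scores[t][s]: assumed per-frame score of symbol s at frame t.
    log_prior(labels):  assumed sequence-level score of a label sequence.
    The path score is their sum, exponentiated and normalized over ALL
    paths (global normalization), then summed per collapsed sequence.
    """
    totals = {}
    for path in itertools.product(symbols, repeat=len(frame_scores)):
        labels = collapse(path)
        score = sum(frame_scores[t][s] for t, s in enumerate(path))
        score += log_prior(labels)
        totals[labels] = totals.get(labels, 0.0) + math.exp(score)
    normalizer = sum(totals.values())
    return {labels: weight / normalizer for labels, weight in totals.items()}


if __name__ == "__main__":
    symbols = [BLANK, "a"]
    flat = [{BLANK: 0.0, "a": 0.0}, {BLANK: 0.0, "a": 0.0}]
    print(crf_posterior(flat, symbols, lambda labels: 0.0))
```

With two frames, flat scores and a zero prior, three of the four paths collapse to the single label `a` and one collapses to the empty sequence, so the posterior mass splits 0.75 to 0.25. The enumeration grows exponentially with the number of frames, so this is only useful for checking intuition on tiny inputs.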

Code

thu-spmi/cat

307

Tasks

Benchmarking

Speech Recognition

Datasets

LibriSpeech

Results from the Paper

Ranked #2 on Speech Recognition on WSJ eval93
