End-to-end speech recognition using lattice-free MMI

We present our work on end-to-end training of acoustic models using the lattice-free maximum mutual information (LF-MMI) objective function in the context of hidden Markov models. By end-to-end training, we mean flat-start training of a single DNN in one stage, without using any previously trained models, forced alignments, or state-tying decision trees. We use full biphones to enable context-dependent modeling without trees, and show that our end-to-end LF-MMI approach achieves results comparable to regular LF-MMI on well-known large-vocabulary tasks. We also compare with other end-to-end methods such as CTC in character-based and lexicon-free settings, and show a 5 to 25 percent relative reduction in word error rate on various large-vocabulary tasks while using significantly smaller models.
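The key idea behind tree-free context-dependent modeling is that with only left-context biphones, the full set of contexts is small enough to enumerate directly, so no state-tying decision tree (and hence no prior alignment stage) is needed. A minimal sketch of building such a full-biphone output inventory, with an illustrative phone set that is not the paper's actual lexicon:

```python
# Hypothetical sketch: full-biphone output inventory without a tying tree.
# The phone list is illustrative only.
phones = ["sil", "ah", "k", "t"]

# Regular LF-MMI clusters context-dependent states with a decision tree
# built from forced alignments. The end-to-end variant instead keeps the
# full set of (left-context, phone) biphones, so the network output layer
# simply has one unit per biphone.
biphones = [(left, phone) for left in phones for phone in phones]

# Map each biphone to an output index (a "pdf-id" in Kaldi terminology).
pdf_id = {bp: i for i, bp in enumerate(biphones)}

print(len(biphones))  # |P|^2 = 16 output units for 4 phones
```

The output layer therefore grows quadratically in the phone-set size, which remains manageable for typical phone inventories (e.g. roughly 40^2 = 1600 units), unlike triphones, where the cubic blow-up is what historically made tying trees necessary.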



Results from the Paper


Task                 Dataset              Model              Metric                 Value  Global Rank
Speech Recognition   Switchboard (300hr)  End-to-end LF-MMI  Word Error Rate (WER)  9.3    #1
Speech Recognition   WSJ eval92           End-to-end LF-MMI  Word Error Rate (WER)  3.0    #6
