The IBM 2016 English Conversational Telephone Speech Recognition System

27 Apr 2016  ·  George Saon, Tom Sercu, Steven Rennie, Hong-Kwang J. Kuo ·

We describe a collection of acoustic and language modeling techniques that lowered the word error rate of our English conversational telephone LVCSR system to a record 6.6% on the Switchboard subset of the Hub5 2000 evaluation test set. On the acoustic side, we use a score fusion of three strong models: recurrent nets with maxout activations, very deep convolutional nets with 3x3 kernels, and bidirectional long short-term memory nets which operate on FMLLR and i-vector features. On the language modeling side, we use an updated model "M" and hierarchical neural network LMs.
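The abstract's "score fusion" combines the per-frame outputs of the three acoustic models before decoding. A minimal sketch of one common approach, frame-level log-linear fusion, is below; the fusion weights and the three-model setup are illustrative assumptions, not the paper's exact recipe.

```python
import numpy as np

def fuse_scores(log_probs, weights):
    """Weighted sum of per-frame log-probabilities (each [T, num_states]),
    i.e. a log-linear score fusion, renormalized per frame."""
    fused = sum(w * lp for w, lp in zip(weights, log_probs))
    # Renormalize so each frame is again a log-distribution over states.
    fused -= np.logaddexp.reduce(fused, axis=1, keepdims=True)
    return fused

# Toy example: 4 frames, 5 HMM states, three models standing in for the
# maxout RNN, the VGG-style CNN, and the bidirectional LSTM.
T, S = 4, 5
rng = np.random.default_rng(0)
raw = [rng.standard_normal((T, S)) for _ in range(3)]
log_models = [m - np.logaddexp.reduce(m, axis=1, keepdims=True) for m in raw]
fused = fuse_scores(log_models, weights=[0.4, 0.3, 0.3])  # weights assumed
```

The fused scores then replace a single model's scores in the decoder; the weights are typically tuned on held-out data.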


| Task | Dataset | Model | Metric Name | Metric Value | Global Rank |
|---|---|---|---|---|---|
| Speech Recognition | swb_hub_500 WER fullSWBCH | RNN + VGG + LSTM acoustic model trained on SWB+Fisher+CH, N-gram + "model M" + NNLM language model | Percentage error | 12.2 | # 5 |
| Speech Recognition | Switchboard + Hub500 | RNN + VGG + LSTM acoustic model trained on SWB+Fisher+CH, N-gram + "model M" + NNLM language model | Percentage error | 6.6 | # 7 |
| Speech Recognition | Switchboard + Hub500 | IBM 2016 | Percentage error | 6.9 | # 9 |
