TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK	EXTRA DATA
Speech Recognition	LibriSpeech test-clean	tdnn + chain + rnnlm rescoring	Word Error Rate (WER)	3.06	# 39
Speech Recognition	LibriSpeech test-other	tdnn + chain + rnnlm rescoring	Word Error Rate (WER)	7.63	# 36

Neural Network Language Modeling with Letter-based Features and Importance Sampling

ICASSP 2018 · Hainan Xu, Ke Li, Yiming Wang, Jian Wang, Shiyin Kang, Xie Chen, Daniel Povey, Sanjeev Khudanpur

In this paper we describe an extension of the Kaldi software toolkit to support neural-based language modeling, intended for use in automatic speech recognition (ASR) and related tasks. We combine the use of subword features (letter n-grams) and one-hot encoding of frequent words so that the models can handle large vocabularies containing infrequent words. We propose a new objective function that allows for training of unnormalized probabilities. An importance-sampling-based method is supported to speed up training when the vocabulary is large. Experimental results on five corpora show that Kaldi-RNNLM rivals other recurrent neural network language model toolkits on both performance and training speed.
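
The word representation mentioned in the abstract (letter n-grams combined with a one-hot indicator for frequent words) can be illustrated with a minimal sketch. This is not the Kaldi-RNNLM implementation: the function names, the `^`/`$` boundary markers, the n-gram orders (2 to 3), and the choice to keep n-gram features for frequent words as well are all assumptions made for illustration.

```python
from collections import Counter


def letter_ngrams(word, min_n=2, max_n=3):
    """Count the letter n-grams of `word` for n in [min_n, max_n].

    Assumption: the word is padded with '^' and '$' so that n-grams can
    distinguish word-initial and word-final letters.
    """
    padded = "^" + word + "$"
    grams = Counter()
    for n in range(min_n, max_n + 1):
        for i in range(len(padded) - n + 1):
            grams[padded[i:i + n]] += 1
    return grams


def word_features(word, frequent_words, min_n=2, max_n=3):
    """Build a sparse feature dict for `word`.

    Every word gets letter n-gram counts; words in `frequent_words`
    additionally get a one-hot style indicator feature. Keys are
    (feature_type, value) tuples so the two feature families cannot collide.
    """
    features = {
        ("ngram", gram): count
        for gram, count in letter_ngrams(word, min_n, max_n).items()
    }
    if word in frequent_words:
        features[("word", word)] = 1
    return features


if __name__ == "__main__":
    frequent = {"the", "of", "and"}  # hypothetical frequent-word list
    print(word_features("the", frequent))
    print(word_features("zymurgy", frequent))
```

Under these assumptions, an infrequent word such as "zymurgy" still receives a non-empty feature vector built from its letter n-grams, which is how a model of this kind can cover a large vocabulary without a dedicated parameter vector for every rare word.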

Code

No code implementations yet.

Tasks

Automatic Speech Recognition

Automatic Speech Recognition (ASR)

Language Modelling

speech-recognition

Speech Recognition

Datasets

LibriSpeech

Results from the Paper

Ranked #36 on Speech Recognition on LibriSpeech test-other (using extra training data)
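
The metric reported for this paper, Word Error Rate (WER), is conventionally the word-level edit distance between the reference transcript and the recognizer output, divided by the number of reference words and expressed as a percentage. The sketch below shows that standard definition. It assumes whitespace tokenization and no text normalization, and the function name is hypothetical; it is not the scoring script used to produce the numbers on this page.

```python
def word_error_rate(reference, hypothesis):
    """Return WER in percent: (substitutions + deletions + insertions) / N.

    N is the number of words in the reference. Words are obtained by
    whitespace splitting (an assumption; real scoring tools usually
    normalize case and punctuation first).
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return 100.0 * prev[-1] / len(ref)


if __name__ == "__main__":
    print(word_error_rate("the cat sat on the mat", "the cat sat on mat"))
```

Because insertions count as errors, WER can exceed 100 when the hypothesis is much longer than the reference.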

Methods

SPEED
