Iterative Pseudo-Labeling for Speech Recognition
Pseudo-labeling has recently shown promise in end-to-end automatic speech recognition (ASR). We study Iterative Pseudo-Labeling (IPL), a semi-supervised algorithm which efficiently performs multiple iterations of pseudo-labeling on unlabeled data as the acoustic model evolves. In particular, IPL fine-tunes an existing model at each iteration using both labeled data and a subset of unlabeled data. We study the main components of IPL: decoding with a language model and data augmentation. We then demonstrate the effectiveness of IPL by achieving state-of-the-art word error rate on the LibriSpeech test sets in both standard and low-resource settings. We also study the effect of language models trained on different corpora to show that IPL can effectively utilize additional text. Finally, we release a new large in-domain text corpus which does not overlap with the LibriSpeech training transcriptions to foster research in low-resource, semi-supervised ASR.
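The abstract describes the IPL loop at a high level: decode a subset of the unlabeled audio with the current acoustic model and a language model, fine-tune the model on augmented labeled plus pseudo-labeled data, and repeat. The Python sketch below illustrates that loop under stated assumptions; `train`, `beam_search_with_lm`, and `augment` are hypothetical stand-ins for a real ASR stack, not functions from the paper's codebase.

```python
# Minimal sketch of Iterative Pseudo-Labeling (IPL), assuming hypothetical
# helpers (train, beam_search_with_lm, augment) in place of a real ASR stack.

import random
from typing import List, Tuple

def train(model, batches: List[Tuple[str, str]]):
    """Hypothetical: one fine-tuning pass over (audio, transcript) pairs."""
    return model  # placeholder: a real system would update model weights

def beam_search_with_lm(model, audio: str) -> str:
    """Hypothetical: decode audio with the acoustic model + external LM."""
    return f"pseudo-label({audio})"  # placeholder transcript

def augment(audio: str) -> str:
    """Hypothetical: data augmentation (e.g. SpecAugment-style masking)."""
    return audio  # placeholder: a real system would perturb the features

def iterative_pseudo_labeling(model, labeled, unlabeled,
                              num_iterations=5, subset_fraction=0.5):
    for _ in range(num_iterations):
        # Re-label a random subset of the unlabeled audio with the current
        # model, decoding with a language model.
        subset = random.sample(unlabeled, int(subset_fraction * len(unlabeled)))
        pseudo = [(audio, beam_search_with_lm(model, audio)) for audio in subset]
        # Fine-tune the existing model (rather than retraining from scratch)
        # on augmented labeled + pseudo-labeled data.
        batches = [(augment(a), t) for a, t in labeled + pseudo]
        model = train(model, batches)
    return model
```

The key design point the paper emphasizes is visible in the loop: the model is continually fine-tuned as pseudo-labels improve, and only a subset of the unlabeled data is re-labeled per iteration, which keeps the procedure efficient.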
Results from the Paper
Task | Dataset | Model | Metric Name | Metric Value | Global Rank
---|---|---|---|---|---
Speech Recognition | LibriSpeech test-clean | Conv + Transformer AM + Iterative Pseudo-Labeling (n-gram LM + Transformer Rescoring) | Word Error Rate (WER) | 2.10 | # 28
Speech Recognition | LibriSpeech test-other | Conv + Transformer AM + Iterative Pseudo-Labeling (n-gram LM + Transformer Rescoring) | Word Error Rate (WER) | 3.83 | # 13