TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Speech Separation	LibriCSS	Conformer (large)	0S	5.4	# 1
Speech Separation	LibriCSS	Conformer (large)	40%	17.1	# 1
Speech Separation	LibriCSS	Conformer (large)	0L	5.0	# 1
Speech Separation	LibriCSS	Conformer (large)	10%	7.5	# 1
Speech Separation	LibriCSS	Conformer (large)	20%	10.7	# 1
Speech Separation	LibriCSS	Conformer (large)	30%	13.8	# 1
Speech Separation	LibriCSS	Conformer (base)	0S	5.6	# 2
Speech Separation	LibriCSS	Conformer (base)	40%	18.9	# 2
Speech Separation	LibriCSS	Conformer (base)	0L	5.4	# 2
Speech Separation	LibriCSS	Conformer (base)	10%	8.2	# 2
Speech Separation	LibriCSS	Conformer (base)	20%	11.8	# 2
Speech Separation	LibriCSS	Conformer (base)	30%	15.5	# 2

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/continuous-speech-separation-with-conformer/speech-separation-on-libricss)](https://paperswithcode.com/sota/speech-separation-on-libricss?p=continuous-speech-separation-with-conformer)`

Continuous Speech Separation with Conformer

13 Aug 2020 · Sanyuan Chen, Yu Wu, Zhuo Chen, Jian Wu, Jinyu Li, Takuya Yoshioka, Chengyi Wang, Shujie Liu, Ming Zhou ·

Continuous speech separation plays a vital role in complicated speech related tasks such as conversation transcription. The separation model extracts a single speaker signal from a mixed speech. In this paper, we use transformer and conformer in lieu of recurrent neural networks in the separation system, as we believe capturing global information with the self-attention based method is crucial for the speech separation. Evaluating on the LibriCSS dataset, the conformer separation model achieves state of the art results, with a relative 23.5% word error rate (WER) reduction from bi-directional LSTM (BLSTM) in the utterance-wise evaluation and a 15.4% WER reduction in the continuous evaluation.

PDF Abstract

Code

Add Remove Mark official

Sanyuan-Chen/CSS_with_Conformer

103

Tasks

Add Remove

Speech Separation

Datasets

LibriCSS

Results from the Paper

Edit

Ranked #1 on Speech Separation on LibriCSS (using extra training data)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Speech Separation	LibriCSS	Conformer (large)	0S	5.4	# 1	Compare
			40%	17.1	# 1	Compare
			0L	5.0	# 1	Compare
			10%	7.5	# 1	Compare
			20%	10.7	# 1	Compare
			30%	13.8	# 1	Compare
Speech Separation	LibriCSS	Conformer (base)	0S	5.6	# 2	Compare
			40%	18.9	# 2	Compare
			0L	5.4	# 2	Compare
			10%	8.2	# 2	Compare
			20%	11.8	# 2	Compare
			30%	15.5	# 2	Compare

Methods

Add Remove

LSTM • Sigmoid Activation • Tanh Activation

Edit Social Preview

Continuous Speech Separation with Conformer

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove