TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Speech Enhancement	Deep Noise Suppression (DNS) Challenge	Sudo rm-rf (U=8)	SI-SDR-WB	18.6	# 4
Speech Enhancement	Deep Noise Suppression (DNS) Challenge	Sudo rm-rf (U=8)	PESQ-WB	2.69	# 13
Speech Enhancement	Deep Noise Suppression (DNS) Challenge	RemixIT (w Sudo U=32)	SI-SDR-WB	18.0	# 5
Speech Enhancement	Deep Noise Suppression (DNS) Challenge	RemixIT (w Sudo U=32)	PESQ-WB	2.60	# 15

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/continual-self-training-with-bootstrapped/speech-enhancement-on-deep-noise-suppression)](https://paperswithcode.com/sota/speech-enhancement-on-deep-noise-suppression?p=continual-self-training-with-bootstrapped)`

Continual self-training with bootstrapped remixing for speech enhancement

19 Oct 2021 · Efthymios Tzinis, Yossi Adi, Vamsi K. Ithapu, Buye Xu, Anurag Kumar

We propose RemixIT, a simple and novel self-supervised training method for speech enhancement. The proposed method is based on a continuous self-training scheme that overcomes limitations of previous studies, including assumptions about the in-domain noise distribution and the need for access to clean target signals. Specifically, a separation teacher model is pre-trained on an out-of-domain dataset and is used to infer estimated target signals for a batch of in-domain mixtures. Next, we bootstrap the mixing process by generating artificial mixtures using permuted estimated clean and noise signals. Finally, the student model is trained using the permuted estimated sources as targets, while we periodically update the teacher's weights using the latest student model. Our experiments show that RemixIT outperforms several previous state-of-the-art self-supervised methods on multiple speech enhancement tasks. Additionally, RemixIT provides a seamless alternative for semi-supervised and unsupervised domain adaptation for speech enhancement tasks, while being general enough to be applied to any separation task and paired with any separation model.
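The remixing and teacher-update steps described in the abstract can be illustrated with a minimal, pure-Python sketch. This is not the authors' implementation: the names `toy_teacher`, `remix`, and `maybe_update_teacher`, the fixed 0.75/0.25 split, and the update period are all hypothetical stand-ins chosen for illustration, and a real setup would use a neural separation model and a training loss.

```python
import random


def toy_teacher(mixture):
    """Hypothetical stand-in for a pre-trained separation model.

    Splits a mixture into a 'speech' estimate and a 'noise' estimate
    that sum back to the input. A real teacher would be a neural network.
    """
    speech = [0.75 * x for x in mixture]
    noise = [0.25 * x for x in mixture]
    return speech, noise


def remix(speech_est, noise_est, rng):
    """Bootstrapped remixing over one batch (illustrative only).

    Pairs each estimated speech signal with an estimated noise signal
    taken from a random permutation of the batch, then sums them to
    form new artificial mixtures. The student would be trained to
    recover (speech_est, permuted_noise) from these mixtures.
    """
    perm = list(range(len(noise_est)))
    rng.shuffle(perm)
    permuted_noise = [noise_est[j] for j in perm]
    mixtures = [
        [s + n for s, n in zip(speech, noise)]
        for speech, noise in zip(speech_est, permuted_noise)
    ]
    return mixtures, speech_est, permuted_noise


def maybe_update_teacher(teacher_weights, student_weights, step, every):
    """Periodic teacher update: copy the latest student weights.

    The period `every` is an assumed hyperparameter for this sketch.
    """
    if step > 0 and step % every == 0:
        return list(student_weights)
    return teacher_weights


rng = random.Random(0)
batch = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0], [7.0, 8.0, 9.0]]  # in-domain mixtures
estimates = [toy_teacher(m) for m in batch]
speech_est = [s for s, _ in estimates]
noise_est = [n for _, n in estimates]
new_mixtures, target_speech, target_noise = remix(speech_est, noise_est, rng)
```

Each entry of `new_mixtures` is the sum of one teacher speech estimate and one permuted teacher noise estimate, so the student never needs clean in-domain targets. Calling `maybe_update_teacher` inside a training loop would refresh the teacher from the student every `every` steps.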

Code

etzinis/unsup_speech_enh_adaptation

Tasks

Domain Adaptation

Speech Enhancement

Unsupervised Domain Adaptation

Datasets

VCTK

FSD50K

WHAM!

DNS Challenge

Results from the Paper

Ranked #13 on Speech Enhancement on Deep Noise Suppression (DNS) Challenge
