TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Speech Separation	WSJ0-2mix	SPGM + DM	SI-SDRi	22.7	# 5
Speech Separation	WSJ0-2mix	SPGM + DM	Number of parameters (M)	26.2	# 4
Speech Separation	WSJ0-2mix	SPGM + DM	MACs (G)	77	# 3
Speech Separation	WSJ0-2mix	SPGM	SI-SDRi	22.1	# 10
Speech Separation	WSJ0-2mix	SPGM	Number of parameters (M)	26.2	# 4
Speech Separation	WSJ0-2mix	SPGM	MACs (G)	77	# 3

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/spgm-prioritizing-local-features-for-enhanced/speech-separation-on-wsj0-2mix)](https://paperswithcode.com/sota/speech-separation-on-wsj0-2mix?p=spgm-prioritizing-local-features-for-enhanced)`

SPGM: Prioritizing Local Features for enhanced speech separation performance

22 Sep 2023 · Jia Qi Yip, Shengkui Zhao, Yukun Ma, Chongjia Ni, Chong Zhang, Hao Wang, Trung Hieu Nguyen, Kun Zhou, Dianwen Ng, Eng Siong Chng, Bin Ma

Dual-path is a popular architecture for speech separation models (e.g. Sepformer). It splits long sequences into overlapping chunks and processes them with intra-blocks, which model local features within each chunk, and inter-blocks, which model global relationships across chunks. However, it has been found that inter-blocks, which comprise half of a dual-path model's parameters, contribute minimally to performance. Thus, we propose the Single-Path Global Modulation (SPGM) block to replace inter-blocks. SPGM is named after its structure, which consists of a parameter-free global pooling module followed by a modulation module comprising only 2% of the model's total parameters. The SPGM block allows all transformer layers in the model to be dedicated to local feature modelling, making the overall model single-path. SPGM achieves 22.1 dB SI-SDRi on WSJ0-2mix and 20.4 dB SI-SDRi on Libri2Mix, exceeding the performance of Sepformer by 0.5 dB and 0.3 dB respectively, and matches the performance of recent SOTA models with up to 8 times fewer parameters. Model and weights are available at huggingface.co/yipjiaqi/spgm
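The data flow described in the abstract (overlapping chunks, parameter-free pooling, then a small modulation step) can be illustrated with a toy sketch. This is not the authors' implementation: the abstract does not specify the modulation design, so the sigmoid gate below, the averaging of chunk vectors into a single context vector, and the names `split_into_chunks`, `global_pool`, `modulate`, `weight` and `bias` are all assumptions made for illustration.

```python
import math


def split_into_chunks(frames, chunk_size, hop):
    """Split a list of feature frames into overlapping chunks.

    The last chunk is zero-padded to chunk_size if the sequence runs out.
    """
    dim = len(frames[0])
    chunks = []
    start = 0
    while True:
        chunk = [list(f) for f in frames[start:start + chunk_size]]
        chunk += [[0.0] * dim for _ in range(chunk_size - len(chunk))]
        chunks.append(chunk)
        if start + chunk_size >= len(frames):
            break
        start += hop
    return chunks


def global_pool(chunks):
    """Parameter-free pooling: average over time inside each chunk,
    then average the chunk vectors into one context vector (assumption)."""
    dim = len(chunks[0][0])
    per_chunk = [
        [sum(frame[d] for frame in chunk) / len(chunk) for d in range(dim)]
        for chunk in chunks
    ]
    return [sum(vec[d] for vec in per_chunk) / len(per_chunk) for d in range(dim)]


def modulate(chunks, context, weight, bias):
    """Hypothetical modulation: a per-feature sigmoid gate computed from the
    pooled context rescales every frame. Only weight and bias are learned."""
    gate = [
        1.0 / (1.0 + math.exp(-(w * c + b)))
        for w, c, b in zip(weight, context, bias)
    ]
    return [[[x * g for x, g in zip(frame, gate)] for frame in chunk] for chunk in chunks]


# Toy usage: 6 frames of 2 features, chunks of 4 frames with a hop of 2.
frames = [[1.0, 2.0] for _ in range(6)]
chunks = split_into_chunks(frames, chunk_size=4, hop=2)
context = global_pool(chunks)
modulated = modulate(chunks, context, weight=[0.0, 0.0], bias=[0.0, 0.0])
```

The point of the sketch is the parameter budget: pooling has no weights at all, and the gate needs only two small vectors, so nearly all learned capacity can stay in the layers that model local features.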

Code

yipjiaqi/spgm official

Tasks

Speech Separation

Datasets

WSJ0-2mix LibriMix

Results from the Paper

Ranked #5 on Speech Separation on WSJ0-2mix

Task	Dataset	Model	Metric Name	Metric Value	Global Rank
Speech Separation	WSJ0-2mix	SPGM + DM	SI-SDRi	22.7	# 5
			Number of parameters (M)	26.2	# 4
			MACs (G)	77	# 3
Speech Separation	WSJ0-2mix	SPGM	SI-SDRi	22.1	# 10
			Number of parameters (M)	26.2	# 4
			MACs (G)	77	# 3
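The SI-SDRi values in the table are scale-invariant signal-to-distortion ratio improvements in dB: the SI-SDR of the separated estimate minus the SI-SDR of the unprocessed mixture, both measured against the clean reference. The following is a minimal sketch assuming the standard SI-SDR definition with mean removal; the paper's evaluation code may differ in details, and the function names and the `eps` constant are illustrative choices.

```python
import math


def _zero_mean(signal):
    """Remove the mean so that a DC offset does not affect the score."""
    mean = sum(signal) / len(signal)
    return [s - mean for s in signal]


def si_sdr(estimate, reference, eps=1e-12):
    """Scale-invariant SDR in dB between an estimate and a reference signal."""
    est = _zero_mean(estimate)
    ref = _zero_mean(reference)
    # Project the estimate onto the reference to get the scaled target.
    dot = sum(e * r for e, r in zip(est, ref))
    ref_energy = sum(r * r for r in ref)
    alpha = dot / (ref_energy + eps)
    target = [alpha * r for r in ref]
    # Everything not explained by the scaled reference counts as distortion.
    noise = [e - t for e, t in zip(est, target)]
    target_energy = sum(t * t for t in target)
    noise_energy = sum(n * n for n in noise)
    return 10.0 * math.log10((target_energy + eps) / (noise_energy + eps))


def si_sdr_improvement(estimate, mixture, reference):
    """SI-SDRi: gain of the estimate over the unprocessed mixture, in dB."""
    return si_sdr(estimate, reference) - si_sdr(mixture, reference)


# Toy usage with 4-sample signals chosen to be mutually orthogonal.
reference = [1.0, -1.0, 1.0, -1.0]
mixture = [r + i for r, i in zip(reference, [1.0, 1.0, -1.0, -1.0])]
estimate = [r + n for r, n in zip(reference, [0.1, 0.1, -0.1, -0.1])]
improvement = si_sdr_improvement(estimate, mixture, reference)
```

Because the estimate is projected onto the reference before the ratio is taken, rescaling the estimate by any non-zero constant leaves the score unchanged, which is what makes the metric scale-invariant.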
