Multi-Task Audio Source Separation

14 Jul 2021  ·  Lu Zhang, Chenxing Li, Feng Deng, Xiaorui Wang ·

Audio source separation tasks such as speech enhancement, speech separation, and music source separation have achieved impressive performance in recent studies. The powerful modeling capability of deep neural networks makes even more challenging tasks appear within reach. This paper launches a new multi-task audio source separation (MTASS) challenge: separating the speech, music, and noise signals from a monaural mixture. First, we introduce the details of this task and generate a dataset of mixtures containing speech, music, and background noise. Then, we propose an MTASS model in the complex domain to fully exploit the differences in spectral characteristics among the three audio signals. Specifically, the proposed model follows a two-stage pipeline that first separates the three types of audio signals and then performs signal compensation for each separately. After comparing different training targets, we select the complex ratio mask as the most suitable target for MTASS. The experimental results also indicate that the residual signal compensation module helps to further recover the signals. The proposed model shows significant advantages in separation performance over several well-known separation models.
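As a rough illustration of the complex ratio mask (cRM) training target mentioned above, the sketch below computes the cRM for one source from complex STFT frames using NumPy. The function names, array shapes, and the `eps` stabilizer are illustrative assumptions, not details from the paper:

```python
import numpy as np

def complex_ratio_mask(mix_stft, src_stft, eps=1e-8):
    """Complex ratio mask M such that src ≈ M * mix (complex multiplication).

    mix_stft, src_stft: complex arrays of shape (freq_bins, frames).
    Computed as src / mix via multiplication with the conjugate of mix;
    eps avoids division by zero in silent bins (an assumption, not from the paper).
    """
    denom = mix_stft.real ** 2 + mix_stft.imag ** 2 + eps
    m_real = (mix_stft.real * src_stft.real + mix_stft.imag * src_stft.imag) / denom
    m_imag = (mix_stft.real * src_stft.imag - mix_stft.imag * src_stft.real) / denom
    return m_real + 1j * m_imag

def apply_mask(mix_stft, mask):
    # Element-wise complex multiplication reconstructs the source spectrogram.
    return mix_stft * mask
```

In practice a model predicts the (often compressed) real and imaginary mask components and the ideal cRM above serves only as the training target; applying the ideal mask recovers the source spectrogram almost exactly.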


Datasets


Introduced in the Paper:

MTASS
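The abstract describes generating mixtures of speech, music, and background noise. A minimal sketch of how such a three-source mixture might be assembled is shown below; the SNR-based scaling relative to speech and the default SNR values are illustrative assumptions, and the paper's actual mixing recipe may differ:

```python
import numpy as np

def scale_to_snr(signal, reference, snr_db):
    """Scale `signal` so that `reference` is `snr_db` dB above it in power."""
    p_sig = np.mean(signal ** 2) + 1e-12
    p_ref = np.mean(reference ** 2) + 1e-12
    target_power = p_ref / (10 ** (snr_db / 10.0))
    return signal * np.sqrt(target_power / p_sig)

def make_mixture(speech, music, noise, music_snr_db=5.0, noise_snr_db=10.0):
    """Mix speech with music and noise at chosen SNRs relative to speech.

    Returns the monaural mixture plus the three scaled sources, which serve
    as the separation targets.
    """
    music = scale_to_snr(music, speech, music_snr_db)
    noise = scale_to_snr(noise, speech, noise_snr_db)
    mixture = speech + music + noise
    return mixture, (speech, music, noise)
```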

Used in the Paper:

AISHELL-1
Task: Multi-task Audio Source Separation
Dataset: MTASS
Model: Complex-MTASSNet

Metric          Value   Global Rank
SDRi (Speech)   12.57   # 1
SDRi (Music)     9.86   # 1
SDRi (Noise)     8.42   # 1
SDRi (Average)  10.28   # 1
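SDRi here is the signal-to-distortion ratio improvement: the SDR of the separated signal minus the SDR of the unprocessed mixture, both measured against the clean source. The sketch below uses a basic SDR definition (target energy over residual energy) rather than the full BSSEval projection-based variant, so the numbers it produces are only an approximation of the metric reported above:

```python
import numpy as np

def sdr_db(estimate, target, eps=1e-12):
    """Plain SDR in dB: target energy over residual (estimate - target) energy."""
    num = np.sum(target ** 2)
    den = np.sum((estimate - target) ** 2) + eps
    return 10 * np.log10(num / den + eps)

def sdr_improvement(estimate, mixture, target):
    """SDRi = SDR of the estimate minus SDR of the unprocessed mixture."""
    return sdr_db(estimate, target) - sdr_db(mixture, target)
```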

Methods


No methods listed for this paper.