TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Music Source Separation	MUSDB18	TAK2	SDR (vocals)	7.16	# 16
Music Source Separation	MUSDB18	TAK2	SDR (drums)	6.81	# 17
Music Source Separation	MUSDB18	TAK2	SDR (other)	4.80	# 14
Music Source Separation	MUSDB18	TAK2	SDR (bass)	5.40	# 22
Music Source Separation	MUSDB18	TAK2	SDR (avg)	6.04	# 17

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/mmdenselstm-an-efficient-combination-of/music-source-separation-on-musdb18)](https://paperswithcode.com/sota/music-source-separation-on-musdb18?p=mmdenselstm-an-efficient-combination-of)`

MMDenseLSTM: An efficient combination of convolutional and recurrent neural networks for audio source separation

7 May 2018 · Naoya Takahashi, Nabarun Goswami, Yuki Mitsufuji ·

Deep neural networks have become an indispensable technique for audio source separation (ASS). It was recently reported that a variant of CNN architecture called MMDenseNet was successfully employed to solve the ASS problem of estimating source amplitudes, and state-of-the-art results were obtained for DSD100 dataset. To further enhance MMDenseNet, here we propose a novel architecture that integrates long short-term memory (LSTM) in multiple scales with skip connections to efficiently model long-term structures within an audio context. The experimental results show that the proposed method outperforms MMDenseNet, LSTM and a blend of the two networks. The number of parameters and processing time of the proposed model are significantly less than those for simple blending. Furthermore, the proposed method yields better results than those obtained using ideal binary masks for a singing voice separation task.

PDF Abstract

Code

Add Remove Mark official

tsurumeso/vocal-remover

1,382

Datasets

MUSDB18

Edit Social Preview

MMDenseLSTM: An efficient combination of convolutional and recurrent neural networks for audio source separation

Code Edit Add Remove Mark official

Categories

Datasets Edit

Code

Add Remove Mark official

Datasets