TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Multi-task Audio Source Seperation	MTASS	Demucs	SDRi (Speech)	9.93	# 3
Multi-task Audio Source Seperation	MTASS	Demucs	SDRi (Music)	6.38	# 3
Multi-task Audio Source Seperation	MTASS	Demucs	SDRi (Noise)	6.29	# 3
Multi-task Audio Source Seperation	MTASS	Demucs	SDRi (Average)	7.53	# 3
Music Source Separation	MUSDB18	DEMUCS	SDR (vocals)	6.84	# 19
Music Source Separation	MUSDB18	DEMUCS	SDR (drums)	6.86	# 16
Music Source Separation	MUSDB18	DEMUCS	SDR (other)	4.42	# 19
Music Source Separation	MUSDB18	DEMUCS	SDR (bass)	7.01	# 11
Music Source Separation	MUSDB18	DEMUCS	SDR (avg)	6.28	# 16
Music Source Separation	MUSDB18	DEMUCS (extra)	SDR (vocals)	7.29	# 13
Music Source Separation	MUSDB18	DEMUCS (extra)	SDR (drums)	7.58	# 8
Music Source Separation	MUSDB18	DEMUCS (extra)	SDR (other)	4.69	# 15
Music Source Separation	MUSDB18	DEMUCS (extra)	SDR (bass)	7.60	# 8
Music Source Separation	MUSDB18	DEMUCS (extra)	SDR (avg)	6.79	# 10

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/music-source-separation-in-the-waveform-1/multi-task-audio-source-seperation-on-mtass)](https://paperswithcode.com/sota/multi-task-audio-source-seperation-on-mtass?p=music-source-separation-in-the-waveform-1)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/music-source-separation-in-the-waveform-1/music-source-separation-on-musdb18)](https://paperswithcode.com/sota/music-source-separation-on-musdb18?p=music-source-separation-in-the-waveform-1)`

Music Source Separation in the Waveform Domain

27 Nov 2019 · Alexandre Défossez, Nicolas Usunier, Léon Bottou, Francis Bach ·

Source separation for music is the task of isolating contributions, or stems, from different instruments recorded individually and arranged together to form a song. Such components include voice, bass, drums and any other accompaniments.Contrarily to many audio synthesis tasks where the best performances are achieved by models that directly generate the waveform, the state-of-the-art in source separation for music is to compute masks on the magnitude spectrum. In this paper, we compare two waveform domain architectures. We first adapt Conv-Tasnet, initially developed for speech source separation,to the task of music source separation. While Conv-Tasnet beats many existing spectrogram-domain methods, it suffersfrom significant artifacts, as shown by human evaluations. We propose instead Demucs, a novel waveform-to-waveform model,with a U-Net structure and bidirectional LSTM.Experiments on the MusDB dataset show that, with proper data augmentation, Demucs beats allexisting state-of-the-art architectures, including Conv-Tasnet, with 6.3 SDR on average, (and up to 6.8 with 150 extra training songs, even surpassing the IRM oracle for the bass source).Using recent development in model quantization, Demucs can be compressed down to 120MBwithout any loss of accuracy.We also provide human evaluations, showing that Demucs benefit from a large advantagein terms of the naturalness of the audio. However, it suffers from some bleeding,especially between the vocals and other source.

PDF Abstract

Code

Add Remove Mark official

facebookresearch/demucs official

↳ Quickstart in

Colab

Spaces

7,621

Tasks

Add Remove

Audio Generation

Audio Synthesis

Data Augmentation

Multi-task Audio Source Seperation

Music Source Separation

Quantization

Datasets

MUSDB18 MTASS

Results from the Paper

Edit

Ranked #3 on Multi-task Audio Source Seperation on MTASS

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Multi-task Audio Source Seperation	MTASS	Demucs	SDRi (Speech)	9.93	# 3	Compare
			SDRi (Music)	6.38	# 3	Compare
			SDRi (Noise)	6.29	# 3	Compare
			SDRi (Average)	7.53	# 3	Compare
Music Source Separation	MUSDB18	DEMUCS	SDR (vocals)	6.84	# 19	Compare
			SDR (drums)	6.86	# 16	Compare
			SDR (other)	4.42	# 19	Compare
			SDR (bass)	7.01	# 11	Compare
			SDR (avg)	6.28	# 16	Compare
Music Source Separation	MUSDB18	DEMUCS (extra)	SDR (vocals)	7.29	# 13	Compare
			SDR (drums)	7.58	# 8	Compare
			SDR (other)	4.69	# 15	Compare
			SDR (bass)	7.60	# 8	Compare
			SDR (avg)	6.79	# 10	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Music Source Separation in the Waveform Domain

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove