Music Source Separation
53 papers with code • 3 benchmarks • 7 datasets
Music source separation is the task of decomposing music into its constituent components, e.g., yielding separated stems for the vocals, bass, and drums.
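A common formulation of this task is time-frequency masking: estimate a per-bin mask for each stem and apply it to the mixture spectrogram. The following is a minimal sketch of that idea using an ideal ratio mask on synthetic tones (the sine "stems" and all variable names are illustrative, not from any paper on this page); a neural separator would estimate the mask from the mixture alone.

```python
import numpy as np
from scipy.signal import stft, istft

fs = 8000
t = np.arange(fs) / fs
vocals = np.sin(2 * np.pi * 440 * t)      # stand-in for a vocal stem
bass = 0.5 * np.sin(2 * np.pi * 80 * t)   # stand-in for a bass stem
mix = vocals + bass

# Transform all signals to the time-frequency domain.
_, _, V = stft(vocals, fs=fs, nperseg=512)
_, _, B = stft(bass, fs=fs, nperseg=512)
_, _, X = stft(mix, fs=fs, nperseg=512)

# Ideal ratio mask: each bin's energy share belonging to the target stem.
eps = 1e-10
mask_vocals = np.abs(V) ** 2 / (np.abs(V) ** 2 + np.abs(B) ** 2 + eps)

# Mask the mixture and invert back to a waveform estimate of the stem.
_, vocals_est = istft(mask_vocals * X, fs=fs, nperseg=512)
vocals_est = vocals_est[: len(vocals)]
```

Because the two toy sources occupy disjoint frequency bins, the masked estimate recovers the "vocal" tone almost exactly; real stems overlap in time-frequency, which is what makes the task hard.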
(Image credit: SigSep)
Libraries
Use these libraries to find Music Source Separation models and implementations.

Latest papers with no code
Hybrid Y-Net Architecture for Singing Voice Separation
This research paper presents a novel deep-learning-based neural network architecture, named Y-Net, for music source separation.
Jointist: Simultaneous Improvement of Multi-instrument Transcription and Music Source Separation via Joint Training
Jointist consists of an instrument recognition module that conditions the other two modules: a transcription module that outputs instrument-specific piano rolls, and a source separation module that utilizes instrument information and transcription results.
Multi-scale temporal-frequency attention for music source separation
In recent years, deep neural network (DNN)-based approaches have achieved state-of-the-art performance for music source separation (MSS).
Music Separation Enhancement with Generative Modeling
Despite phenomenal progress in recent years, state-of-the-art music separation systems produce source estimates with significant perceptual shortcomings, such as adding extraneous noise or removing harmonics.
Hierarchic Temporal Convolutional Network With Cross-Domain Encoder for Music Source Separation
In this paper, we propose a model that combines complex-spectrogram-domain features and time-domain features via a cross-domain encoder (CDE) and adopts a hierarchic temporal convolutional network (HTCN) for multi-source music separation.
Feature-informed Latent Space Regularization for Music Source Separation
The integration of additional side information to improve music source separation has been investigated numerous times, e.g., by adding features to the input or by adding learning targets in a multi-task learning scenario.
On loss functions and evaluation metrics for music source separation
We investigate which loss functions provide better separations by benchmarking an extensive set of loss functions for music source separation.
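One widely used separation objective (and evaluation metric) is scale-invariant SDR (SI-SDR), which scores an estimate against a reference independently of gain. Below is a minimal numpy sketch; the function name and the example signals are illustrative, not taken from the benchmarking paper above.

```python
import numpy as np

def si_sdr(estimate, reference):
    """Scale-invariant SDR in dB between an estimated and a reference source."""
    reference = reference - reference.mean()
    estimate = estimate - estimate.mean()
    # Project the estimate onto the reference so gain differences drop out.
    alpha = np.dot(estimate, reference) / np.dot(reference, reference)
    target = alpha * reference
    noise = estimate - target
    return 10 * np.log10(np.sum(target ** 2) / np.sum(noise ** 2))

# Toy example: a reference tone corrupted by a deterministic interferer.
n = np.arange(8000)
ref = np.sin(2 * np.pi * 440 * n / 8000)
est = ref + 0.1 * np.sin(2 * np.pi * 1000 * n / 8000)
score = si_sdr(est, ref)
```

As a training loss one typically minimizes the negative SI-SDR; the scale invariance means a network is not penalized for predicting the right waveform at the wrong loudness.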
SpaIn-Net: Spatially-Informed Stereophonic Music Source Separation
With recent advancements in data-driven approaches using deep neural networks, music source separation has been formulated as an instrument-specific supervised problem.
Distortion Audio Effects: Learning How to Recover the Clean Signal
Given the recent advances in music source separation and automatic mixing, removing audio effects in music tracks is a meaningful step toward developing an automated remixing system.
Upsampling layers for music source separation
Upsampling artifacts are caused by problematic upsampling layers and by spectral replicas that emerge during upsampling.
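A classic example of a problematic upsampling layer is a strided transposed convolution: when the stride does not divide the kernel size, output samples receive an uneven number of kernel contributions, imprinting a periodic "checkerboard" pattern. The sketch below (my own illustration, not code from the paper) makes this visible with a constant input and an all-ones kernel.

```python
import numpy as np

def transposed_conv1d(x, kernel, stride):
    """Naive 1-D transposed convolution: scatter-add a kernel per input sample."""
    out = np.zeros(len(x) * stride + len(kernel) - stride)
    for i, v in enumerate(x):
        out[i * stride : i * stride + len(kernel)] += v * kernel
    return out

x = np.ones(8)        # a constant signal carries no structure of its own
kernel = np.ones(3)   # an all-ones kernel isolates the overlap pattern
y = transposed_conv1d(x, kernel, stride=2)
# With kernel size 3 and stride 2, interior outputs alternate between
# 1 and 2 overlapping contributions: any structure here is pure artifact.
```

Common mitigations include choosing kernel sizes divisible by the stride, or replacing the transposed convolution with interpolation followed by a plain convolution.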