Transfer Learning with Jukebox for Music Source Separation

28 Nov 2021  ·  W. Zai El Amri, O. Tautz, H. Ritter, A. Melnik ·

In this work, we demonstrate how a publicly available, pre-trained Jukebox model can be adapted for the problem of audio source separation from a single mixed audio channel. Our neural network architecture, which is using transfer learning, is quick to train and the results demonstrate performance comparable to other state-of-the-art approaches that require a lot more compute resources, training data, and time. We provide an open-source code implementation of our architecture (https://github.com/wzaielamri/unmix)

PDF Abstract

Datasets


Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Music Source Separation MUSDB18-HQ Unmix SDR (drums) 4.925 # 12
SDR (bass) 4.073 # 12
SDR (others) 2.695 # 12
SDR (vocals) 5.06 # 12
SDR (avg) 4.188 # 12

Methods