Audio Source Separation

11 papers with code

Greatest papers with code

Wave-U-Net: A Multi-Scale Neural Network for End-to-End Audio Source Separation

8 Jun 2018 f90/Wave-U-Net

Models for audio source separation usually operate on the magnitude spectrum, which ignores phase information and makes separation performance dependent on hyper-parameters for the spectral front-end.

AUDIO SOURCE SEPARATION · MUSIC SOURCE SEPARATION

Audio-Visual Scene Analysis with Self-Supervised Multisensory Features

ECCV 2018 andrewowens/multisensory

The thud of a bouncing ball, the onset of speech as lips open -- when visual and audio events occur together, it suggests that there might be a common, underlying event that produced both signals.

AUDIO SOURCE SEPARATION · TEMPORAL ACTION LOCALIZATION

Improved Speech Enhancement with the Wave-U-Net

27 Nov 2018 craigmacartney/Wave-U-Net-For-Speech-Enhancement

We study the use of the Wave-U-Net architecture, a model introduced by Stoller et al. for the separation of music vocals and accompaniment, for speech enhancement.

AUDIO SOURCE SEPARATION · SPEECH ENHANCEMENT · SPEECH RECOGNITION

Adversarial Semi-Supervised Audio Source Separation applied to Singing Voice Extraction

31 Oct 2017 f90/AdversarialAudioSeparation

Based on this idea, we drive the separator towards outputs deemed as realistic by discriminator networks that are trained to tell apart real from separator samples.

AUDIO SOURCE SEPARATION · DATA AUGMENTATION · MUSIC SOURCE SEPARATION

Learning to Separate Object Sounds by Watching Unlabeled Video

ECCV 2018 rhgao/Deep-MIML-Network

Our work is the first to learn audio source separation from large-scale "in the wild" videos containing multiple audio sources per video.

AUDIO DENOISING · AUDIO SOURCE SEPARATION · DENOISING · MULTI-LABEL LEARNING

Co-Separating Sounds of Visual Objects

ICCV 2019 rhgao/co-separation

Learning how objects sound from video is challenging, since they often heavily overlap in a single audio channel.

AUDIO DENOISING · AUDIO SOURCE SEPARATION · DENOISING

Conditioned-U-Net: Introducing a Control Mechanism in the U-Net for Multiple Source Separations

2 Jul 2019 gabolsgabs/cunet

The input vector is embedded to obtain the parameters that control Feature-wise Linear Modulation (FiLM) layers.

AUDIO SOURCE SEPARATION
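
For readers unfamiliar with FiLM, the following is a minimal sketch of the general idea in plain Python, not the code from gabolsgabs/cunet. A condition vector is embedded into a per-channel scale (gamma) and shift (beta), which are then applied to a feature map. The one-hot condition, the hand-set weight matrices, and the names `embed_condition` and `film` are illustrative assumptions; in a trained model the embedding weights would be learned.

```python
def embed_condition(condition, weights, biases):
    """Dense layer: map a condition vector to one value per feature channel."""
    return [
        sum(w * c for w, c in zip(row, condition)) + b
        for row, b in zip(weights, biases)
    ]


def film(features, gamma, beta):
    """Apply FiLM: scale and shift every activation of each channel.

    features: list of channels, each a list of activations.
    gamma, beta: one scalar per channel.
    """
    return [
        [g * x + b for x in channel]
        for channel, g, b in zip(features, gamma, beta)
    ]


# Assumption: a one-hot vector selecting one of 4 sources, and 2 feature channels.
condition = [0.0, 1.0, 0.0, 0.0]

# Hypothetical, hand-set embedding weights (these would normally be learned).
gamma_w = [[1.0, 2.0, 1.0, 1.0], [1.0, 0.5, 1.0, 1.0]]
beta_w = [[0.0, -1.0, 0.0, 0.0], [0.0, 3.0, 0.0, 0.0]]
zero_bias = [0.0, 0.0]

gamma = embed_condition(condition, gamma_w, zero_bias)  # per-channel scale
beta = embed_condition(condition, beta_w, zero_bias)    # per-channel shift

features = [[1.0, 2.0, 3.0], [4.0, 6.0, 8.0]]
modulated = film(features, gamma, beta)
print(modulated)  # [[1.0, 3.0, 5.0], [5.0, 6.0, 7.0]]
```

Changing the condition vector changes gamma and beta, so the same feature extractor can be steered towards a different output without changing its own weights.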

Sparse Gaussian Process Audio Source Separation Using Spectrum Priors in the Time-Domain

30 Oct 2018 PabloAlvarado/ssgp

As a result, source separation GP models have been restricted to the analysis of short audio frames.

AUDIO SOURCE SEPARATION

Training Generative Adversarial Networks from Incomplete Observations using Factorised Discriminators

ICLR 2020 f90/FactorGAN

We apply our method to image generation, image segmentation and audio source separation, and obtain improved performance over a standard GAN when additional incomplete training examples are available.

AUDIO SOURCE SEPARATION · IMAGE GENERATION · SEMANTIC SEGMENTATION