Visually Guided Sound Source Separation

3 papers with code • 0 benchmarks • 0 datasets

The task of visually guided sound source separation (also referred to as audio-visual sound separation or visual sound separation) aims to recover individual sound components from an audio mixture with the aid of visual cues.

Most implemented papers

Visually Guided Sound Source Separation using Cascaded Opponent Filter Network

ly-zhu/ly-zhu.github.io 4 Jun 2020

A key element of the Cascaded Opponent Filter (COF) network is a novel opponent filter module that identifies and relocates residual components between sources.

Visually Guided Sound Source Separation and Localization using Self-Supervised Motion Representations

ly-zhu/ly-zhu.github.io 17 Apr 2021

The objective of this paper is to perform audio-visual sound source separation, i.e., to separate component audio signals from a mixture based on the videos of sound sources.

Visually-Guided Sound Source Separation with Audio-Visual Predictive Coding

zjsong/audio-visual-predictive-coding 19 Jun 2023

The framework of visually-guided sound source separation generally consists of three parts: visual feature extraction, multimodal feature fusion, and sound signal processing.
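
To make the three-part structure concrete, here is a minimal toy sketch in plain Python. It is not the method of any paper listed here: every function name, the averaging "visual network", the scaled log-magnitude "audio features", and the sigmoid masking step are assumptions chosen only to show how the three stages could connect. A common design, assumed here, is to predict a time-frequency mask and apply it to the mixture spectrogram.

```python
import math
import random

def extract_visual_feature(frame_feats):
    """Visual feature extraction (toy): average per-frame K-dim vectors over time."""
    n = len(frame_feats)
    return [sum(col) / n for col in zip(*frame_feats)]

def extract_audio_features(mixture_spec, audio_weights):
    """Audio analysis (toy): K scaled copies of the log-magnitude spectrogram."""
    return [[[w * math.log1p(m) for m in row] for row in mixture_spec]
            for w in audio_weights]

def fuse_to_mask(visual_feat, audio_feats):
    """Multimodal fusion (toy): visual-weighted sum over K channels, then sigmoid."""
    n_freq, n_time = len(audio_feats[0]), len(audio_feats[0][0])
    mask = []
    for f in range(n_freq):
        row = []
        for t in range(n_time):
            logit = sum(v * a[f][t] for v, a in zip(visual_feat, audio_feats))
            row.append(1.0 / (1.0 + math.exp(-logit)))
        mask.append(row)
    return mask

def separate(mixture_spec, mask):
    """Sound signal processing (toy): apply the mask to the mixture magnitudes."""
    return [[k * m for k, m in zip(mask_row, mix_row)]
            for mask_row, mix_row in zip(mask, mixture_spec)]

# Hypothetical stand-in data: 8 frames with K=4 features, a 6x5 magnitude spectrogram.
random.seed(0)
K = 4
frame_feats = [[random.random() for _ in range(K)] for _ in range(8)]
mixture_spec = [[abs(random.gauss(0, 1)) for _ in range(5)] for _ in range(6)]
audio_weights = [random.gauss(0, 1) for _ in range(K)]

visual_feat = extract_visual_feature(frame_feats)
audio_feats = extract_audio_features(mixture_spec, audio_weights)
mask = fuse_to_mask(visual_feat, audio_feats)
separated_spec = separate(mixture_spec, mask)
print(len(separated_spec), len(separated_spec[0]))
```

In a real system each toy function would be replaced by a learned network, and the masked spectrogram would be converted back to a waveform with an inverse short-time Fourier transform.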