Search Results for author: Junwen Xiong

Found 7 papers, 1 paper with code

DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction

no code implementations • 2 Mar 2024 • Junwen Xiong, Peng Zhang, Tao You, Chuanyue Li, Wei Huang, Yufei Zha

Audio-visual saliency prediction can draw support from diverse modality complements, but further performance enhancement is still challenged by customized architectures as well as task-specific loss functions.

Denoising Saliency Prediction

UniST: Towards Unifying Saliency Transformer for Video Saliency Prediction and Detection

no code implementations • 15 Sep 2023 • Junwen Xiong, Peng Zhang, Chuanyue Li, Wei Huang, Yufei Zha, Tao You

While many approaches have crafted task-specific training paradigms for either video saliency prediction or video salient object detection, little attention has been devoted to devising a generalized saliency modeling framework that seamlessly bridges these two distinct tasks.

object-detection Saliency Prediction +3

FTFDNet: Learning to Detect Talking Face Video Manipulation with Tri-Modality Interaction

no code implementations • 8 Jul 2023 • Ganglai Wang, Peng Zhang, Junwen Xiong, Feihan Yang, Wei Huang, Yufei Zha

DeepFake-based digital facial forgery is threatening public media security, especially when lip manipulation is used in talking face generation, which further increases the difficulty of fake video detection.

Face Detection Face Swapping +2

Audio-visual speech separation based on joint feature representation with cross-modal attention

no code implementations • 5 Mar 2022 • Junwen Xiong, Peng Zhang, Lei Xie, Wei Huang, Yufei Zha, Yanning Zhang

Multi-modal speech separation has exhibited a specific advantage in isolating the target character in multi-talker noisy environments.

Optical Flow Estimation Speech Separation

Look&Listen: Multi-Modal Correlation Learning for Active Speaker Detection and Speech Enhancement

1 code implementation • 4 Mar 2022 • Junwen Xiong, Yu Zhou, Peng Zhang, Lei Xie, Wei Huang, Yufei Zha

Active speaker detection and speech enhancement have become two increasingly attractive topics in audio-visual scenario understanding.

Multi-Task Learning Speech Enhancement
