Search Results for author: Jinxing Zhou

Found 7 papers, 6 papers with code

Object-aware Adaptive-Positivity Learning for Audio-Visual Question Answering

1 code implementation • 20 Dec 2023 • Zhangbin Li, Dan Guo, Jinxing Zhou, Jing Zhang, Meng Wang

These selected pairs are constrained to have larger similarity values than the mismatched pairs.

Audio-visual Question Answering • Audio-Visual Question Answering (AVQA) • +4

Fine-grained Audible Video Description

1 code implementation • CVPR 2023 • Xuyang Shen, Dong Li, Jinxing Zhou, Zhen Qin, Bowen He, Xiaodong Han, Aixuan Li, Yuchao Dai, Lingpeng Kong, Meng Wang, Yu Qiao, Yiran Zhong

We explore a new task for audio-visual-language modeling called fine-grained audible video description (FAVD).

Language Modelling • Masked Language Modeling • +5

Improving Audio-Visual Video Parsing with Pseudo Visual Labels

no code implementations • 4 Mar 2023 • Jinxing Zhou, Dan Guo, Yiran Zhong, Meng Wang

We perform extensive experiments on the LLP dataset and demonstrate that our method can generate high-quality segment-level pseudo labels with the help of our newly proposed loss and the label denoising strategy.

Denoising • Pseudo Label

Audio-Visual Segmentation with Semantics

1 code implementation • 30 Jan 2023 • Jinxing Zhou, Xuyang Shen, Jianyuan Wang, Jiayi Zhang, Weixuan Sun, Jing Zhang, Stan Birchfield, Dan Guo, Lingpeng Kong, Meng Wang, Yiran Zhong

To deal with these problems, we propose a new baseline method that uses a temporal pixel-wise audio-visual interaction module to inject audio semantics as guidance for the visual segmentation process.

Segmentation • Semantic Segmentation • +1

Contrastive Positive Sample Propagation along the Audio-Visual Event Line

1 code implementation • 18 Nov 2022 • Jinxing Zhou, Dan Guo, Meng Wang

Visual and audio signals often coexist in natural environments, forming audio-visual events (AVEs).

Contrastive Learning • Representation Learning

Audio-Visual Segmentation

1 code implementation • 11 Jul 2022 • Jinxing Zhou, Jianyuan Wang, Jiayi Zhang, Weixuan Sun, Jing Zhang, Stan Birchfield, Dan Guo, Lingpeng Kong, Meng Wang, Yiran Zhong

To deal with the AVS problem, we propose a novel method that uses a temporal pixel-wise audio-visual interaction module to inject audio semantics as guidance for the visual segmentation process.

Segmentation

Positive Sample Propagation along the Audio-Visual Event Line

2 code implementations • CVPR 2021 • Jinxing Zhou, Liang Zheng, Yiran Zhong, Shijie Hao, Meng Wang

To encourage the network to extract highly correlated features for positive samples, a new audio-visual pair similarity loss is proposed.

audio-visual event localization
