no code implementations • 16 Mar 2024 • Jiyuan Fu, Zhaoyu Chen, Kaixun Jiang, Haijing Guo, Jiafeng Wang, Shuyong Gao, Wenqiang Zhang
Existing work rarely studies the transferability of attacks on VLP models, resulting in a substantial performance gap from white-box attacks.
no code implementations • 10 Mar 2024 • Pinxue Guo, Lingyi Hong, Xinyu Zhou, Shuyong Gao, Wanyun Li, Jinglun Li, Zhaoyu Chen, Xiaoqiang Li, Wei Zhang, Wenqiang Zhang
To address these limitations, we propose the setting named Click Video Object Segmentation (ClickVOS) which segments objects of interest across the whole video according to a single click per object in the first frame.
no code implementations • 30 Nov 2023 • Lingyi Hong, Wei Zhang, Shuyong Gao, Hong Lu, Wenqiang Zhang
We evaluate our method on several benchmark datasets and achieve state-of-the-art results.
no code implementations • 14 Oct 2023 • Yicheng Song, Shuyong Gao, Haozhe Xing, Yiting Cheng, Yan Wang, Wenqiang Zhang
Unsupervised salient object detection aims to detect salient objects without using supervision signals, eliminating the tedious task of manually labeling salient objects.
no code implementations • 14 Oct 2023 • Qianyu Guo, Huifang Du, Xing Jia, Shuyong Gao, Yan Teng, Haofen Wang, Wenqiang Zhang
Finally, the generated features and prototypes are used together to train a more generalized classifier.
no code implementations • MM '22: Proceedings of the 30th ACM International Conference on Multimedia 2022 • Yan Wang, Yixuan Sun, Wei Song, Shuyong Gao, Yiwen Huang, Zhaoyu Chen, Weifeng Ge, Wenqiang Zhang
To obtain consistent prediction probabilities from the dual path, we further propose a dual path regularization loss, aiming to minimize the divergence between the distributions of two-path embeddings.
Ranked #13 on Dynamic Facial Expression Recognition on DFEW
Dynamic Facial Expression Recognition • Representation Learning
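The dual path regularization loss mentioned in the entry above can be illustrated with a minimal sketch. The snippet does not say which divergence is used or how the distributions are formed, so this example assumes a symmetric KL divergence between softmax-normalized outputs of the two paths; the function names (`softmax`, `kl_divergence`, `dual_path_regularization`) are hypothetical and not taken from the paper.

```python
import math


def softmax(logits):
    """Turn one row of raw scores into a probability distribution."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(
        pi * (math.log(pi + eps) - math.log(qi + eps))
        for pi, qi in zip(p, q)
    )


def dual_path_regularization(logits_a, logits_b):
    """Mean symmetric KL between the two paths' predicted distributions.

    Assumption: each path outputs one row of class logits per sample.
    The symmetric KL is an illustrative choice, not the paper's stated loss.
    """
    losses = []
    for row_a, row_b in zip(logits_a, logits_b):
        p = softmax(row_a)
        q = softmax(row_b)
        losses.append(0.5 * (kl_divergence(p, q) + kl_divergence(q, p)))
    return sum(losses) / len(losses)


if __name__ == "__main__":
    path_a = [[2.0, 0.5, -1.0], [0.1, 0.2, 0.3]]
    path_b = [[1.5, 0.7, -0.8], [0.3, 0.2, 0.1]]
    print(dual_path_regularization(path_a, path_b))
```

Under these assumptions the loss is zero when both paths predict the same distribution and grows as their predictions diverge, which is the behavior a consistency regularizer of this kind is meant to encourage.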
no code implementations • 15 Jul 2022 • Shuyong Gao, Haozhe Xing, Wei Zhang, Yan Wang, Qianyu Guo, Wenqiang Zhang
Several works attempt to use scribble annotations to mitigate this problem, but point supervision, a more labor-saving annotation method (indeed the most labor-saving among manual annotation methods for dense prediction), has not been explored.
1 code implementation • 22 Mar 2022 • Shuyong Gao, Wei Zhang, Yan Wang, Qianyu Guo, Chenglong Zhang, Yangji He, Wenqiang Zhang
Then we develop a transformer-based point-supervised saliency detection model to produce the first round of saliency maps.
no code implementations • CVPR 2022 • Yan Wang, Yixuan Sun, Yiwen Huang, Zhongying Liu, Shuyong Gao, Wei Zhang, Weifeng Ge, Wenqiang Zhang
Current benchmarks for facial expression recognition (FER) mainly focus on static images, while there are limited datasets for FER in videos.