1 code implementation • 14 Nov 2024 • Xuannan Liu, Xing Cui, Peipei Li, Zekun Li, Huaibo Huang, Shuhan Xia, Miaoxuan Zhang, Yueying Zou, Ran He
Consequently, understanding the methods of jailbreak attacks and existing defense mechanisms is essential to ensure the safe deployment of multimodal generative models in real-world scenarios, particularly in security-sensitive applications.
no code implementations • 3 Oct 2024 • Yizhang Zou, Xuegang Hu, Peipei Li, Jun Hu, You Wu
Motivated by this, we propose an online multi-label classification algorithm under Noisy and Changing Label Distribution (NCLD).
no code implementations • 22 Sep 2024 • Lixia Ma, Puning Yang, Yuting Xu, Ziming Yang, Peipei Li, Huaibo Huang
This paper presents a comprehensive survey of recent deep learning-based approaches for facial forgery detection.
1 code implementation • 28 Jun 2024 • Yubo Huang, Jia Wang, Peipei Li, Liuyu Xiang, Peigang Li, Zhaofeng He
In this work, we propose a generative iris prior embedded Transformer model (Gformer), in which we build a hierarchical encoder-decoder network employing Transformer block and generative iris prior.
no code implementations • 13 Jun 2024 • Xuannan Liu, Zekun Li, Peipei Li, Shuhan Xia, Xing Cui, Linzhi Huang, Huaibo Huang, Weihong Deng, Zhaofeng He
Current multimodal misinformation detection (MMD) methods often assume a single source and type of forgery for each sample, which is insufficient for real-world scenarios where multiple forgery sources coexist.
1 code implementation • 1 Jun 2024 • Xing Cui, Peipei Li, Zekun Li, Xuannan Liu, Yueying Zou, Zhaofeng He
Specifically, semantic guidance is derived by establishing a semantic editing direction based on reasoned intentions, while quality guidance is achieved through classifier guidance using an image fidelity discriminator.
no code implementations • 16 Mar 2024 • Rui Wang, Hailong Guo, Jiaming Liu, Huaxia Li, Haibo Zhao, Xu Tang, Yao Hu, Hao Tang, Peipei Li
In this paper, we introduce StableGarment, a unified framework to tackle garment-centric(GC) generation tasks, including GC text-to-image, controllable GC text-to-image, stylized GC text-to-image, and robust virtual try-on.
no code implementations • 4 Mar 2024 • Xuannan Liu, Peipei Li, Huaibo Huang, Zekun Li, Xing Cui, Jiahao Liang, Lixiong Qin, Weihong Deng, Zhaofeng He
The massive generation of multimodal fake news involving both text and images exhibits substantial distribution discrepancies, prompting the need for generalized detectors.
1 code implementation • 24 Jan 2024 • Zengbin Wang, Saihui Hou, Man Zhang, Xu Liu, Chunshui Cao, Yongzhen Huang, Peipei Li, Shibiao Xu
Gait recognition is a promising biometric method that aims to identify pedestrians from their unique walking patterns.
no code implementations • 28 Dec 2023 • Qianrui Teng, Rui Wang, Xing Cui, Peipei Li, Zhaofeng He
Existing face aging methods often focus on modeling either texture aging or using an entangled shape-texture representation to achieve face aging.
no code implementations • 22 Dec 2023 • Xuannan Liu, Yaoyao Zhong, Xing Cui, Yuhang Zhang, Peipei Li, Weihong Deng
This strategy initially focuses on adapting the masks to the unique individual faces via image-specific training and then enhances their feature-level generalization ability to diverse facial variations of individuals via person-specific training.
no code implementations • ACM Transactions on Intelligent Systems and Technology 2023 • Junwei Lv, Yuqi Chu, Jun Hu, Peipei Li, Xuegang Hu
Existing approaches mainly utilize heuristic stopping rules to capture stopping signals from the prediction results of time series classifiers.
Ranked #1 on Early Classification on ECG200
no code implementations • 8 Oct 2023 • Peipei Li, Xing Cui, Yibo Hu, Man Zhang, Ting Yao, Tao Mei
Directly employing small models may result in a significant drop in performance since it is difficult for a small model to adequately capture local structure and global shape information simultaneously, which are essential clues for point cloud analysis.
2 code implementations • NeurIPS 2023 • Rui Wang, Peipei Li, Huaibo Huang, Chunshui Cao, Ran He, Zhaofeng He
Consequently, we propose a cross-modal ordinal pairwise loss to refine the CLIP feature space, where texts and images maintain both semantic alignment and ordering alignment.
no code implementations • ICCV 2023 • Peipei Li, Rui Wang, Huaibo Huang, Ran He, Zhaofeng He
Face aging is an ill-posed problem because multiple plausible aging patterns may correspond to a given input.
no code implementations • 20 Mar 2023 • Xing Cui, Zekun Li, Peipei Li, Yibo Hu, Hailin Shi, Zhaofeng He
This paper explores interactive facial image editing via dialogue and introduces the ChatEdit benchmark dataset for evaluating image editing and conversation abilities in this context.
1 code implementation • 1 Aug 2022 • Xing Zhao, Haoran Liang, Peipei Li, Guodao Sun, Dongdong Zhao, Ronghua Liang, Xiaofei He
Moreover, inspired by the boundary supervision commonly used in image salient object detection (ISOD), we design a motion-aware loss for predicting object boundary motion and simultaneously perform multitask learning for VSOD and object motion prediction, which can further facilitate the model to extract spatiotemporal features accurately and maintain the object integrity.