no code implementations • 18 Dec 2024 • Biao Liu, Wenyi Fang, Xiaoyu Wu, Yang Zheng, Zheng Hu, Bo Yuan
A recent method, Context Optimization (CoOp), further improves the performance of VL models on downstream tasks by introducing prompt learning.
1 code implementation • 18 Oct 2024 • Bo Cheng, Yuhang Ma, Liebucha Wu, Shanyuan Liu, Ao Ma, Xiaoyu Wu, Dawei Leng, Yuhui Yin
The task of layout-to-image generation involves synthesizing images based on the captions of objects and their spatial positions.
no code implementations • 16 Oct 2024 • Ke Wang, Jiahui Zhu, Minjie Ren, Zeming Liu, Shiwei Li, Zongye Zhang, Chenkai Zhang, Xiaoyu Wu, Qiqi Zhan, Qingjie Liu, Yunhong Wang
The success of Large Language Models (LLMs) is inherently linked to the availability of vast, diverse, and high-quality data for training and evaluation.
no code implementations • 3 Oct 2024 • Xiaoyu Wu, Jiaru Zhang, Steven Wu
To answer this, we propose FineXtract, a framework for extracting fine-tuning data.
no code implementations • 30 May 2024 • Xiaoyu Wu, Jiaru Zhang, Yang Hua, Bohan Lyu, Hao Wang, Tao Song, Haibing Guan
Through this modeling, we identify the primary cause of this corruption stage: a narrowed learning distribution inherent in the nature of few-shot fine-tuning.
no code implementations • CVPR 2024 • Xiaoyu Wu, Yang Hua, Chumeng Liang, Jiaru Zhang, Hao Wang, Tao Song, Haibing Guan
In response, we present Contrasting Gradient Inversion for Diffusion Models (CGI-DM), a novel method featuring vivid visual representations for digital copyright authentication.
2 code implementations • 7 Oct 2023 • BoYang Zheng, Chumeng Liang, Xiaoyu Wu
In this paper, we propose a simple yet effective improvement for the protection against unauthorized diffusion customization by introducing targeted attacks.
1 code implementation • 2 Oct 2023 • Haotian Xue, Chumeng Liang, Xiaoyu Wu, Yongxin Chen
In this work, we present novel findings on attacking latent diffusion models (LDM) and propose new plug-and-play strategies for more effective protection.
no code implementations • 1 Sep 2023 • Jincheng Li, Chunyu Xie, Xiaoyu Wu, Bin Wang, Dawei Leng
A two-stage object detector includes a visual backbone, a region proposal network (RPN), and a region of interest (RoI) head.
1 code implementation • 26 Jun 2023 • Yujiang Pu, Xiaoyu Wu, Lulu Yang, Shengjin Wang
Additionally, we propose a Prompt-Enhanced Learning (PEL) module that integrates semantic priors using knowledge-based prompts to boost the discriminative capacity of context features while ensuring separability between anomaly sub-classes.
Anomaly Detection In Surveillance Videos Video Anomaly Detection +1
1 code implementation • 22 May 2023 • Chumeng Liang, Xiaoyu Wu
Diffusion Models (DMs) have empowered great success in artificial-intelligence-generated content, especially in artwork creation, yet raising new concerns in intellectual properties and copyright.
2 code implementations • 9 Feb 2023 • Chumeng Liang, Xiaoyu Wu, Yang Hua, Jiaru Zhang, Yiming Xue, Tao Song, Zhengui Xue, Ruhui Ma, Haibing Guan
Recently, Diffusion Models (DMs) boost a wave in AI for Art yet raise new copyright concerns, where infringers benefit from using unauthorized paintings to train DMs to generate novel paintings in a similar style.
no code implementations • 17 Oct 2022 • Anyi Rao, Xuekun Jiang, Sichen Wang, Yuwei Guo, Zihao Liu, Bo Dai, Long Pang, Xiaoyu Wu, Dahua Lin, Libiao Jin
The ability to choose an appropriate camera view among multiple cameras plays a vital role in TV shows delivery.
no code implementations • 11 Aug 2022 • Yujiang Pu, Xiaoyu Wu
Video anomaly detection is recently formulated as a multiple instance learning task under weak supervision, in which each video is treated as a bag of snippets to be determined whether contains anomalies.
1 code implementation • 7 Jun 2022 • Jiashuo Liu, Jiayun Wu, Jie Peng, Xiaoyu Wu, Yang Zheng, Bo Li, Peng Cui
shifts in prediction mechanisms ($Y|X$-shifts).
1 code implementation • 8 May 2022 • Chunyu Xie, Heng Cai, Jincheng Li, Fanjing Kong, Xiaoyu Wu, Jianfei Song, Henrique Morimitsu, Lin Yao, Dexin Wang, Xiangzheng Zhang, Dawei Leng, Baochang Zhang, Xiangyang Ji, Yafeng Deng
In this work, we build a large-scale high-quality Chinese Cross-Modal Benchmark named CCMB for the research community, which contains the currently largest public pre-training dataset Zero and five human-annotated fine-tuning datasets for downstream tasks.
Ranked #3 on Image Retrieval on Flickr30k-CN
1 code implementation • Conference 2022 • Yujiang Pu, Xiaoyu Wu
Detecting violence in video is a challenging task due to its complex scenarios and great intra-class variability.
no code implementations • 4 Sep 2021 • Ruizhi Chen, Xiaoyu Wu, Yansong Pan, Kaizhao Yuan, Ling Li, TianYun Ma, JiYuan Liang, Rui Zhang, Kai Wang, Chen Zhang, Shaohui Peng, Xishan Zhang, Zidong Du, Qi Guo, Yunji Chen
In this framework, the environment can be easily configured to realize all kinds of RL tasks in the mainstream research.
no code implementations • 10 May 2020 • Xiaoyu Wu, Zeyu Bai, Jianguo Jia, Youzhi Liang
In this paper, we propose a novel multi-variate algorithm using a triple-regression methodology to predict the airborne-pollen allergy season that can be customized for each patient in the long term.