no code implementations • ECCV 2020 • Weitao Wan, Jiansheng Chen, Ming-Hsuan Yang
We call such a new robust training strategy the adversarial training with bi-directional likelihood regularization (ATBLR) method.
no code implementations • 27 Nov 2024 • Yudong Zhang, Ruobing Xie, Jiansheng Chen, Xingwu Sun, Zhanhui Kang, Yu Wang
Large vision-language models (LVLMs) have demonstrated exceptional performance on complex multimodal tasks.
no code implementations • 25 Sep 2024 • Kun Song, Zhiquan Tan, Bochao Zou, Jiansheng Chen, Huimin Ma, Weiran Huang
Experiments show that matrix entropy cannot solely describe the interaction of the information content of data representation and classification head weights but it can effectively reflect the similarity and clustering behavior of the data.
1 code implementation • 8 Sep 2024 • Yudong Zhang, Ruobing Xie, Jiansheng Chen, Xingwu Sun, Yu Wang
We propose an unconventional method named PIP, which utilizes the attention patterns of one randomly selected irrelevant probe question (e. g., "Is there a clock?")
1 code implementation • 16 Jul 2024 • Yu Shang, Yuming Lin, Yu Zheng, Hangyu Fan, Jingtao Ding, Jie Feng, Jiansheng Chen, Li Tian, Yong Li
Toward this problem, we propose UrbanWorld, the first generative urban world model that can automatically create a customized, realistic and interactive 3D urban world with flexible control conditions.
no code implementations • 7 Apr 2024 • Youze Xue, Binghui Chen, Yifeng Geng, Xuansong Xie, Jiansheng Chen, Hongbing Ma
Customized generative text-to-image models have the ability to produce images that closely resemble a given subject.
no code implementations • 25 Mar 2024 • Jingyuan Zhu, Huimin Ma, Jiansheng Chen, Jian Yuan
This paper presents a general approach for text-to-image diffusion models to address the mutual interference between different subjects and their attachments in complex scenes, pursuing better text-image consistency.
no code implementations • 6 Mar 2024 • Yingrui Ji, Yao Zhu, Zhigang Li, Jiansheng Chen, Yunlong Kong, Jingbo Chen
Our work addresses this challenge by enhancing the detection and management of OOD samples in neural networks.
1 code implementation • 20 Feb 2024 • Bochao Zou, Zizheng Guo, Jiansheng Chen, Huimin Ma
Due to the periodicity nature of rPPG, the long-range dependency capturing capacity of the Transformer was assumed to be advantageous for such signals.
1 code implementation • 7 Aug 2023 • Jiawei Li, Jiansheng Chen, JinYuan Liu, Huimin Ma
Finally, we merge all graph features to get the fusion result.
no code implementations • 25 Jun 2023 • Jingyuan Zhu, Huimin Ma, Jiansheng Chen, Jian Yuan
Typical diffusion models and modern large-scale conditional generative models like text-to-image generative models are vulnerable to overfitting when fine-tuned on extremely limited data.
no code implementations • 19 May 2023 • Jingyuan Zhu, Huimin Ma, Jiansheng Chen, Jian Yuan
Our approach only needs the silhouettes of few-shot target samples as training data to learn target geometry distributions and achieve generated shapes with diverse topology and textures.
1 code implementation • 6 Mar 2023 • Jingyuan Zhu, Huimin Ma, Jiansheng Chen, Jian Yuan
We present MotionVideoGAN, a novel video generator synthesizing videos based on the motion space learned by pre-trained image pair generators.
1 code implementation • 8 Feb 2023 • Kun Song, Yuchen Wu, Jiansheng Chen, Tianyu Hu, Huimin Ma
Due to the scarcity of available data, deep learning does not perform well on few-shot learning tasks.
no code implementations • 17 Jan 2023 • Yuchen Wu, Kun Song, Fangzheng Zhao, Jiansheng Chen, Huimin Ma
Zero-Shot Sketch-Based Image Retrieval (ZS-SBIR) is a challenging cross-modal retrieval task.
no code implementations • 7 Nov 2022 • Jingyuan Zhu, Huimin Ma, Jiansheng Chen, Jian Yuan
Then we fine-tune DDPMs pre-trained on large source domains to solve the overfitting problem when training data is limited.
no code implementations • 27 Oct 2022 • Jingyuan Zhu, Huimin Ma, Jiansheng Chen, Jian Yuan
It strengthens global image discrimination and guides adapted GANs to preserve more information learned from source domains for higher image quality.
no code implementations • ICCV 2021 • Cheng Yu, Jiansheng Chen, Youze Xue, Yuyang Liu, Weitao Wan, Jiayu Bao, Huimin Ma
Physical-world adversarial attacks based on universal adversarial patches have been proved to be able to mislead deep convolutional neural networks (CNNs), exposing the vulnerability of real-world visual classification systems based on CNNs.
no code implementations • 18 Nov 2020 • Weitao Wan, Jiansheng Chen, Cheng Yu, Tong Wu, Yuanyi Zhong, Ming-Hsuan Yang
In this work, we propose a Gaussian mixture (GM) loss function for deep neural networks for visual classification.
no code implementations • 30 Sep 2020 • Yiqing Huang, Jiansheng Chen
To effectively learn from the teacher model, we propose Teacher-Critical Training Strategies (TCTS) for both XE and RL training to facilitate better learning processes for the caption model.
2 code implementations • CVPR 2018 • Weitao Wan, Yuanyi Zhong, Tianpeng Li, Jiansheng Chen
We propose a large-margin Gaussian Mixture (L-GM) loss for deep neural networks in classification tasks.
no code implementations • 25 Jan 2017 • Yuanyi Zhong, Jiansheng Chen, Bo Huang
Although these methods differ essentially in many aspects, a common practice of them is to specifically align the facial area based on the prior knowledge of human face structure before feature extraction.
no code implementations • CVPR 2016 • Jiansheng Chen, Gaocheng Bai, Shaoheng Liang, Zhengqin Li
Attention based automatic image cropping aims at preserving the most visually important region in an image.
no code implementations • CVPR 2015 • Zhengqin Li, Jiansheng Chen
We present in this paper a superpixel segmentation algorithm called Linear Spectral Clustering (LSC), which produces compact and uniform superpixels with low computational costs.