no code implementations • ECCV 2020 • Lu Zhang, Jianming Zhang, Zhe Lin, Radomír Měch, Huchuan Lu, You He
We reformulate the problem of detecting and tracking of salient object spots as a new task called object hotspot tracking.
2 code implementations • CVPR 2025 • Xiaomin Li, Yixuan Liu, Takashi Isobe, Xu Jia, Qinpeng Cui, Dong Zhou, Dong Li, You He, Huchuan Lu, Zhongdao Wang, Emad Barsoum
In text-to-image (T2I) generation applications, negative embeddings have proven to be a simple yet effective approach for enhancing generation quality.
no code implementations • 15 Dec 2024 • Xutao Liao, Shaohui Li, Yuhui Xu, Zhi Li, Yu Liu, You He
To further enhance performance, we propose sparsely coded residuals to reduce the errors caused by low-rank approximation on the first- and second-order moments of the optimizers and weight updates.
no code implementations • 2 Dec 2024 • Xiaomin Li, Xu Jia, Qinghe Wang, Haiwen Diao, Mengmeng Ge, Pengxiang Li, You He, Huchuan Lu
They often fail to effectively decouple motion and the appearance in the limited reference videos, thereby weakening the modeling capability of motion patterns.
1 code implementation • 26 Oct 2024 • Jiazuo Yu, Haomiao Xiong, Lu Zhang, Haiwen Diao, Yunzhi Zhuge, Lanqing Hong, Dong Wang, Huchuan Lu, You He, Long Chen
Multimodal Large Language Models (MLLMs) have gained significant attention due to their impressive capabilities in multimodal understanding.
no code implementations • 8 Oct 2024 • Linping Zhang, Yu Liu, Xueqian Wang, Gang Li, You He
We reorganize datasets for CBRSOR tasks based on fine-grained ship remote sensing image slices (FGSRSI-23) and military aircraft recognition (MAR20) datasets.
no code implementations • 6 Jul 2024 • Weizhi Chen, Yaowen Li, Yu Liu, You He
State estimation is a fundamental problem for multi-sensor information fusion, essential in applications such as target tracking, power systems, and control automation.
2 code implementations • CVPR 2024 • Jiazuo Yu, Yunzhi Zhuge, Lu Zhang, Ping Hu, Dong Wang, Huchuan Lu, You He
Continual learning can empower vision-language models to continuously acquire new knowledge, without the need for access to the entire historical dataset.
no code implementations • CVPR 2024 • Shixin Hong, Yu Liu, Zhi Li, Shaohui Li, You He
Collaborative perception allows for information sharing between multiple agents such as vehicles and infrastructure to obtain a comprehensive view of the environment through communication and fusion.
no code implementations • 4 Oct 2023 • Siyuan Yang, Lu Zhang, Liqian Ma, Yu Liu, Jingjing Fu, You He
In this paper, we propose MagicRemover, a tuning-free method that leverages the powerful diffusion models for text-guided image inpainting.
1 code implementation • 5 Jun 2023 • Siyuan Yang, Lu Zhang, Yu Liu, Zhizhuo Jiang, You He
We construct a local-global context guidance strategy to capture the multi-perceptual embedding of the past fragment to boost the consistency of future prediction.
1 code implementation • CVPR 2023 • Wenda Zhao, Shigeng Xie, Fan Zhao, You He, Huchuan Lu
Conversely, detection task furnishes object semantic information to improve the infrared and visible image fusion.
no code implementations • CVPR 2022 • Shuai Liu, Xin Li, Huchuan Lu, You He
Multi-object tracking in unmanned aerial vehicle (UAV) videos is an important vision task and can be applied in a wide range of applications.
no code implementations • 8 Apr 2020 • Xiao Jiang, Gang Li, Yu Liu, Xiao-Ping Zhang, You He
To solve this problem, this paper presents a new homogeneous transformation model termed deep homogeneous feature fusion (DHFF) based on image style transfer (IST).
1 code implementation • CVPR 2019 • Yuxuan Sun, Chong Sun, Dong Wang, You He, Huchuan Lu
The ROI (region-of-interest) based pooling method performs pooling operations on the cropped ROI regions for various samples and has shown great success in the object detection methods.
no code implementations • CVPR 2018 • Lu Zhang, Ju Dai, Huchuan Lu, You He, Gang Wang
In this paper, we propose a novel bi-directional message passing model to integrate multi-level features for salient object detection.
Ranked #2 on
RGB Salient Object Detection
on ISTD