no code implementations • 24 Aug 2024 • Wenhao Li, Yichao Cao, Xiu Su, Xi Lin, Shan You, Mingkai Zheng, Yi Chen, Chang Xu
It can generate high-quality videos with chain of off-the-shelf diffusion model experts, each expert responsible for a decoupled subtask.
1 code implementation • 18 May 2024 • Yezhuo Zhang, Zinan Zhou, Yichao Cao, Guangyu Li, Xuanpeng Li
With the rapid growth of the Internet of Things ecosystem, Automatic Modulation Classification (AMC) has become increasingly paramount.
no code implementations • 10 Nov 2023 • Xin Lu, Shikun Chen, Yichao Cao, Xin Zhou, Xiaobo Lu
To handle this limitation, we substitute convolutional descriptors for attention-guided features and propose an Attributes Grouping and Mining Hashing (AGMH), which groups and embeds the category-specific visual attributes in multiple descriptors to generate a comprehensive feature representation for efficient fine-grained image retrieval.
1 code implementation • NeurIPS 2023 • Yichao Cao, Qingfei Tang, Xiu Su, Chen Song, Shan You, Xiaobo Lu, Chang Xu
We conduct a deep analysis of the three hierarchical features inherent in visual HOI detectors and propose a method for high-level relation extraction aimed at VL foundation models, which we call HO prompt-based learning.
no code implementations • ICCV 2023 • Yichao Cao, Qingfei Tang, Feng Yang, Xiu Su, Shan You, Xiaobo Lu, Chang Xu
Human-Object Interaction (HOI) detection is a challenging computer vision task that requires visual models to address the complex interactive relationship between humans and objects and predict HOI triplets.
1 code implementation • 27 Jan 2022 • Feng Yang, Yichao Cao, Qifan Xue, Shuai Jin, Xuanpeng Li, Weigong Zhang
Learning a powerful representation from point clouds is a fundamental and challenging problem in the field of computer vision.
no code implementations • 12 Jan 2021 • Shaosheng Xu, Jinde Cao, Yichao Cao, Tong Wang
As gradient descent method in deep learning causes a series of questions, this paper proposes a novel gradient-free deep learning structure.
2 code implementations • 10 Nov 2020 • Yichao Cao, Qingfei Tang, Xiaobo Lu, Fan Li, Jinde Cao
To overcome these problems, a novel Spatio-Temporal Cross Network (STCNet) is proposed to recognize industrial smoke emissions.
no code implementations • 1 Nov 2020 • Guoliang Liu, Qinghui Zhang, Yichao Cao, Junwei Li, Hao Wu, Guohui Tian
First, we combine the spatial and temporal skeleton features to depict the actions, which include not only the geometrical features, but also multi-scale motion features, such that both the spatial and temporal information of the action are covered.