Search Results for author: Yichao Cao

Found 7 papers, 3 papers with code

Attributes Grouping and Mining Hashing for Fine-Grained Image Retrieval

no code implementations10 Nov 2023 Xin Lu, Shikun Chen, Yichao Cao, Xin Zhou, Xiaobo Lu

To handle this limitation, we substitute convolutional descriptors for attention-guided features and propose an Attributes Grouping and Mining Hashing (AGMH), which groups and embeds the category-specific visual attributes in multiple descriptors to generate a comprehensive feature representation for efficient fine-grained image retrieval.

Image Retrieval Retrieval

Detecting Any Human-Object Interaction Relationship: Universal HOI Detector with Spatial Prompt Learning on Foundation Models

1 code implementation NeurIPS 2023 Yichao Cao, Qingfei Tang, Xiu Su, Chen Song, Shan You, Xiaobo Lu, Chang Xu

We conduct a deep analysis of the three hierarchical features inherent in visual HOI detectors and propose a method for high-level relation extraction aimed at VL foundation models, which we call HO prompt-based learning.

Human-Object Interaction Detection Relation Extraction +1

Re-mine, Learn and Reason: Exploring the Cross-modal Semantic Correlations for Language-guided HOI detection

no code implementations ICCV 2023 Yichao Cao, Qingfei Tang, Feng Yang, Xiu Su, Shan You, Xiaobo Lu, Chang Xu

Human-Object Interaction (HOI) detection is a challenging computer vision task that requires visual models to address the complex interactive relationship between humans and objects and predict HOI triplets.

Human-Object Interaction Detection Sentence +1

A SOM-based Gradient-Free Deep Learning Method with Convergence Analysis

no code implementations12 Jan 2021 Shaosheng Xu, Jinde Cao, Yichao Cao, Tong Wang

As gradient descent method in deep learning causes a series of questions, this paper proposes a novel gradient-free deep learning structure.

STCNet: Spatio-Temporal Cross Network for Industrial Smoke Detection

2 code implementations10 Nov 2020 Yichao Cao, Qingfei Tang, Xiaobo Lu, Fan Li, Jinde Cao

To overcome these problems, a novel Spatio-Temporal Cross Network (STCNet) is proposed to recognize industrial smoke emissions.

Memory Group Sampling Based Online Action Recognition Using Kinetic Skeleton Features

no code implementations1 Nov 2020 Guoliang Liu, Qinghui Zhang, Yichao Cao, Junwei Li, Hao Wu, Guohui Tian

First, we combine the spatial and temporal skeleton features to depict the actions, which include not only the geometrical features, but also multi-scale motion features, such that both the spatial and temporal information of the action are covered.

Action Recognition

Cannot find the paper you are looking for? You can Submit a new open access paper.