Search Results for author: Hantao Yao

Found 17 papers, 8 papers with code

Hierarchical Prompts for Rehearsal-free Continual Learning

no code implementations21 Jan 2024 Yukun Zuo, Hantao Yao, Lu Yu, Liansheng Zhuang, Changsheng Xu

Nonetheless, these learnable prompts tend to concentrate on the discriminatory knowledge of the current task while ignoring past task knowledge, leading to that learnable prompts still suffering from catastrophic forgetting.

Continual Learning

Hierarchical Augmentation and Distillation for Class Incremental Audio-Visual Video Recognition

no code implementations11 Jan 2024 Yukun Zuo, Hantao Yao, Liansheng Zhuang, Changsheng Xu

We introduce Hierarchical Augmentation and Distillation (HAD), which comprises the Hierarchical Augmentation Module (HAM) and Hierarchical Distillation Module (HDM) to efficiently utilize the hierarchical structure of data and models, respectively.

Video Recognition

TCP:Textual-based Class-aware Prompt tuning for Visual-Language Model

1 code implementation30 Nov 2023 Hantao Yao, Rui Zhang, Changsheng Xu

However, those textual tokens have a limited generalization ability regarding unseen domains, as they cannot dynamically adjust to the distribution of testing classes.

Language Modelling

Camera-Incremental Object Re-Identification with Identity Knowledge Evolution

1 code implementation25 May 2023 Hantao Yao, Lu Yu, Jifei Luo, Changsheng Xu

In this paper, we propose a novel Identity Knowledge Evolution (IKE) framework for CIOR, consisting of the Identity Knowledge Association (IKA), Identity Knowledge Distillation (IKD), and Identity Knowledge Update (IKU).

Knowledge Distillation Object

Visual-Language Prompt Tuning with Knowledge-guided Context Optimization

1 code implementation CVPR 2023 Hantao Yao, Rui Zhang, Changsheng Xu

Representative CoOp-based work combines the learnable textual tokens with the class tokens to obtain specific textual knowledge.

Language Modelling

UTM: A Unified Multiple Object Tracking Model With Identity-Aware Feature Enhancement

no code implementations CVPR 2023 Sisi You, Hantao Yao, Bing-Kun Bao, Changsheng Xu

Recently, Multiple Object Tracking has achieved great success, which consists of object detection, feature embedding, and identity association.

Multiple Object Tracking object-detection +1

Dual Cluster Contrastive learning for Object Re-Identification

1 code implementation9 Dec 2021 Hantao Yao, Changsheng Xu

Unlike the individual-based updating mechanism, the centroid-based updating mechanism that applies the mean feature of each cluster to update the cluster memory can reduce the impact of individual samples.

Contrastive Learning Object +1

Hierarchical Granularity Transfer Learning

no code implementations NeurIPS 2020 Shaobo Min, Hongtao Xie, Hantao Yao, Xuran Deng, Zheng-Jun Zha, Yongdong Zhang

In this paper, we introduce a new task, named Hierarchical Granularity Transfer Learning (HGTL), to recognize sub-level categories with basic-level annotations and semantic descriptions for hierarchical categories.

Transfer Learning

Attribute-Induced Bias Eliminating for Transductive Zero-Shot Learning

no code implementations31 May 2020 Hantao Yao, Shaobo Min, Yongdong Zhang, Changsheng Xu

Then, an attentional graph attribute embedding is proposed to reduce the semantic bias between seen and unseen categories, which utilizes the graph operation to capture the semantic relationship between categories.

Attribute Transfer Learning +1

Joint Person Objectness and Repulsion for Person Search

no code implementations30 May 2020 Hantao Yao, Changsheng Xu

Based on this repulsion constraint, the repulsion term is proposed to reduce the similarity of distractor images that are not most similar to the probe person.

Human Detection Person Search

Domain-aware Visual Bias Eliminating for Generalized Zero-Shot Learning

1 code implementation CVPR 2020 Shaobo Min, Hantao Yao, Hongtao Xie, Chaoqun Wang, Zheng-Jun Zha, Yongdong Zhang

Recent methods focus on learning a unified semantic-aligned visual representation to transfer knowledge between two domains, while ignoring the effect of semantic-free visual representation in alleviating the biased recognition problem.

Generalized Zero-Shot Learning

Multi-Objective Matrix Normalization for Fine-grained Visual Recognition

1 code implementation30 Mar 2020 Shaobo Min, Hantao Yao, Hongtao Xie, Zheng-Jun Zha, Yongdong Zhang

In this paper, we propose an efficient Multi-Objective Matrix Normalization (MOMN) method that can simultaneously normalize a bilinear representation in terms of square-root, low-rank, and sparsity.

Fine-Grained Visual Recognition

Domain-Specific Embedding Network for Zero-Shot Recognition

1 code implementation12 Aug 2019 Shaobo Min, Hantao Yao, Hongtao Xie, Zheng-Jun Zha, Yongdong Zhang

In contrast to previous methods, the DSEN decomposes the domain-shared projection function into one domain-invariant and two domain-specific sub-functions to explore the similarities and differences between two domains.

Zero-Shot Learning

GLAD: Global-Local-Alignment Descriptor for Pedestrian Retrieval

no code implementations13 Sep 2017 Longhui Wei, Shiliang Zhang, Hantao Yao, Wen Gao, Qi Tian

Targeting to solve these problems, this work proposes a Global-Local-Alignment Descriptor (GLAD) and an efficient indexing and retrieval framework, respectively.

Person Re-Identification Representation Learning +1

Deep Representation Learning with Part Loss for Person Re-Identification

no code implementations4 Jul 2017 Hantao Yao, Shiliang Zhang, Yongdong Zhang, Jintao Li, Qi Tian

The representation learning risk is evaluated by the proposed part loss, which automatically generates several parts for an image, and computes the person classification loss on each part separately.

Classification General Classification +2

One-Shot Fine-Grained Instance Retrieval

no code implementations4 Jul 2017 Hantao Yao, Shiliang Zhang, Yongdong Zhang, Jintao Li, Qi Tian

Aiming to conquer this issue, we propose a retrieval task named One-Shot Fine-Grained Instance Retrieval (OSFGIR).

Fine-Grained Visual Categorization Image Retrieval +1

DR2-Net: Deep Residual Reconstruction Network for Image Compressive Sensing

1 code implementation19 Feb 2017 Hantao Yao, Feng Dai, Dongming Zhang, Yike Ma, Shiliang Zhang, Yongdong Zhang, Qi Tian

Accordingly, DR$^{2}$-Net consists of two components, \emph{i. e.,} linear mapping network and residual network, respectively.

Compressive Sensing Image Reconstruction

Cannot find the paper you are looking for? You can Submit a new open access paper.