Search Results for author: Hantao Yao

Found 17 papers, 8 papers with code

Hierarchical Prompts for Rehearsal-free Continual Learning

no code implementations • 21 Jan 2024 • Yukun Zuo, Hantao Yao, Lu Yu, Liansheng Zhuang, Changsheng Xu

Nonetheless, these learnable prompts tend to concentrate on the discriminatory knowledge of the current task while ignoring past task knowledge, leading to that learnable prompts still suffering from catastrophic forgetting.

Continual Learning

Paper
Add Code

Hierarchical Augmentation and Distillation for Class Incremental Audio-Visual Video Recognition

no code implementations • 11 Jan 2024 • Yukun Zuo, Hantao Yao, Liansheng Zhuang, Changsheng Xu

We introduce Hierarchical Augmentation and Distillation (HAD), which comprises the Hierarchical Augmentation Module (HAM) and Hierarchical Distillation Module (HDM) to efficiently utilize the hierarchical structure of data and models, respectively.

Video Recognition

Paper
Add Code

TCP:Textual-based Class-aware Prompt tuning for Visual-Language Model

1 code implementation • 30 Nov 2023 • Hantao Yao, Rui Zhang, Changsheng Xu

However, those textual tokens have a limited generalization ability regarding unseen domains, as they cannot dynamically adjust to the distribution of testing classes.

Language Modelling

Paper
Code

Camera-Incremental Object Re-Identification with Identity Knowledge Evolution

1 code implementation • 25 May 2023 • Hantao Yao, Lu Yu, Jifei Luo, Changsheng Xu

In this paper, we propose a novel Identity Knowledge Evolution (IKE) framework for CIOR, consisting of the Identity Knowledge Association (IKA), Identity Knowledge Distillation (IKD), and Identity Knowledge Update (IKU).

Knowledge Distillation Object

Paper
Code

Visual-Language Prompt Tuning with Knowledge-guided Context Optimization

1 code implementation • CVPR 2023 • Hantao Yao, Rui Zhang, Changsheng Xu

Representative CoOp-based work combines the learnable textual tokens with the class tokens to obtain specific textual knowledge.

Language Modelling

Paper
Code

UTM: A Unified Multiple Object Tracking Model With Identity-Aware Feature Enhancement

no code implementations • CVPR 2023 • Sisi You, Hantao Yao, Bing-Kun Bao, Changsheng Xu

Recently, Multiple Object Tracking has achieved great success, which consists of object detection, feature embedding, and identity association.

Multiple Object Tracking object-detection +1

Paper
Add Code

Dual Cluster Contrastive learning for Object Re-Identification

1 code implementation • 9 Dec 2021 • Hantao Yao, Changsheng Xu

Unlike the individual-based updating mechanism, the centroid-based updating mechanism that applies the mean feature of each cluster to update the cluster memory can reduce the impact of individual samples.

Ranked #50 on Person Re-Identification on Market-1501

Contrastive Learning Object +1

Paper
Code

Hierarchical Granularity Transfer Learning

no code implementations • NeurIPS 2020 • Shaobo Min, Hongtao Xie, Hantao Yao, Xuran Deng, Zheng-Jun Zha, Yongdong Zhang

In this paper, we introduce a new task, named Hierarchical Granularity Transfer Learning (HGTL), to recognize sub-level categories with basic-level annotations and semantic descriptions for hierarchical categories.

Transfer Learning

Paper
Add Code

Attribute-Induced Bias Eliminating for Transductive Zero-Shot Learning

no code implementations • 31 May 2020 • Hantao Yao, Shaobo Min, Yongdong Zhang, Changsheng Xu

Then, an attentional graph attribute embedding is proposed to reduce the semantic bias between seen and unseen categories, which utilizes the graph operation to capture the semantic relationship between categories.

Attribute Transfer Learning +1

Paper
Add Code

Joint Person Objectness and Repulsion for Person Search

no code implementations • 30 May 2020 • Hantao Yao, Changsheng Xu

Based on this repulsion constraint, the repulsion term is proposed to reduce the similarity of distractor images that are not most similar to the probe person.

Human Detection Person Search

Paper
Add Code

Domain-aware Visual Bias Eliminating for Generalized Zero-Shot Learning

1 code implementation • CVPR 2020 • Shaobo Min, Hantao Yao, Hongtao Xie, Chaoqun Wang, Zheng-Jun Zha, Yongdong Zhang

Recent methods focus on learning a unified semantic-aligned visual representation to transfer knowledge between two domains, while ignoring the effect of semantic-free visual representation in alleviating the biased recognition problem.

Generalized Zero-Shot Learning

Paper
Code

Multi-Objective Matrix Normalization for Fine-grained Visual Recognition

1 code implementation • 30 Mar 2020 • Shaobo Min, Hantao Yao, Hongtao Xie, Zheng-Jun Zha, Yongdong Zhang

In this paper, we propose an efficient Multi-Objective Matrix Normalization (MOMN) method that can simultaneously normalize a bilinear representation in terms of square-root, low-rank, and sparsity.

Fine-Grained Visual Recognition

Paper
Code

Domain-Specific Embedding Network for Zero-Shot Recognition

1 code implementation • 12 Aug 2019 • Shaobo Min, Hantao Yao, Hongtao Xie, Zheng-Jun Zha, Yongdong Zhang

In contrast to previous methods, the DSEN decomposes the domain-shared projection function into one domain-invariant and two domain-specific sub-functions to explore the similarities and differences between two domains.

Zero-Shot Learning

Paper
Code

GLAD: Global-Local-Alignment Descriptor for Pedestrian Retrieval

no code implementations • 13 Sep 2017 • Longhui Wei, Shiliang Zhang, Hantao Yao, Wen Gao, Qi Tian

Targeting to solve these problems, this work proposes a Global-Local-Alignment Descriptor (GLAD) and an efficient indexing and retrieval framework, respectively.

Ranked #93 on Person Re-Identification on Market-1501

Person Re-Identification Representation Learning +1

Paper
Add Code

Deep Representation Learning with Part Loss for Person Re-Identification

no code implementations • 4 Jul 2017 • Hantao Yao, Shiliang Zhang, Yongdong Zhang, Jintao Li, Qi Tian

The representation learning risk is evaluated by the proposed part loss, which automatically generates several parts for an image, and computes the person classification loss on each part separately.

Ranked #97 on Person Re-Identification on Market-1501

Classification General Classification +2