Search Results for author: Zhihai He

Found 32 papers, 5 papers with code

Unsupervised Collaborative Metric Learning with Mixed-Scale Groups for General Object Retrieval

1 code implementation • 16 Mar 2024 • Shichao Kan, Yuhai Deng, Yixiong Liang, Lihui Cen, Zhe Qu, Yigang Cen, Zhihai He

This paper presents a novel unsupervised deep metric learning approach, termed unsupervised collaborative metric learning with mixed-scale groups (MS-UGCML), devised to learn embeddings for objects of varying scales.

Metric Learning Object +1

Paper
Code

Concept-Guided Prompt Learning for Generalization in Vision-Language Models

no code implementations • 15 Jan 2024 • Yi Zhang, Ce Zhang, Ke Yu, Yushun Tang, Zhihai He

However, for generalization tasks, the current fine-tuning methods for CLIP, such as CoOp and CoCoOp, demonstrate relatively low performance on some fine-grained datasets.

Paper
Add Code

Learning to Adapt CLIP for Few-Shot Monocular Depth Estimation

no code implementations • 2 Nov 2023 • Xueting Hu, Ce Zhang, Yi Zhang, Bowen Hai, Ke Yu, Zhihai He

When CLIP is used for depth estimation tasks, the patches, divided from the input images, can be combined with a series of semantic descriptions of the depth information to obtain similarity results.

Monocular Depth Estimation

Paper
Add Code

BDC-Adapter: Brownian Distance Covariance for Better Vision-Language Reasoning

no code implementations • 3 Sep 2023 • Yi Zhang, Ce Zhang, Zihan Liao, Yushun Tang, Zhihai He

Large-scale pre-trained Vision-Language Models (VLMs), such as CLIP and ALIGN, have introduced a new paradigm for learning transferable visual representations.

Paper
Add Code

Unsupervised Prototype Adapter for Vision-Language Models

no code implementations • 22 Aug 2023 • Yi Zhang, Ce Zhang, Xueting Hu, Zhihai He

To leverage the valuable knowledge encoded within these models for downstream tasks, several fine-tuning approaches, including prompt tuning methods and adapter-based methods, have been developed to adapt vision-language models effectively with supervision.

Domain Generalization

Paper
Add Code

Cross-Modal Concept Learning and Inference for Vision-Language Models

no code implementations • 28 Jul 2023 • Yi Zhang, Ce Zhang, Yushun Tang, Zhihai He

Based on these visual concepts, we construct a discriminative representation of images and learn a concept inference network to perform downstream image classification tasks, such as few-shot learning and domain generalization.

Domain Generalization Few-Shot Learning +1

Paper
Add Code

Cross-Inferential Networks for Source-free Unsupervised Domain Adaptation

no code implementations • 29 Jun 2023 • Yushun Tang, Qinghai Guo, Zhihai He

Our main idea is that, when we adapt the network model to predict the sample labels from encoded features, we use these prediction results to construct new training samples with derived labels to learn a new examiner network that performs a different but compatible task in the target domain.

Image Classification Unsupervised Domain Adaptation

Paper
Add Code

Critical Sampling for Robust Evolution Operator Learning of Unknown Dynamical Systems

no code implementations • 15 Apr 2023 • Ce Zhang, Kailiang Wu, Zhihai He

Given an unknown dynamical system, what is the minimum number of samples needed for effective learning of its governing laws and accurate prediction of its future evolution behavior, and how to select these critical samples?

Operator learning

Paper
Add Code

Self-Correctable and Adaptable Inference for Generalizable Human Pose Estimation

no code implementations • CVPR 2023 • Zhehan Kan, Shuoshuo Chen, Ce Zhang, Yushun Tang, Zhihai He

This strong correlation suggests that we can use this error as feedback to guide the correction process.

Pose Estimation

Paper
Add Code

Neuro-Modulated Hebbian Learning for Fully Test-Time Adaptation

no code implementations • CVPR 2023 • Yushun Tang, Ce Zhang, Heng Xu, Shuoshuo Chen, Jie Cheng, Luziwei Leng, Qinghai Guo, Zhihai He

We observe that the performance of this feed-forward Hebbian learning for fully test-time adaptation can be significantly improved by incorporating a feedback neuro-modulation layer.

Test-time Adaptation

Paper
Add Code

Contrastive Bayesian Analysis for Deep Metric Learning

1 code implementation • 10 Oct 2022 • Shichao Kan, Zhiquan He, Yigang Cen, Yang Li, Vladimir Mladenovic, Zhihai He

Recent methods for deep metric learning have been focusing on designing different contrastive loss functions between positive and negative pairs of samples so that the learned feature embedding is able to pull positive samples of the same class closer and push negative samples from different classes away from each other.

Contrastive Learning Metric Learning

Paper
Code

Coded Residual Transform for Generalizable Deep Metric Learning

no code implementations • 9 Oct 2022 • Shichao Kan, Yixiong Liang, Min Li, Yigang Cen, Jianxin Wang, Zhihai He

To address this challenge, in this paper, we introduce a new method called coded residual transform (CRT) for deep metric learning to significantly improve its generalization capability.

Metric Learning

Paper
Add Code

Self-Constrained Inference Optimization on Structural Groups for Human Pose Estimation

no code implementations • 6 Jul 2022 • Zhehan Kan, Shuoshuo Chen, Zeng Li, Zhihai He

This group-wise structural correlation can be explored to improve the accuracy and robustness of human pose estimation.

Ranked #1 on Multi-Person Pose Estimation on COCO test-dev

Inference Optimization Multi-Person Pose Estimation +1

Paper
Add Code

Self-Supervised Prime-Dual Networks for Few-Shot Image Classification

no code implementations • 29 Sep 2021 • Wenming Cao, Qifan Liu, Guang Liu, Zhihai He

We construct a prime-dual network structure for few-shot learning which establishes a commutative relationship between the support set and the query set, as well as a new self- supervision constraint for highly effective few-shot learning.

Few-Shot Image Classification Few-Shot Learning

Paper
Add Code

Invariance-Guided Feature Evolution for Few-Shot Learning

no code implementations • 29 Sep 2021 • Wenming Cao, Zhineng Zhao, Qifan Liu, Zhihai He

Few-shot learning (FSL) aims to characterize the inherent visual relationship between support and query samples which can be well generalized to unseen classes so that we can accurately infer the labels of query samples from very few support samples.

domain classification Few-Shot Learning +1

Paper
Add Code

Relative Order Analysis and Optimization for Unsupervised Deep Metric Learning

no code implementations • CVPR 2021 • Shichao Kan, Yigang Cen, Yang Li, Vladimir Mladenovic, Zhihai He

During training, this relative order prediction network and the feature embedding network are tightly coupled, providing mutual constraints to each other to improve overall metric learning performance in a cooperative manner.

Image Retrieval Metric Learning +1

Paper
Add Code

Spatial Assembly Networks for Image Representation Learning

no code implementations • CVPR 2021 • Yang Li, Shichao Kan, Jianhe Yuan, Wenming Cao, Zhihai He

It has been long recognized that deep neural networks are sensitive to changes in spatial configurations or scene structures.

Image Classification Image Retrieval +3

Paper
Add Code

Structure-Preserving Progressive Low-rank Image Completion for Defending Adversarial Attacks

no code implementations • 4 Mar 2021 • Zhiqun Zhao, Hengyou Wang, Hao Sun, Zhihai He

In this work, we propose to develop a structure-preserving progressive low-rank image completion (SPLIC) method to remove unneeded texture details from the input images and shift the bias of deep neural networks towards global object structures and semantic cues.

Adversarial Robustness Low-Rank Matrix Completion

Paper
Add Code

Consistency-Sensitivity Guided Ensemble Black-Box Adversarial Attacks in Low-Dimensional Spaces

no code implementations • ICCV 2021 • Jianhe Yuan, Zhihai He

Specifically, we esti-mate the victim model in the black box using a learned lin-ear composition of an ensemble of surrogate models withdiversified network structures.

Adversarial Attack

Paper
Add Code

Unsupervised Deep Metric Learning with Transformed Attention Consistency and Contrastive Clustering Loss

no code implementations • ECCV 2020 • Yang Li, Shichao Kan, Zhihai He

To further enhance the inter-class discriminative power of the feature generated by this network, we adapt the concept of triplet loss from supervised metric learning to our unsupervised case and introduce the contrastive clustering loss.

Clustering Metric Learning

Paper
Add Code

Ensemble Generative Cleaning with Feedback Loops for Defending Adversarial Attacks

no code implementations • CVPR 2020 • Jianhe Yuan, Zhihai He

In this paper, we develop a new method called ensemble generative cleaning with feedback loops (EGC-FL) for effective defense of deep neural networks.

Paper
Add Code

Reciprocal Learning Networks for Human Trajectory Prediction

no code implementations • CVPR 2020 • Hao Sun, Zhiqun Zhao, Zhihai He

Based on this unique property, we develop a new approach, called reciprocal learning, for human trajectory prediction.

Trajectory Prediction

Paper
Add Code

Generative Cleaning Networks with Quantized Nonlinear Transform for Deep Neural Network Defense

no code implementations • 25 Sep 2019 • Jianhe Yuan, Zhihai He

In this paper, we develop a new generative cleaning network with quantized nonlinear transform for effective defense of deep neural networks.

Paper
Add Code

Snowball: Iterative Model Evolution and Confident Sample Discovery for Semi-Supervised Learning on Very Small Labeled Datasets

no code implementations • 4 Sep 2019 • Yang Li, Jianhe Yuan, Zhiqun Zhao, Hao Sun, Zhihai He

In this work, we develop a joint sample discovery and iterative model evolution method for semi-supervised learning on very small labeled training sets.

Paper
Add Code

Progressive Neural Networks for Image Classification

no code implementations • 25 Apr 2018 • Zhi Zhang, Guanghan Ning, Yigang Cen, Yang Li, Zhiqun Zhao, Hao Sun, Zhihai He

The inference structures and computational complexity of existing deep neural networks, once trained, are fixed and remain the same for all test images.

Classification General Classification +1

Paper
Add Code

Dual Path Networks for Multi-Person Human Pose Estimation

no code implementations • 27 Oct 2017 • Guanghan Ning, Zhihai He

The task of multi-person human pose estimation in natural scenes is quite challenging.

Pose Estimation regression

Paper
Add Code

Knowledge Projection for Deep Neural Networks

no code implementations • 26 Oct 2017 • Zhi Zhang, Guanghan Ning, Zhihai He

In this paper, we will develop a new framework for training deep neural networks on datasets with limited labeled samples using cross-network knowledge projection which is able to improve the network performance while reducing the overall computational complexity significantly.

Paper
Add Code

Towards a Social Virtual Reality Learning Environment in High Fidelity

1 code implementation • 18 Jul 2017 • Chiara Zizza, Adam Starr, Devin Hudson, Sai Shreya Nuguri, Prasad Calyam, Zhihai He

Virtual Learning Environments (VLEs) are spaces designed to educate students remotely via online platforms.

Human-Computer Interaction Multimedia

Paper
Code

Knowledge-Guided Deep Fractal Neural Networks for Human Pose Estimation

1 code implementation • 5 May 2017 • Guanghan Ning, Zhi Zhang, Zhihai He

Human pose estimation using deep neural networks aims to map input images with large variations into multiple body keypoints which must satisfy a set of geometric constraints and inter-dependency imposed by the human body model.

Ranked #6 on Pose Estimation on Leeds Sports Poses

Pose Estimation

Paper
Code

Spatially Supervised Recurrent Convolutional Neural Networks for Visual Object Tracking

2 code implementations • 19 Jul 2016 • Guanghan Ning, Zhi Zhang, Chen Huang, Zhihai He, Xiaobo Ren, Haohong Wang

In this paper, we develop a new approach of spatially supervised recurrent convolutional neural networks for visual object tracking.

Binary Classification object-detection +3

879

Paper
Code

A Classification Leveraged Object Detector

no code implementations • 7 Apr 2016 • Miao Sun, Tony X. Han, Zhihai He

Currently, the state-of-the-art image classification algorithms outperform the best available object detector by a big margin in terms of average precision.

Classification General Classification +4

Paper
Add Code

Ensemble Video Object Cut in Highly Dynamic Scenes

no code implementations • CVPR 2013 • Xiaobo Ren, Tony X. Han, Zhihai He

We incorporate this similarity information into a graph-cut energy minimization framework for foreground object segmentation.

Change Detection Object +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.