Search Results for author: Aihua Zheng

Found 22 papers, 10 papers with code

Which Model to Transfer? A Survey on Transferability Estimation

no code implementations23 Feb 2024 Yuhe Ding, Bo Jiang, Aijing Yu, Aihua Zheng, Jian Liang

In this survey, we present the first review of existing advances in this area and categorize them into two separate realms: source-free model transferability estimation and source-dependent model transferability estimation.

Transfer Learning

Unleashing the power of Neural Collapse for Transferability Estimation

no code implementations9 Oct 2023 Yuhe Ding, Bo Jiang, Lijun Sheng, Aihua Zheng, Jian Liang

Transferability estimation aims to provide heuristics for quantifying how suitable a pre-trained model is for a specific downstream task, without fine-tuning them all.

Fairness Image Classification +3

Multi-query Vehicle Re-identification: Viewpoint-conditioned Network, Unified Dataset and New Metric

no code implementations25 May 2023 Aihua Zheng, Chaobin Zhang, Weijun Zhang, Chenglong Li, Jin Tang, Chang Tan, Ruoran Jia

Existing vehicle re-identification methods mainly rely on the single query, which has limited information for vehicle representation and thus significantly hinders the performance of vehicle Re-ID in complicated surveillance networks.

Scene Recognition Vehicle Re-Identification

Flare-Aware Cross-modal Enhancement Network for Multi-spectral Vehicle Re-identification

1 code implementation23 May 2023 Aihua Zheng, Zhiqi Ma, Zi Wang, Chenglong Li

Finally, to evaluate the proposed FACENet in handling intense flare, we introduce a new multi-spectral vehicle re-ID dataset, called WMVEID863, with additional challenges such as motion blur, significant background changes, and particularly intense flare degradation.

Vehicle Re-Identification

MODIFY: Model-driven Face Stylization without Style Images

1 code implementation17 Mar 2023 Yuhe Ding, Jian Liang, Jie Cao, Aihua Zheng, Ran He

Briefly, MODIFY first trains a generative model in the target domain and then translates a source input to the target domain via the provided style model.

Translation

MAPS: A Noise-Robust Progressive Learning Approach for Source-Free Domain Adaptive Keypoint Detection

1 code implementation9 Feb 2023 Yuhe Ding, Jian Liang, Bo Jiang, Aihua Zheng, Ran He

Existing cross-domain keypoint detection methods always require accessing the source data during adaptation, which may violate the data privacy law and pose serious security concerns.

Data Augmentation Keypoint Detection

Parallel Augmentation and Dual Enhancement for Occluded Person Re-identification

1 code implementation11 Oct 2022 Zi Wang, Huaibo Huang, Aihua Zheng, Chenglong Li, Ran He

To alleviate these two issues, we propose a simple yet effective method with Parallel Augmentation and Dual Enhancement (PADE), which is robust on both occluded and non-occluded data and does not require any auxiliary clues.

Person Re-Identification

Multi-spectral Vehicle Re-identification with Cross-directional Consistency Network and a High-quality Benchmark

1 code implementation1 Aug 2022 Aihua Zheng, Xianpeng Zhu, Zhiqi Ma, Chenglong Li, Jin Tang, Jixin Ma

In particular, we design a new cross-directional center loss to pull the modality centers of each identity close to mitigate cross-modality discrepancy, while the sample centers of each identity close to alleviate the sample discrepancy.

Vehicle Re-Identification

Disentangled Generation Network for Enlarged License Plate Recognition and A Unified Dataset

no code implementations2 Jun 2022 Chenglong Li, Xiaobin Yang, Guohao Wang, Aihua Zheng, Chang Tan, Ruoran Jia, Jin Tang

License plate recognition plays a critical role in many practical applications, but license plates of large vehicles are difficult to be recognized due to the factors of low resolution, contamination, low illumination, and occlusion, to name a few.

Disentanglement License Plate Recognition +2

ProxyMix: Proxy-based Mixup Training with Label Refinery for Source-Free Domain Adaptation

2 code implementations29 May 2022 Yuhe Ding, Lijun Sheng, Jian Liang, Aihua Zheng, Ran He

First of all, to avoid additional parameters and explore the information in the source model, ProxyMix defines the weights of the classifier as the class prototypes and then constructs a class-balanced proxy source domain by the nearest neighbors of the prototypes to bridge the unseen source domain and the target domain.

Object Recognition Source-Free Domain Adaptation +1

Adversarial-Metric Learning for Audio-Visual Cross-Modal Matching

1 code implementation IEEE Transactions on Multimedia 2021 Aihua Zheng, Menglan Hu, Bo Jiang *, Yan Huang, Yan Yan, and Bin Luo

AML aims to generate a modality-independent representation for each person in each modality via adversarial learning, while simultaneously learns a robust similarity measure for cross-modality matching via metric learning.

audio-visual learning Metric Learning +1

Viewpoint-aware Progressive Clustering for Unsupervised Vehicle Re-identification

no code implementations18 Nov 2020 Aihua Zheng, Xia Sun, Chenglong Li, Jin Tang

Comprehensive experiments against the state-of-the-art methods on two multi-viewpoint benchmark datasets VeRi and VeRi-Wild validate the promising performance of the proposed method in both with and without domain adaption scenarios while handling unsupervised vehicle Re-ID.

Clustering Domain Adaptation +2

Unsupervised Contrastive Photo-to-Caricature Translation based on Auto-distortion

no code implementations10 Nov 2020 Yuhe Ding, Xin Ma, Mandi Luo, Aihua Zheng, Ran He

Considering the intuitive artifacts in the existing methods, we propose a contrastive style loss for style rendering to enforce the similarity between the style of rendered photo and the caricature, and simultaneously enhance its discrepancy to the photos.

Caricature Photo-To-Caricature Translation +1

Lets Play Music: Audio-driven Performance Video Generation

no code implementations5 Nov 2020 Hao Zhu, Yi Li, Feixia Zhu, Aihua Zheng, Ran He

We propose a new task named Audio-driven Per-formance Video Generation (APVG), which aims to synthesizethe video of a person playing a certain instrument guided bya given music audio clip.

Video Generation

STADB: A Self-Thresholding Attention Guided ADB Network for Person Re-identification

1 code implementation7 Jul 2020 Bo Jiang, Sheng Wang, Xiao Wang, Aihua Zheng

Specifically, STADB first obtains an attention map by channel-wise pooling and returns a drop mask by thresholding the attention map.

Person Re-Identification

Deep Audio-Visual Learning: A Survey

no code implementations14 Jan 2020 Hao Zhu, Mandi Luo, Rui Wang, Aihua Zheng, Ran He

Audio-visual learning, aimed at exploiting the relationship between audio and visual modalities, has drawn considerable attention since deep learning started to be used successfully.

audio-visual learning Representation Learning

Multi-Adapter RGBT Tracking

no code implementations17 Jul 2019 Chenglong Li, Andong Lu, Aihua Zheng, Zhengzheng Tu, Jin Tang

In a specific, the generality adapter is to extract shared object representations, the modality adapter aims at encoding modality-specific information to deploy their complementary advantages, and the instance adapter is to model the appearance properties and temporal variations of a certain object.

Visual Tracking

Attributes Guided Feature Learning for Vehicle Re-identification

no code implementations22 May 2019 Hongchao Li, Xianmin Lin, Aihua Zheng, Chenglong Li, Bin Luo, Ran He, Amir Hussain

In particular, our network is end-to-end trained and contains three subnetworks of deep features embedded by the corresponding attributes (i. e., camera view, vehicle type and vehicle color).

Generative Adversarial Network Vehicle Re-Identification

Pedestrian Attribute Recognition: A Survey

1 code implementation22 Jan 2019 Xiao Wang, Shaofei Zheng, Rui Yang, Aihua Zheng, Zhe Chen, Jin Tang, Bin Luo

We also review some popular network architectures which have been widely applied in the deep learning community.

Attribute Multi-Label Learning +2

Arbitrary Talking Face Generation via Attentional Audio-Visual Coherence Learning

no code implementations17 Dec 2018 Hao Zhu, Huaibo Huang, Yi Li, Aihua Zheng, Ran He

Talking face generation aims to synthesize a face video with precise lip synchronization as well as a smooth transition of facial motion over the entire video via the given speech clip and facial image.

Talking Face Generation

A Unified RGB-T Saliency Detection Benchmark: Dataset, Baselines, Analysis and A Novel Approach

1 code implementation11 Jan 2017 Chenglong Li, Guizhao Wang, Yunpeng Ma, Aihua Zheng, Bin Luo, Jin Tang

In particular, we introduce a weight for each modality to describe the reliability, and integrate them into the graph-based manifold ranking algorithm to achieve adaptive fusion of different source data.

Saliency Detection

Cannot find the paper you are looking for? You can Submit a new open access paper.