Search Results for author: Hanzi Wang

Found 61 papers, 13 papers with code

Augmentation Matters: A Mix-Paste Method for X-Ray Prohibited Item Detection under Noisy Annotations

1 code implementation 3 Jan 2025 Ruikang Chen, Yan Yan, Jing-Hao Xue, Yang Lu, Hanzi Wang

However, obtaining correct annotations is extremely hard if not impossible for large-scale X-ray images, where item overlapping is ubiquitous. As a result, X-ray images are easily contaminated with noisy annotations, leading to performance deterioration of existing methods. In this paper, we address the challenging problem of training a robust prohibited item detector under noisy annotations (including both category noise and bounding box noise) from a novel perspective of data augmentation, and propose an effective label-aware mixed patch paste augmentation method (Mix-Paste).

Data Augmentation

Video-to-Task Learning via Motion-Guided Attention for Few-Shot Action Recognition

no code implementations 18 Nov 2024 Hanyu Guo, Wanchuan Yu, Suzhou Que, Kaiwen Du, Yan Yan, Hanzi Wang

In this paper, we propose a novel Dual Motion-Guided Attention Learning method (called DMGAL) for few-shot action recognition, aiming to learn the spatio-temporal relationships from the video-specific to the task-specific level.

Few-Shot action recognition Few Shot Action Recognition

Transitive Vision-Language Prompt Learning for Domain Generalization

no code implementations 29 Apr 2024 Liyuan Wang, Yan Jin, Zhen Chen, Jinlin Wu, Mengke Li, Yang Lu, Hanzi Wang

Vision-language pre-training has enabled deep models to make a huge step forward in generalizing across unseen domains.

Domain Generalization

Dynamically Anchored Prompting for Task-Imbalanced Continual Learning

1 code implementation 23 Apr 2024 Chenxing Hong, Yan Jin, Zhiqi Kang, Yizhou Chen, Mengke Li, Yang Lu, Hanzi Wang

We find that imbalanced tasks significantly challenge the capability of models to control the trade-off between stability and plasticity from the perspective of recent prompt-based continual learning methods.

Continual Learning

Frequency Domain Nuances Mining for Visible-Infrared Person Re-identification

no code implementations 4 Jan 2024 Yukang Zhang, Yang Lu, Yan Yan, Hanzi Wang, Xuelong Li

Specifically, we propose a novel Frequency Domain Nuances Mining (FDNM) method to explore the cross-modality frequency domain information, which mainly includes an amplitude guided phase (AGP) module and an amplitude nuances mining (ANM) module.

Face Recognition Person Re-Identification

Federated Learning with Extremely Noisy Clients via Negative Distillation

1 code implementation 20 Dec 2023 Yang Lu, Lin Chen, Yonggang Zhang, Yiliang Zhang, Bo Han, Yiu-ming Cheung, Hanzi Wang

The model trained on noisy labels serves as a 'bad teacher' in knowledge distillation, aiming to decrease the risk of providing incorrect information.

Federated Learning Knowledge Distillation

Spatial-Contextual Discrepancy Information Compensation for GAN Inversion

1 code implementation 12 Dec 2023 Ziqiang Zhang, Yan Yan, Jing-Hao Xue, Hanzi Wang

SDIC follows a "compensate-and-edit" paradigm and successfully bridges the gap in image details between the original image and the reconstructed/edited image.

Towards Unseen Triples: Effective Text-Image-joint Learning for Scene Graph Generation

no code implementations 23 Jun 2023 Qianji Di, Wenxi Ma, Zhongang Qi, Tianxiang Hou, Ying Shan, Hanzi Wang

In this work, we propose a Text-Image-joint Scene Graph Generation (TISGG) model to resolve the unseen triples and improve the generalisation capability of the SGG models.

Graph Generation Scene Graph Generation +1

PARFormer: Transformer-based Multi-Task Network for Pedestrian Attribute Recognition

1 code implementation 14 Apr 2023 Xinwen Fan, Yukang Zhang, Yang Lu, Hanzi Wang

Pedestrian attribute recognition (PAR) has received increasing attention because of its wide application in video surveillance and pedestrian analysis.

Attribute Data Augmentation +1

Long-Tailed Visual Recognition via Self-Heterogeneous Integration with Knowledge Excavation

1 code implementation CVPR 2023 Yan Jin, Mengke Li, Yang Lu, Yiu-ming Cheung, Hanzi Wang

To address this problem, state-of-the-art methods usually adopt a mixture of experts (MoE) to focus on different parts of the long-tailed distribution.

Transfer Learning

Personalized Federated Learning on Long-Tailed Data via Adversarial Feature Augmentation

1 code implementation 27 Mar 2023 Yang Lu, Pinxin Qian, Gang Huang, Hanzi Wang

Personalized Federated Learning (PFL) aims to learn personalized models for each client based on the knowledge across all clients in a privacy-preserving manner.

Personalized Federated Learning Privacy Preserving

MRCN: A Novel Modality Restitution and Compensation Network for Visible-Infrared Person Re-identification

no code implementations 26 Mar 2023 Yukang Zhang, Yan Yan, Jie Li, Hanzi Wang

Furthermore, to better disentangle the modality-relevant features and the modality-irrelevant features, we propose a novel Center-Quadruplet Causal (CQC) loss to encourage the network to effectively learn the modality-relevant features and the modality-irrelevant features.

Person Re-Identification

Diverse Embedding Expansion Network and Low-Light Cross-Modality Benchmark for Visible-Infrared Person Re-identification

1 code implementation CVPR 2023 Yukang Zhang, Hanzi Wang

The proposed DEEN can effectively generate diverse embeddings to learn the informative feature representations and reduce the modality discrepancy between the VIS and IR images.

Cross-Modal Person Re-Identification

Federated Semi-Supervised Learning with Annotation Heterogeneity

no code implementations 4 Mar 2023 Xinyi Shang, Gang Huang, Yang Lu, Jian Lou, Bo Han, Yiu-ming Cheung, Hanzi Wang

Federated Semi-Supervised Learning (FSSL) aims to learn a global model from different clients in an environment with both labeled and unlabeled data.

DPTNet: A Dual-Path Transformer Architecture for Scene Text Detection

no code implementations 21 Aug 2022 Jingyu Lin, Jie Jiang, Yan Yan, Chunchao Guo, Hongfa Wang, Wei Liu, Hanzi Wang

We further propose a parallel design that integrates the convolutional network with a powerful self-attention mechanism to provide complementary clues between the attention path and convolutional path.

Scene Text Detection Text Detection

Label-Noise Learning with Intrinsically Long-Tailed Data

1 code implementation ICCV 2023 Yang Lu, Yiliang Zhang, Bo Han, Yiu-ming Cheung, Hanzi Wang

In this case, it is hard to distinguish clean samples from noisy samples on the intrinsic tail classes with the unknown intrinsic class distribution.

Learn-to-Decompose: Cascaded Decomposition Network for Cross-Domain Few-Shot Facial Expression Recognition

1 code implementation 16 Jul 2022 Xinyi Zou, Yan Yan, Jing-Hao Xue, Si Chen, Hanzi Wang

Extensive experiments on both in-the-lab and in-the-wild compound expression datasets demonstrate the superiority of our proposed CDNet against several state-of-the-art FSL methods.

cross-domain few-shot learning Facial Expression Recognition +1

FEDIC: Federated Learning on Non-IID and Long-Tailed Data via Calibrated Distillation

1 code implementation 30 Apr 2022 Xinyi Shang, Yang Lu, Yiu-ming Cheung, Hanzi Wang

Federated learning provides a privacy guarantee for generating good deep learning models on distributed clients with different kinds of data.

Federated Learning Long-tail Learning

Federated Learning on Heterogeneous and Long-Tailed Data via Classifier Re-Training with Federated Features

2 code implementations 28 Apr 2022 Xinyi Shang, Yang Lu, Gang Huang, Hanzi Wang

Experiments on several benchmark datasets show that the proposed CReFF is an effective solution to obtain a promising FL model under heterogeneous and long-tailed data.

Federated Learning Privacy Preserving

Deep Multi-Branch Aggregation Network for Real-Time Semantic Segmentation in Street Scenes

no code implementations 8 Mar 2022 Xi Weng, Yan Yan, Genshun Dong, Chang Shu, Biao Wang, Hanzi Wang, Ji Zhang

This shows that DMA-Net provides a good tradeoff between segmentation quality and speed for semantic segmentation in street scenes.

Decoder Real-Time Semantic Segmentation +1

Stage-Aware Feature Alignment Network for Real-Time Semantic Segmentation of Street Scenes

no code implementations 8 Mar 2022 Xi Weng, Yan Yan, Si Chen, Jing-Hao Xue, Hanzi Wang

In this paper, we present a novel Stage-aware Feature Alignment Network (SFANet) based on the encoder-decoder structure for real-time semantic segmentation of street scenes.

Decoder Real-Time Semantic Segmentation +1

When Facial Expression Recognition Meets Few-Shot Learning: A Joint and Alternate Learning Framework

no code implementations 18 Jan 2022 Xinyi Zou, Yan Yan, Jing-Hao Xue, Si Chen, Hanzi Wang

To alleviate the problem of limited base classes in our FER task, we propose a novel Emotion Guided Similarity Network (EGS-Net), consisting of an emotion branch and a similarity branch, based on a two-stage learning framework.

cross-domain few-shot learning Facial Expression Recognition +1

TSGB: Target-Selective Gradient Backprop for Probing CNN Visual Saliency

1 code implementation 11 Oct 2021 Lin Cheng, Pengfei Fang, Yanjie Liang, Liao Zhang, Chunhua Shen, Hanzi Wang

Inspired by those observations, we propose a novel visual saliency method, termed Target-Selective Gradient Backprop (TSGB), which leverages rectification operations to effectively emphasize target classes and further efficiently propagate the saliency to the image space, thereby generating target-selective and fine-grained saliency maps.

Learning Spatial-Semantic Relationship for Facial Attribute Recognition With Limited Labeled Data

no code implementations CVPR 2021 Ying Shu, Yan Yan, Si Chen, Jing-Hao Xue, Chunhua Shen, Hanzi Wang

First, three auxiliary tasks, consisting of a Patch Rotation Task (PRT), a Patch Segmentation Task (PST), and a Patch Classification Task (PCT), are jointly developed to learn the spatial-semantic relationship from large-scale unlabeled facial data.

Attribute Facial Attribute Classification +1

Hierarchical Representation via Message Propagation for Robust Model Fitting

no code implementations 29 Dec 2020 Shuyuan Lin, Xing Wang, Guobao Xiao, Yan Yan, Hanzi Wang

In this paper, we propose a novel hierarchical representation via message propagation (HRMP) method for robust model fitting, which simultaneously takes advantage of both consensus analysis and preference analysis to estimate the parameters of multiple model instances from data corrupted by outliers.

Robust Visual Tracking via Statistical Positive Sample Generation and Gradient Aware Learning

no code implementations 9 Nov 2020 Lijian Lin, Haosheng Chen, Yanjie Liang, Yan Yan, Hanzi Wang

In this paper, we propose a robust tracking method via Statistical Positive sample generation and Gradient Aware learning (SPGA) to address the above two limitations.

Diversity Visual Tracking

Dual Semantic Fusion Network for Video Object Detection

no code implementations 16 Sep 2020 Lijian Lin, Haosheng Chen, Honglun Zhang, Jun Liang, Yu Li, Ying Shan, Hanzi Wang

Video object detection is a tough task due to the deteriorated quality of video sequences captured under complex environments.

Object object-detection +2

Correlation filter tracking with adaptive proposal selection for accurate scale estimation

no code implementations 14 Jul 2020 Luo Xiong, Yanjie Liang, Yan Yan, Hanzi Wang

In this paper, we propose an adaptive proposal selection algorithm which can generate a small number of high-quality proposals to handle the problem of scale variations for visual object tracking.

Visual Object Tracking

Real-Time High-Performance Semantic Image Segmentation of Urban Street Scenes

no code implementations 11 Mar 2020 Genshun Dong, Yan Yan, Chunhua Shen, Hanzi Wang

Meanwhile, a Spatial detail-Preserving Network (SPN) with shallow convolutional layers is designed to generate high-resolution feature maps preserving the detailed spatial information.

Image Segmentation Segmentation +2

Learning Object Scale With Click Supervision for Object Detection

no code implementations 20 Feb 2020 Liao Zhang, Yan Yan, Lin Cheng, Hanzi Wang

Finally, we fuse these CAMs together to generate pseudo ground-truths and train a fully-supervised object detector with these ground-truths.

Object object-detection +1

End-to-end Learning of Object Motion Estimation from Retinal Events for Event-based Object Tracking

no code implementations 14 Feb 2020 Haosheng Chen, David Suter, Qiangqiang Wu, Hanzi Wang

We feed the sequence of TSLTD frames to a novel Retinal Motion Regression Network (RMRNet) to perform an end-to-end 5-DoF object motion regression.

Motion Estimation Object +2

Asynchronous Tracking-by-Detection on Adaptive Time Surfaces for Event-based Object Tracking

no code implementations 13 Feb 2020 Haosheng Chen, Qiangqiang Wu, Yanjie Liang, Xinbo Gao, Hanzi Wang

To achieve this goal, we present an Adaptive Time-Surface with Linear Time Decay (ATSLTD) event-to-frame conversion algorithm, which asynchronously and effectively warps the spatio-temporal information of asynchronous retinal events to a sequence of ATSLTD frames with clear object contours.

Object Object Tracking

Hypergraph Optimization for Multi-structural Geometric Model Fitting

no code implementations 13 Feb 2020 Shuyuan Lin, Guobao Xiao, Yan Yan, David Suter, Hanzi Wang

Recently, some hypergraph-based methods have been proposed to deal with the problem of model fitting in computer vision, mainly due to the superior capability of hypergraph to represent the complex relationship between data points.


Deep Multi-task Multi-label CNN for Effective Facial Attribute Classification

no code implementations 10 Feb 2020 Longbiao Mao, Yan Yan, Jing-Hao Xue, Hanzi Wang

Two different network architectures are respectively designed to extract features for two groups of attributes, and a novel dynamic weighting scheme is proposed to automatically assign the loss weight to each facial attribute during training.

Attribute Face Detection +5

Joint Deep Learning of Facial Expression Synthesis and Recognition

no code implementations 6 Feb 2020 Yan Yan, Ying Huang, Si Chen, Chunhua Shen, Hanzi Wang

Firstly, a facial expression synthesis generative adversarial network (FESGAN) is pre-trained to generate facial images with different facial expressions.

Deep Learning Facial Expression Recognition +2

Hallucinated Adversarial Learning for Robust Visual Tracking

no code implementations 17 Jun 2019 Qiangqiang Wu, Zhihui Chen, Lin Cheng, Yan Yan, Bo Li, Hanzi Wang

Incorporating such an ability to hallucinate diverse new samples of the tracked instance can help the trackers alleviate the over-fitting problem in the low-data tracking regime.

Visual Tracking

DSNet: Deep and Shallow Feature Learning for Efficient Visual Tracking

no code implementations 6 Nov 2018 Qiangqiang Wu, Yan Yan, Yanjie Liang, Yi Liu, Hanzi Wang

In recent years, Discriminative Correlation Filter (DCF) based tracking methods have achieved great success in visual tracking.

Image Classification Visual Tracking

Superpixel-guided Two-view Deterministic Geometric Model Fitting

no code implementations 3 May 2018 Guobao Xiao, Hanzi Wang, Yan Yan, David Suter

Specifically, SDF includes three main parts: a deterministic sampling algorithm, a model hypothesis updating strategy and a novel model selection algorithm.

Model Selection Superpixels +1

Multi-task Learning of Cascaded CNN for Facial Attribute Classification

no code implementations 3 May 2018 Ni Zhuang, Yan Yan, Si Chen, Hanzi Wang

In order to address the above problems, we propose a novel multi-task learning of cascaded convolutional neural network method, termed MCFA, for predicting multiple facial attributes simultaneously.

Attribute Classification +5

Multi-label Learning Based Deep Transfer Neural Network for Facial Attribute Classification

no code implementations 3 May 2018 Ni Zhuang, Yan Yan, Si Chen, Hanzi Wang, Chunhua Shen

To address the above problem, we propose a novel deep transfer neural network method based on multi-label learning for facial attribute classification, termed FMTNet, which consists of three sub-networks: the Face detection Network (FNet), the Multi-label learning Network (MNet) and the Transfer learning Network (TNet).

Attribute Classification +6

A Fast Face Detection Method via Convolutional Neural Network

no code implementations 27 Mar 2018 Guanjun Guo, Hanzi Wang, Yan Yan, Jin Zheng, Bo Li

Current face or object detection methods via convolutional neural network (such as OverFeat, R-CNN and DenseNet) explicitly extract multi-scale features based on an image pyramid.

Face Detection object-detection +1

A New Target-specific Object Proposal Generation Method for Visual Tracking

no code implementations 27 Mar 2018 Guanjun Guo, Hanzi Wang, Yan Yan, Hong-Yuan Mark Liao, Bo Li

Then, we apply the proposed TOPG method to the task of visual tracking and propose a TOPG-based tracker (called TOPGT), where TOPG is used as a sample selection strategy to select a small number of high-quality target candidates from the generated object proposals.

Object Object Proposal Generation +1

Single Image Super-Resolution via Cascaded Multi-Scale Cross Network

no code implementations 24 Feb 2018 Yanting Hu, Xinbo Gao, Jie Li, Yuanfei Huang, Hanzi Wang

To improve information flow and to capture sufficient knowledge for reconstructing the high-frequency details, we propose a cascaded multi-scale cross network (CMSC) in which a sequence of subnetworks is cascaded to infer high resolution features in a coarse-to-fine manner.

Image Reconstruction Image Super-Resolution

Searching for Representative Modes on Hypergraphs for Robust Geometric Model Fitting

no code implementations 4 Feb 2018 Hanzi Wang, Guobao Xiao, Yan Yan, David Suter

We cast the task of geometric model fitting as a representative mode-seeking problem on hypergraphs.

Automatic Image Cropping for Visual Aesthetic Enhancement Using Deep Neural Networks and Cascaded Regression

no code implementations 25 Dec 2017 Guanjun Guo, Hanzi Wang, Chunhua Shen, Yan Yan, Hong-Yuan Mark Liao

The deep CNN model is then designed to extract features from several image cropping datasets, upon which the cropping bounding boxes are predicted by the proposed CCR method.

Image Cropping regression

Object Discovery via Cohesion Measurement

no code implementations 28 Apr 2017 Guanjun Guo, Hanzi Wang, Wan-Lei Zhao, Yan Yan, Xuelong Li

Based on the new Cohesion Measurement, a novel object discovery method is proposed to discover objects latent in an image by utilizing the eigenvectors of the affinity matrix.

Clustering Image Segmentation +5

Superpixel-based Two-view Deterministic Fitting for Multiple-structure Data

no code implementations 20 Jul 2016 Guobao Xiao, Hanzi Wang, Yan Yan, David Suter

The feature appearances are beneficial to reduce the computational complexity for deterministic fitting methods.

Model Selection Superpixels +1

Hypergraph Modelling for Geometric Model Fitting

no code implementations 11 Jul 2016 Guobao Xiao, Hanzi Wang, Taotao Lai, David Suter

The hypergraph, with large and "data-determined" degrees of hyperedges, can express the complex relationships between model hypotheses and data points.

Learning Hough Regression Models via Bridge Partial Least Squares for Object Detection

no code implementations 26 Mar 2016 Jianyu Tang, Hanzi Wang, Yan Yan

The appropriate value of the only parameter used in PLS (i.e., the number of latent components) can be determined by using a cross-validation procedure.

Clustering Object +3

An Effective Unconstrained Correlation Filter and Its Kernelization for Face Recognition

no code implementations 25 Mar 2016 Yan Yan, Hanzi Wang, Cuihua Li, Chenhui Yang, Bineng Zhong

In this paper, an effective unconstrained correlation filter called Unconstrained Optimal Origin Tradeoff Filter (UOOTF) is presented and applied to robust face recognition.

Face Recognition Robust Face Recognition

Quadratic Projection Based Feature Extraction with Its Application to Biometric Recognition

no code implementations 25 Mar 2016 Yan Yan, Hanzi Wang, Si Chen, Xiaochun Cao, David Zhang

This paper presents a novel quadratic projection based feature extraction framework, where a set of quadratic matrices is learned to distinguish each class from all other classes.

Mode-Seeking on Hypergraphs for Robust Geometric Model Fitting

no code implementations ICCV 2015 Hanzi Wang, Guobao Xiao, Yan Yan, David Suter

In addition to the mode seeking algorithm, MSH includes a similarity measure between vertices on the hypergraph and a weight-aware sampling technique.

Multi-Subregion Based Correlation Filter Bank for Robust Face Recognition

no code implementations 24 Mar 2016 Yan Yan, Hanzi Wang, David Suter

In this paper, we propose an effective feature extraction algorithm, called Multi-Subregion based Correlation Filter Bank (MS-CFB), for robust face recognition.

Face Recognition Robust Face Recognition

Robust Scene Text Recognition Using Sparse Coding based Features

no code implementations 29 Dec 2015 Da-Han Wang, Hanzi Wang, Dong Zhang, Jonathan Li, David Zhang

For character detection, we use the HSC features instead of using the Histograms of Oriented Gradients (HOG) features.

Scene Text Recognition

Distortion-driven Turbulence Effect Removal using Variational Model

no code implementations 17 Jan 2014 Yuan Xie, Wensheng Zhang, DaCheng Tao, Wenrui Hu, Yanyun Qu, Hanzi Wang

To solve, or at least reduce these effects, we propose a new scheme to recover a latent image from observed frames by integrating a new variational model and distortion-driven spatial-temporal kernel regression.


The Ordered Residual Kernel for Robust Motion Subspace Clustering

no code implementations NeurIPS 2009 Tat-Jun Chin, Hanzi Wang, David Suter

The kernel permits the application of well-established statistical learning methods for effective outlier rejection, automatic recovery of the number of motions and accurate segmentation of the point trajectories.

Clustering Computational Efficiency +2
