Search Results for author: Hanzi Wang

Found 56 papers, 11 papers with code

Frequency Domain Nuances Mining for Visible-Infrared Person Re-identification

no code implementations • 4 Jan 2024 • Yukang Zhang, Yang Lu, Yan Yan, Hanzi Wang, Xuelong Li

Specifically, we propose a novel Frequency Domain Nuances Mining (FDNM) method to explore the cross-modality frequency domain information, which mainly includes an amplitude guided phase (AGP) module and an amplitude nuances mining (ANM) module.

Face Recognition Person Re-Identification

Paper
Add Code

Federated Learning with Extremely Noisy Clients via Negative Distillation

1 code implementation • 20 Dec 2023 • Yang Lu, Lin Chen, Yonggang Zhang, Yiliang Zhang, Bo Han, Yiu-ming Cheung, Hanzi Wang

The model trained on noisy labels serves as a `bad teacher' in knowledge distillation, aiming to decrease the risk of providing incorrect information.

Federated Learning Knowledge Distillation

Paper
Code

Spatial-Contextual Discrepancy Information Compensation for GAN Inversion

1 code implementation • 12 Dec 2023 • Ziqiang Zhang, Yan Yan, Jing-Hao Xue, Hanzi Wang

SDIC follows a "compensate-and-edit" paradigm and successfully bridges the gap in image details between the original image and the reconstructed/edited image.

Paper
Code

Towards Unseen Triples: Effective Text-Image-joint Learning for Scene Graph Generation

no code implementations • 23 Jun 2023 • Qianji Di, Wenxi Ma, Zhongang Qi, Tianxiang Hou, Ying Shan, Hanzi Wang

In this work, we propose a Text-Image-joint Scene Graph Generation (TISGG) model to resolve the unseen triples and improve the generalisation capability of the SGG models.

Graph Generation Scene Graph Generation +1

Paper
Add Code

PARFormer: Transformer-based Multi-Task Network for Pedestrian Attribute Recognition

1 code implementation • 14 Apr 2023 • Xinwen Fan, Yukang Zhang, Yang Lu, Hanzi Wang

Pedestrian attribute recognition (PAR) has received increasing attention because of its wide application in video surveillance and pedestrian analysis.

Attribute Data Augmentation +1

Paper
Code

Long-Tailed Visual Recognition via Self-Heterogeneous Integration with Knowledge Excavation

1 code implementation • CVPR 2023 • Yan Jin, Mengke Li, Yang Lu, Yiu-ming Cheung, Hanzi Wang

To address this problem, state-of-the-art methods usually adopt a mixture of experts (MoE) to focus on different parts of the long-tailed distribution.

Transfer Learning

Paper
Code

Personalized Federated Learning on Long-Tailed Data via Adversarial Feature Augmentation

1 code implementation • 27 Mar 2023 • Yang Lu, Pinxin Qian, Gang Huang, Hanzi Wang

Personalized Federated Learning (PFL) aims to learn personalized models for each client based on the knowledge across all clients in a privacy-preserving manner.

Personalized Federated Learning Privacy Preserving

Paper
Code

MRCN: A Novel Modality Restitution and Compensation Network for Visible-Infrared Person Re-identification

no code implementations • 26 Mar 2023 • Yukang Zhang, Yan Yan, Jie Li, Hanzi Wang

Furthermore, to better disentangle the modality-relevant features and the modality-irrelevant features, we propose a novel Center-Quadruplet Causal (CQC) loss to encourage the network to effectively learn the modality-relevant features and the modality-irrelevant features.

Person Re-Identification

Paper
Add Code

Diverse Embedding Expansion Network and Low-Light Cross-Modality Benchmark for Visible-Infrared Person Re-identification

1 code implementation • CVPR 2023 • Yukang Zhang, Hanzi Wang

The proposed DEEN can effectively generate diverse embeddings to learn the informative feature representations and reduce the modality discrepancy between the VIS and IR images.

Ranked #1 on Cross-Modal Person Re-Identification on SYSU-MM01

Cross-Modal Person Re-Identification

Paper
Code

Federated Semi-Supervised Learning with Annotation Heterogeneity

no code implementations • 4 Mar 2023 • Xinyi Shang, Gang Huang, Yang Lu, Jian Lou, Bo Han, Yiu-ming Cheung, Hanzi Wang

Federated Semi-Supervised Learning (FSSL) aims to learn a global model from different clients in an environment with both labeled and unlabeled data.

Paper
Add Code

Label-Noise Learning with Intrinsically Long-Tailed Data

1 code implementation • ICCV 2023 • Yang Lu, Yiliang Zhang, Bo Han, Yiu-ming Cheung, Hanzi Wang

In this case, it is hard to distinguish clean samples from noisy samples on the intrinsic tail classes with the unknown intrinsic class distribution.

Paper
Code

DPTNet: A Dual-Path Transformer Architecture for Scene Text Detection

no code implementations • 21 Aug 2022 • Jingyu Lin, Jie Jiang, Yan Yan, Chunchao Guo, Hongfa Wang, Wei Liu, Hanzi Wang

We further propose a parallel design that integrates the convolutional network with a powerful self-attention mechanism to provide complementary clues between the attention path and convolutional path.

Scene Text Detection Text Detection

Paper
Add Code

Learn-to-Decompose: Cascaded Decomposition Network for Cross-Domain Few-Shot Facial Expression Recognition

1 code implementation • 16 Jul 2022 • Xinyi Zou, Yan Yan, Jing-Hao Xue, Si Chen, Hanzi Wang

Extensive experiments on both in-the-lab and in-the-wild compound expression datasets demonstrate the superiority of our proposed CDNet against several state-of-the-art FSL methods.

cross-domain few-shot learning Facial Expression Recognition +1

Paper
Code

FEDIC: Federated Learning on Non-IID and Long-Tailed Data via Calibrated Distillation

1 code implementation • 30 Apr 2022 • Xinyi Shang, Yang Lu, Yiu-ming Cheung, Hanzi Wang

Federated learning provides a privacy guarantee for generating good deep learning models on distributed clients with different kinds of data.

Federated Learning Long-tail Learning

Paper
Code

Federated Learning on Heterogeneous and Long-Tailed Data via Classifier Re-Training with Federated Features

2 code implementations • 28 Apr 2022 • Xinyi Shang, Yang Lu, Gang Huang, Hanzi Wang

Experiments on several benchmark datasets show that the proposed CReFF is an effective solution to obtain a promising FL model under heterogeneous and long-tailed data.

Federated Learning Privacy Preserving

Paper
Code

Deep Multi-Branch Aggregation Network for Real-Time Semantic Segmentation in Street Scenes

no code implementations • 8 Mar 2022 • Xi Weng, Yan Yan, Genshun Dong, Chang Shu, Biao Wang, Hanzi Wang, Ji Zhang

This shows that DMA-Net provides a good tradeoff between segmentation quality and speed for semantic segmentation in street scenes.

Real-Time Semantic Segmentation Segmentation

Paper
Add Code

Stage-Aware Feature Alignment Network for Real-Time Semantic Segmentation of Street Scenes

no code implementations • 8 Mar 2022 • Xi Weng, Yan Yan, Si Chen, Jing-Hao Xue, Hanzi Wang

In this paper, we present a novel Stage-aware Feature Alignment Network (SFANet) based on the encoder-decoder structure for real-time semantic segmentation of street scenes.

Real-Time Semantic Segmentation Segmentation

Paper
Add Code

When Facial Expression Recognition Meets Few-Shot Learning: A Joint and Alternate Learning Framework

no code implementations • 18 Jan 2022 • Xinyi Zou, Yan Yan, Jing-Hao Xue, Si Chen, Hanzi Wang

To alleviate the problem of limited base classes in our FER task, we propose a novel Emotion Guided Similarity Network (EGS-Net), consisting of an emotion branch and a similarity branch, based on a two-stage learning framework.

cross-domain few-shot learning Facial Expression Recognition +1

Paper
Add Code

Event Data Association via Robust Model Fitting for Event-based Object Tracking

no code implementations • 25 Oct 2021 • Haosheng Chen, Shuyuan Lin, Yan Yan, Hanzi Wang, Xinbo Gao

In EDA, we first asynchronously fuse the event data based on its information entropy.

Model Selection Object Tracking

Paper
Add Code

TSGB: Target-Selective Gradient Backprop for Probing CNN Visual Saliency

1 code implementation • 11 Oct 2021 • Lin Cheng, Pengfei Fang, Yanjie Liang, Liao Zhang, Chunhua Shen, Hanzi Wang

Inspired by those observations, we propose a novel visual saliency method, termed Target-Selective Gradient Backprop (TSGB), which leverages rectification operations to effectively emphasize target classes and further efficiently propagate the saliency to the image space, thereby generating target-selective and fine-grained saliency maps.

Paper
Code

Learning Spatial-Semantic Relationship for Facial Attribute Recognition With Limited Labeled Data

no code implementations • CVPR 2021 • Ying Shu, Yan Yan, Si Chen, Jing-Hao Xue, Chunhua Shen, Hanzi Wang

First, three auxiliary tasks, consisting of a Patch Rotation Task (PRT), a Patch Segmentation Task (PST), and a Patch Classification Task (PCT), are jointly developed to learn the spatial-semantic relationship from large-scale unlabeled facial data.

Ranked #3 on Facial Attribute Classification on LFWA

Attribute Facial Attribute Classification +1

Paper
Add Code

Feature Decomposition and Reconstruction Learning for Effective Facial Expression Recognition

no code implementations • CVPR 2021 • Delian Ruan, Yan Yan, Shenqi Lai, Zhenhua Chai, Chunhua Shen, Hanzi Wang

In this paper, we propose a novel Feature Decomposition and Reconstruction Learning (FDRL) method for effective facial expression recognition.

Facial Expression Recognition Facial Expression Recognition (FER) +1

Paper
Add Code

Hierarchical Representation via Message Propagation for Robust Model Fitting

no code implementations • 29 Dec 2020 • Shuyuan Lin, Xing Wang, Guobao Xiao, Yan Yan, Hanzi Wang

In this paper, we propose a novel hierarchical representation via message propagation (HRMP) method for robust model fitting, which simultaneously takes advantages of both the consensus analysis and the preference analysis to estimate the parameters of multiple model instances from data corrupted by outliers, for robust model fitting.

Paper
Add Code

Robust Visual Tracking via Statistical Positive Sample Generation and Gradient Aware Learning

no code implementations • 9 Nov 2020 • Lijian Lin, Haosheng Chen, Yanjie Liang, Yan Yan, Hanzi Wang

In this paper, we propose a robust tracking method via Statistical Positive sample generation and Gradient Aware learning (SPGA) to address the above two limitations.

Visual Tracking

Paper
Add Code

Dual Semantic Fusion Network for Video Object Detection

no code implementations • 16 Sep 2020 • Lijian Lin, Haosheng Chen, Honglun Zhang, Jun Liang, Yu Li, Ying Shan, Hanzi Wang

Video object detection is a tough task due to the deteriorated quality of video sequences captured under complex environments.

Object object-detection +2

Paper
Add Code

Correlation filter tracking with adaptive proposal selection for accurate scale estimation

no code implementations • 14 Jul 2020 • Luo Xiong, Yanjie Liang, Yan Yan, Hanzi Wang

In this paper, we propose an adaptive proposal selection algorithm which can generate a small number of high-quality proposals to handle the problem of scale variations for visual object tracking.

Visual Object Tracking

Paper
Add Code

Real-Time High-Performance Semantic Image Segmentation of Urban Street Scenes

no code implementations • 11 Mar 2020 • Genshun Dong, Yan Yan, Chunhua Shen, Hanzi Wang

Meanwhile, a Spatial detail-Preserving Network (SPN) with shallow convolutional layers is designed to generate high-resolution feature maps preserving the detailed spatial information.

Image Segmentation Segmentation +2

Paper
Add Code

Learning Object Scale With Click Supervision for Object Detection

no code implementations • 20 Feb 2020 • Liao Zhang, Yan Yan, Lin Cheng, Hanzi Wang

Finally, we fuse these CAMs together to generate pseudoground-truths and train a fully-supervised object detector withthese ground-truths.

Object object-detection +1

Paper
Add Code

End-to-end Learning of Object Motion Estimation from Retinal Events for Event-based Object Tracking

no code implementations • 14 Feb 2020 • Haosheng Chen, David Suter, Qiangqiang Wu, Hanzi Wang

We feed the sequence of TSLTD frames to a novel Retinal Motion Regression Network (RMRNet) to perform an end-to-end 5-DoF object motion regression.

Motion Estimation Object +2

Paper
Add Code

Asynchronous Tracking-by-Detection on Adaptive Time Surfaces for Event-based Object Tracking

no code implementations • 13 Feb 2020 • Haosheng Chen, Qiangqiang Wu, Yanjie Liang, Xinbo Gao, Hanzi Wang

To achieve this goal, we present an Adaptive Time-Surface with Linear Time Decay (ATSLTD) event-to-frame conversion algorithm, which asynchronously and effectively warps the spatio-temporal information of asynchronous retinal events to a sequence of ATSLTD frames with clear object contours.

Object Object Tracking

Paper
Add Code

Hypergraph Optimization for Multi-structural Geometric Model Fitting

no code implementations • 13 Feb 2020 • Shuyuan Lin, Guobao Xiao, Yan Yan, David Suter, Hanzi Wang

Recently, some hypergraph-based methods have been proposed to deal with the problem of model fitting in computer vision, mainly due to the superior capability of hypergraph to represent the complex relationship between data points.

Clustering

Paper
Add Code

Deep Multi-task Multi-label CNN for Effective Facial Attribute Classification

no code implementations • 10 Feb 2020 • Longbiao Mao, Yan Yan, Jing-Hao Xue, Hanzi Wang

Two different network architectures are respectively designed to extract features for two groups of attributes, and a novel dynamic weighting scheme is proposed to automatically assign the loss weight to each facial attribute during training.

Attribute Face Detection +5

Paper
Add Code

Joint Deep Learning of Facial Expression Synthesis and Recognition

no code implementations • 6 Feb 2020 • Yan Yan, Ying Huang, Si Chen, Chunhua Shen, Hanzi Wang

Firstly, a facial expression synthesis generative adversarial network (FESGAN) is pre-trained to generate facial images with different facial expressions.

Facial Expression Recognition Facial Expression Recognition (FER) +1

Paper
Add Code

Hallucinated Adversarial Learning for Robust Visual Tracking

no code implementations • 17 Jun 2019 • Qiangqiang Wu, Zhihui Chen, Lin Cheng, Yan Yan, Bo Li, Hanzi Wang

Incorporating such an ability to hallucinate diverse new samples of the tracked instance can help the trackers alleviate the over-fitting problem in the low-data tracking regime.

Visual Tracking

Paper
Add Code

DSNet: Deep and Shallow Feature Learning for Efficient Visual Tracking

no code implementations • 6 Nov 2018 • Qiangqiang Wu, Yan Yan, Yanjie Liang, Yi Liu, Hanzi Wang

In recent years, Discriminative Correlation Filter (DCF) based tracking methods have achieved great success in visual tracking.

Image Classification Visual Tracking

Paper
Add Code

Multi-task Learning of Cascaded CNN for Facial Attribute Classification

no code implementations • 3 May 2018 • Ni Zhuang, Yan Yan, Si Chen, Hanzi Wang

In order to address the above problems, we propose a novel multi-task learning of cas- caded convolutional neural network method, termed MCFA, for predicting multiple facial attributes simultaneously.

Attribute Classification +5

Paper
Add Code

Multi-label Learning Based Deep Transfer Neural Network for Facial Attribute Classification

no code implementations • 3 May 2018 • Ni Zhuang, Yan Yan, Si Chen, Hanzi Wang, Chunhua Shen

To address the above problem, we propose a novel deep transfer neural network method based on multi-label learning for facial attribute classification, termed FMTNet, which consists of three sub-networks: the Face detection Network (FNet), the Multi-label learning Network (MNet) and the Transfer learning Network (TNet).

Attribute Classification +6

Paper
Add Code

Superpixel-guided Two-view Deterministic Geometric Model Fitting

no code implementations • 3 May 2018 • Guobao Xiao, Hanzi Wang, Yan Yan, David Suter

Specifically, SDF includes three main parts: a deterministic sampling algorithm, a model hypothesis updating strategy and a novel model selection algorithm.

Model Selection Superpixels +1

Paper
Add Code

A Fast Face Detection Method via Convolutional Neural Network

no code implementations • 27 Mar 2018 • Guanjun Guo, Hanzi Wang, Yan Yan, Jin Zheng, Bo Li

Current face or object detection methods via convolutional neural network (such as OverFeat, R-CNN and DenseNet) explicitly extract multi-scale features based on an image pyramid.

Face Detection object-detection +1

Paper
Add Code

A New Target-specific Object Proposal Generation Method for Visual Tracking

no code implementations • 27 Mar 2018 • Guanjun Guo, Hanzi Wang, Yan Yan, Hong-Yuan Mark Liao, Bo Li

Then, we apply the proposed TOPG method to the task of visual tracking and propose a TOPG-based tracker (called as TOPGT), where TOPG is used as a sample selection strategy to select a small number of high-quality target candidates from the generated object proposals.

Object Object Proposal Generation +1

Paper
Add Code

Single Image Super-Resolution via Cascaded Multi-Scale Cross Network

no code implementations • 24 Feb 2018 • Yanting Hu, Xinbo Gao, Jie Li, Yuanfei Huang, Hanzi Wang

To improve information flow and to capture sufficient knowledge for reconstructing the high-frequency details, we propose a cascaded multi-scale cross network (CMSC) in which a sequence of subnetworks is cascaded to infer high resolution features in a coarse-to-fine manner.

Image Reconstruction Image Super-Resolution

Paper
Add Code

Searching for Representative Modes on Hypergraphs for Robust Geometric Model Fitting

no code implementations • 4 Feb 2018 • Hanzi Wang, Guobao Xiao, Yan Yan, David Suter

We cast the task of geometric model fitting as a representative mode-seeking problem on hypergraphs.

Paper
Add Code

Automatic Image Cropping for Visual Aesthetic Enhancement Using Deep Neural Networks and Cascaded Regression

no code implementations • 25 Dec 2017 • Guanjun Guo, Hanzi Wang, Chunhua Shen, Yan Yan, Hong-Yuan Mark Liao

The deep CNN model is then designed to extract features from several image cropping datasets, upon which the cropping bounding boxes are predicted by the proposed CCR method.

Image Cropping regression

Paper
Add Code

Object Discovery via Cohesion Measurement

no code implementations • 28 Apr 2017 • Guanjun Guo, Hanzi Wang, Wan-Lei Zhao, Yan Yan, Xuelong. Li

Based on the new Cohesion Measurement, a novel object discovery method is proposed to discover objects latent in an image by utilizing the eigenvectors of the affinity matrix.

Clustering Image Segmentation +5

Paper
Add Code

Revisiting Graph Construction for Fast Image Segmentation

no code implementations • 18 Feb 2017 • Zizhao Zhang, Fuyong Xing, Hanzi Wang, Yan Yan, Ying Huang, Xiaoshuang Shi, Lin Yang

In this paper, we propose a simple but effective method for fast image segmentation.

Clustering graph construction +4

Paper
Add Code

Superpixel-based Two-view Deterministic Fitting for Multiple-structure Data

no code implementations • 20 Jul 2016 • Guobao Xiao, Hanzi Wang, Yan Yan, David Suter

The feature appearances are beneficial to reduce the computational complexity for deterministic fitting methods.

Model Selection Superpixels +1

Paper
Add Code

Hypergraph Modelling for Geometric Model Fitting

no code implementations • 11 Jul 2016 • Guobao Xiao, Hanzi Wang, Taotao Lai, David Suter

The hypergraph, with large and "data-determined" degrees of hyperedges, can express the complex relationships between model hypotheses and data points.

Paper
Add Code

Learning Hough Regression Models via Bridge Partial Least Squares for Object Detection

no code implementations • 26 Mar 2016 • Jianyu Tang, Hanzi Wang, Yan Yan

And the appropriate value of the only parameter used in PLS (i. e., the number of latent components) can be determined by using a cross-validation procedure.

Clustering Object +3

Paper
Add Code

An Effective Unconstrained Correlation Filter and Its Kernelization for Face Recognition

no code implementations • 25 Mar 2016 • Yan Yan, Hanzi Wang, Cuihua Li, Chenhui Yang, Bineng Zhong

In this paper, an effective unconstrained correlation filter called Uncon- strained Optimal Origin Tradeoff Filter (UOOTF) is presented and applied to robust face recognition.

Face Recognition Robust Face Recognition

Paper
Add Code

Mode-Seeking on Hypergraphs for Robust Geometric Model Fitting

no code implementations • ICCV 2015 • Hanzi Wang, Guobao Xiao, Yan Yan, David Suter

In addition to the mode seeking algorithm, MSH includes a similarity measure between vertices on the hypergraph and a weight-aware sampling technique.

Paper
Add Code

Quadratic Projection Based Feature Extraction with Its Application to Biometric Recognition

no code implementations • 25 Mar 2016 • Yan Yan, Hanzi Wang, Si Chen, Xiaochun Cao, David Zhang

This paper presents a novel quadratic projection based feature extraction framework, where a set of quadratic matrices is learned to distinguish each class from all other classes.

Paper
Add Code

Multi-Subregion Based Correlation Filter Bank for Robust Face Recognition

no code implementations • 24 Mar 2016 • Yan Yan, Hanzi Wang, David Suter

In this paper, we propose an effective feature extraction algorithm, called Multi-Subregion based Correlation Filter Bank (MS-CFB), for robust face recognition.

Face Recognition Robust Face Recognition

Paper
Add Code

Robust Scene Text Recognition Using Sparse Coding based Features

no code implementations • 29 Dec 2015 • Da-Han Wang, Hanzi Wang, Dong Zhang, Jonathan Li, David Zhang

For character detection, we use the HSC features instead of using the Histograms of Oriented Gradients (HOG) features.

Scene Text Recognition

Paper
Add Code

Efficient Semidefinite Spectral Clustering via Lagrange Duality

no code implementations • 22 Feb 2014 • Yan Yan, Chunhua Shen, Hanzi Wang

constraint for spectral clustering.

Clustering

Paper
Add Code

Distortion-driven Turbulence Effect Removal using Variational Model

no code implementations • 17 Jan 2014 • Yuan Xie, Wensheng Zhang, DaCheng Tao, Wenrui Hu, Yanyun Qu, Hanzi Wang

To solve, or at least reduce these effects, we propose a new scheme to recover a latent image from observed frames by integrating a new variational model and distortion-driven spatial-temporal kernel regression.

regression

Paper
Add Code

The Ordered Residual Kernel for Robust Motion Subspace Clustering

no code implementations • NeurIPS 2009 • Tat-Jun Chin, Hanzi Wang, David Suter

The kernel permits the application of well-established statistical learning methods for effective outlier rejection, automatic recovery of the number of motions and accurate segmentation of the point trajectories.

Clustering Computational Efficiency +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.