no code implementations • 3 Jan 2025 • Hu Ding, Yan Yan, Yang Lu, Jing-Hao Xue, Hanzi Wang
This indicates the superiority of hypergraph modeling for uncertainty estimation and label refinement on the personalized federated FER task.
Facial Expression Recognition Facial Expression Recognition (FER) +1
1 code implementation • 3 Jan 2025 • Ruikang Chen, Yan Yan, Jing-Hao Xue, Yang Lu, Hanzi Wang
However, obtaining correct annotations is extremely hard if not impossible for large-scale X-ray images, where item overlapping is ubiquitous. As a result, X-ray images are easily contaminated with noisy annotations, leading to performance deterioration of existing methods. In this paper, we address the challenging problem of training a robust prohibited item detector under noisy annotations (including both category noise and bounding box noise) from a novel perspective of data augmentation, and propose an effective label-aware mixed patch paste augmentation method (Mix-Paste).
no code implementations • 18 Nov 2024 • Hanyu Guo, Wanchuan Yu, Suzhou Que, Kaiwen Du, Yan Yan, Hanzi Wang
In this paper, we propose a novel Dual Motion-Guided Attention Learning method (called DMGAL) for few-shot action recognition, aiming to learn the spatio-temporal relationships from the video-specific to the task-specific level.
no code implementations • 29 Apr 2024 • Liyuan Wang, Yan Jin, Zhen Chen, Jinlin Wu, Mengke Li, Yang Lu, Hanzi Wang
Vision-language pre-training has enabled deep models to make a huge step forward in generalizing across unseen domains.
1 code implementation • 23 Apr 2024 • Chenxing Hong, Yan Jin, Zhiqi Kang, Yizhou Chen, Mengke Li, Yang Lu, Hanzi Wang
We find that imbalanced tasks significantly challenge the capability of models to control the trade-off between stability and plasticity from the perspective of recent prompt-based continual learning methods.
no code implementations • 4 Jan 2024 • Yukang Zhang, Yang Lu, Yan Yan, Hanzi Wang, Xuelong Li
Specifically, we propose a novel Frequency Domain Nuances Mining (FDNM) method to explore the cross-modality frequency domain information, which mainly includes an amplitude guided phase (AGP) module and an amplitude nuances mining (ANM) module.
1 code implementation • 20 Dec 2023 • Yang Lu, Lin Chen, Yonggang Zhang, Yiliang Zhang, Bo Han, Yiu-ming Cheung, Hanzi Wang
The model trained on noisy labels serves as a 'bad teacher' in knowledge distillation, aiming to decrease the risk of providing incorrect information.
1 code implementation • 12 Dec 2023 • Ziqiang Zhang, Yan Yan, Jing-Hao Xue, Hanzi Wang
SDIC follows a "compensate-and-edit" paradigm and successfully bridges the gap in image details between the original image and the reconstructed/edited image.
no code implementations • 23 Jun 2023 • Qianji Di, Wenxi Ma, Zhongang Qi, Tianxiang Hou, Ying Shan, Hanzi Wang
In this work, we propose a Text-Image-joint Scene Graph Generation (TISGG) model to resolve the unseen triples and improve the generalisation capability of the SGG models.
1 code implementation • 14 Apr 2023 • Xinwen Fan, Yukang Zhang, Yang Lu, Hanzi Wang
Pedestrian attribute recognition (PAR) has received increasing attention because of its wide application in video surveillance and pedestrian analysis.
1 code implementation • CVPR 2023 • Yan Jin, Mengke Li, Yang Lu, Yiu-ming Cheung, Hanzi Wang
To address this problem, state-of-the-art methods usually adopt a mixture of experts (MoE) to focus on different parts of the long-tailed distribution.
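As a rough illustration of the mixture-of-experts idea mentioned in this abstract, the sketch below combines the class scores of several experts using softmax gating weights. It is a generic MoE combination, not the method of this paper; the names `softmax` and `combine_experts`, the gate scores, and the toy numbers are all assumptions made for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def combine_experts(expert_logits, gate_scores):
    """Weighted sum of per-expert class logits.

    expert_logits: one list of class logits per expert.
    gate_scores:   one unnormalised gate score per expert (hypothetical input).
    """
    weights = softmax(gate_scores)
    num_classes = len(expert_logits[0])
    return [
        sum(w * logits[c] for w, logits in zip(weights, expert_logits))
        for c in range(num_classes)
    ]


# Toy example: expert 0 favours class 0 (say, a head class),
# expert 1 favours class 1 (say, a tail class); the gate weighs them equally.
mixed = combine_experts([[2.0, 0.0], [0.0, 2.0]], gate_scores=[0.0, 0.0])
```

In practice the gate scores would come from a learned network conditioned on the input; here they are fixed constants to keep the sketch self-contained.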
1 code implementation • 27 Mar 2023 • Yang Lu, Pinxin Qian, Gang Huang, Hanzi Wang
Personalized Federated Learning (PFL) aims to learn personalized models for each client based on the knowledge across all clients in a privacy-preserving manner.
no code implementations • 26 Mar 2023 • Yukang Zhang, Yan Yan, Jie Li, Hanzi Wang
Furthermore, to better disentangle the modality-relevant features and the modality-irrelevant features, we propose a novel Center-Quadruplet Causal (CQC) loss to encourage the network to effectively learn the modality-relevant features and the modality-irrelevant features.
1 code implementation • CVPR 2023 • Yukang Zhang, Hanzi Wang
The proposed DEEN can effectively generate diverse embeddings to learn the informative feature representations and reduce the modality discrepancy between the VIS and IR images.
Ranked #1 on Cross-Modal Person Re-Identification on SYSU-MM01
no code implementations • 4 Mar 2023 • Xinyi Shang, Gang Huang, Yang Lu, Jian Lou, Bo Han, Yiu-ming Cheung, Hanzi Wang
Federated Semi-Supervised Learning (FSSL) aims to learn a global model from different clients in an environment with both labeled and unlabeled data.
no code implementations • 21 Aug 2022 • Jingyu Lin, Jie Jiang, Yan Yan, Chunchao Guo, Hongfa Wang, Wei Liu, Hanzi Wang
We further propose a parallel design that integrates the convolutional network with a powerful self-attention mechanism to provide complementary clues between the attention path and convolutional path.
1 code implementation • ICCV 2023 • Yang Lu, Yiliang Zhang, Bo Han, Yiu-ming Cheung, Hanzi Wang
In this case, it is hard to distinguish clean samples from noisy samples on the intrinsic tail classes with the unknown intrinsic class distribution.
1 code implementation • 16 Jul 2022 • Xinyi Zou, Yan Yan, Jing-Hao Xue, Si Chen, Hanzi Wang
Extensive experiments on both in-the-lab and in-the-wild compound expression datasets demonstrate the superiority of our proposed CDNet against several state-of-the-art FSL methods.
cross-domain few-shot learning Facial Expression Recognition +1
1 code implementation • 30 Apr 2022 • Xinyi Shang, Yang Lu, Yiu-ming Cheung, Hanzi Wang
Federated learning provides a privacy guarantee for generating good deep learning models on distributed clients with different kinds of data.
2 code implementations • 28 Apr 2022 • Xinyi Shang, Yang Lu, Gang Huang, Hanzi Wang
Experiments on several benchmark datasets show that the proposed CReFF is an effective solution to obtain a promising FL model under heterogeneous and long-tailed data.
no code implementations • 8 Mar 2022 • Xi Weng, Yan Yan, Genshun Dong, Chang Shu, Biao Wang, Hanzi Wang, Ji Zhang
This shows that DMA-Net provides a good tradeoff between segmentation quality and speed for semantic segmentation in street scenes.
no code implementations • 8 Mar 2022 • Xi Weng, Yan Yan, Si Chen, Jing-Hao Xue, Hanzi Wang
In this paper, we present a novel Stage-aware Feature Alignment Network (SFANet) based on the encoder-decoder structure for real-time semantic segmentation of street scenes.
no code implementations • 18 Jan 2022 • Xinyi Zou, Yan Yan, Jing-Hao Xue, Si Chen, Hanzi Wang
To alleviate the problem of limited base classes in our FER task, we propose a novel Emotion Guided Similarity Network (EGS-Net), consisting of an emotion branch and a similarity branch, based on a two-stage learning framework.
cross-domain few-shot learning Facial Expression Recognition +1
no code implementations • 25 Oct 2021 • Haosheng Chen, Shuyuan Lin, Yan Yan, Hanzi Wang, Xinbo Gao
In EDA, we first asynchronously fuse the event data based on its information entropy.
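The abstract refers to the information entropy of event data. As a minimal sketch, the Shannon entropy of a collection of discrete values (for example, per-pixel event counts) can be computed as below. How EDA actually uses this quantity to fuse events is not stated in this excerpt; `shannon_entropy` and the sample patches are illustrative assumptions.

```python
import math
from collections import Counter


def shannon_entropy(values):
    """Shannon entropy (in bits) of the empirical distribution of `values`."""
    counts = Counter(values)
    total = len(values)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


# Hypothetical per-pixel event counts for two small patches.
flat_patch = [0, 0, 0, 0]  # a single repeated value carries no information
busy_patch = [0, 1, 2, 3]  # four equally likely values

flat_entropy = shannon_entropy(flat_patch)
busy_entropy = shannon_entropy(busy_patch)
```

A patch with more varied event counts yields a higher entropy, which is one plausible signal for deciding when enough events have accumulated.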
1 code implementation • 11 Oct 2021 • Lin Cheng, Pengfei Fang, Yanjie Liang, Liao Zhang, Chunhua Shen, Hanzi Wang
Inspired by those observations, we propose a novel visual saliency method, termed Target-Selective Gradient Backprop (TSGB), which leverages rectification operations to effectively emphasize target classes and further efficiently propagate the saliency to the image space, thereby generating target-selective and fine-grained saliency maps.
no code implementations • CVPR 2021 • Ying Shu, Yan Yan, Si Chen, Jing-Hao Xue, Chunhua Shen, Hanzi Wang
First, three auxiliary tasks, consisting of a Patch Rotation Task (PRT), a Patch Segmentation Task (PST), and a Patch Classification Task (PCT), are jointly developed to learn the spatial-semantic relationship from large-scale unlabeled facial data.
Ranked #3 on Facial Attribute Classification on LFWA
no code implementations • CVPR 2021 • Delian Ruan, Yan Yan, Shenqi Lai, Zhenhua Chai, Chunhua Shen, Hanzi Wang
In this paper, we propose a novel Feature Decomposition and Reconstruction Learning (FDRL) method for effective facial expression recognition.
Facial Expression Recognition Facial Expression Recognition (FER) +1
no code implementations • 29 Dec 2020 • Shuyuan Lin, Xing Wang, Guobao Xiao, Yan Yan, Hanzi Wang
In this paper, we propose a novel hierarchical representation via message propagation (HRMP) method for robust model fitting, which simultaneously takes advantage of both the consensus analysis and the preference analysis to estimate the parameters of multiple model instances from data corrupted by outliers.
no code implementations • 9 Nov 2020 • Lijian Lin, Haosheng Chen, Yanjie Liang, Yan Yan, Hanzi Wang
In this paper, we propose a robust tracking method via Statistical Positive sample generation and Gradient Aware learning (SPGA) to address the above two limitations.
no code implementations • 16 Sep 2020 • Lijian Lin, Haosheng Chen, Honglun Zhang, Jun Liang, Yu Li, Ying Shan, Hanzi Wang
Video object detection is a tough task due to the deteriorated quality of video sequences captured under complex environments.
no code implementations • 14 Jul 2020 • Luo Xiong, Yanjie Liang, Yan Yan, Hanzi Wang
In this paper, we propose an adaptive proposal selection algorithm which can generate a small number of high-quality proposals to handle the problem of scale variations for visual object tracking.
no code implementations • 11 Mar 2020 • Genshun Dong, Yan Yan, Chunhua Shen, Hanzi Wang
Meanwhile, a Spatial detail-Preserving Network (SPN) with shallow convolutional layers is designed to generate high-resolution feature maps preserving the detailed spatial information.
no code implementations • 20 Feb 2020 • Liao Zhang, Yan Yan, Lin Cheng, Hanzi Wang
Finally, we fuse these CAMs together to generate pseudo ground-truths and train a fully-supervised object detector with these ground-truths.
no code implementations • 14 Feb 2020 • Haosheng Chen, David Suter, Qiangqiang Wu, Hanzi Wang
We feed the sequence of TSLTD frames to a novel Retinal Motion Regression Network (RMRNet) to perform an end-to-end 5-DoF object motion regression.
no code implementations • 13 Feb 2020 • Haosheng Chen, Qiangqiang Wu, Yanjie Liang, Xinbo Gao, Hanzi Wang
To achieve this goal, we present an Adaptive Time-Surface with Linear Time Decay (ATSLTD) event-to-frame conversion algorithm, which asynchronously and effectively warps the spatio-temporal information of asynchronous retinal events to a sequence of ATSLTD frames with clear object contours.
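To illustrate the general idea of a time surface with linear time decay, the sketch below turns a list of events into a frame in which each pixel's intensity falls off linearly with the age of its most recent event. This is a generic, non-adaptive version that ignores event polarity; the adaptive behaviour of ATSLTD is not described in this excerpt, and the event layout `(x, y, t)`, the function name, and the `window` parameter are assumptions.

```python
def linear_decay_time_surface(events, width, height, t_ref, window):
    """Build a frame from events (x, y, t), using only events with t <= t_ref.

    Each pixel holds max(0, 1 - age / window), where age is the time since
    the most recent event at that pixel. Pixels with no event stay at 0.
    """
    # Track the latest timestamp seen at every pixel.
    latest = [[None] * width for _ in range(height)]
    for x, y, t in events:
        if t <= t_ref and (latest[y][x] is None or t > latest[y][x]):
            latest[y][x] = t

    # Convert timestamps to intensities with a linear decay.
    frame = [[0.0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            if latest[y][x] is not None:
                age = t_ref - latest[y][x]
                frame[y][x] = max(0.0, 1.0 - age / window)
    return frame


# Three events on a 2x2 sensor; the pixel at x=0, y=0 fires twice.
events = [(0, 0, 0.0), (0, 0, 8.0), (1, 1, 5.0)]
frame = linear_decay_time_surface(events, width=2, height=2, t_ref=10.0, window=10.0)
```

Recent events appear bright and older ones fade, so moving edges leave a short trail that outlines the object contour.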
no code implementations • 13 Feb 2020 • Shuyuan Lin, Guobao Xiao, Yan Yan, David Suter, Hanzi Wang
Recently, some hypergraph-based methods have been proposed to deal with the problem of model fitting in computer vision, mainly due to the superior capability of hypergraph to represent the complex relationship between data points.
no code implementations • 10 Feb 2020 • Longbiao Mao, Yan Yan, Jing-Hao Xue, Hanzi Wang
Two different network architectures are respectively designed to extract features for two groups of attributes, and a novel dynamic weighting scheme is proposed to automatically assign the loss weight to each facial attribute during training.
no code implementations • 6 Feb 2020 • Yan Yan, Ying Huang, Si Chen, Chunhua Shen, Hanzi Wang
Firstly, a facial expression synthesis generative adversarial network (FESGAN) is pre-trained to generate facial images with different facial expressions.
no code implementations • 17 Jun 2019 • Qiangqiang Wu, Zhihui Chen, Lin Cheng, Yan Yan, Bo Li, Hanzi Wang
Incorporating such an ability to hallucinate diverse new samples of the tracked instance can help the trackers alleviate the over-fitting problem in the low-data tracking regime.
no code implementations • 6 Nov 2018 • Qiangqiang Wu, Yan Yan, Yanjie Liang, Yi Liu, Hanzi Wang
In recent years, Discriminative Correlation Filter (DCF) based tracking methods have achieved great success in visual tracking.
no code implementations • 3 May 2018 • Guobao Xiao, Hanzi Wang, Yan Yan, David Suter
Specifically, SDF includes three main parts: a deterministic sampling algorithm, a model hypothesis updating strategy and a novel model selection algorithm.
no code implementations • 3 May 2018 • Ni Zhuang, Yan Yan, Si Chen, Hanzi Wang
In order to address the above problems, we propose a novel multi-task learning of cascaded convolutional neural network method, termed MCFA, for predicting multiple facial attributes simultaneously.
no code implementations • 3 May 2018 • Ni Zhuang, Yan Yan, Si Chen, Hanzi Wang, Chunhua Shen
To address the above problem, we propose a novel deep transfer neural network method based on multi-label learning for facial attribute classification, termed FMTNet, which consists of three sub-networks: the Face detection Network (FNet), the Multi-label learning Network (MNet) and the Transfer learning Network (TNet).
no code implementations • 27 Mar 2018 • Guanjun Guo, Hanzi Wang, Yan Yan, Jin Zheng, Bo Li
Current face or object detection methods via convolutional neural network (such as OverFeat, R-CNN and DenseNet) explicitly extract multi-scale features based on an image pyramid.
no code implementations • 27 Mar 2018 • Guanjun Guo, Hanzi Wang, Yan Yan, Hong-Yuan Mark Liao, Bo Li
Then, we apply the proposed TOPG method to the task of visual tracking and propose a TOPG-based tracker (called TOPGT), where TOPG is used as a sample selection strategy to select a small number of high-quality target candidates from the generated object proposals.
no code implementations • 24 Feb 2018 • Yanting Hu, Xinbo Gao, Jie Li, Yuanfei Huang, Hanzi Wang
To improve information flow and to capture sufficient knowledge for reconstructing the high-frequency details, we propose a cascaded multi-scale cross network (CMSC) in which a sequence of subnetworks is cascaded to infer high resolution features in a coarse-to-fine manner.
no code implementations • 4 Feb 2018 • Hanzi Wang, Guobao Xiao, Yan Yan, David Suter
We cast the task of geometric model fitting as a representative mode-seeking problem on hypergraphs.
no code implementations • 25 Dec 2017 • Guanjun Guo, Hanzi Wang, Chunhua Shen, Yan Yan, Hong-Yuan Mark Liao
The deep CNN model is then designed to extract features from several image cropping datasets, upon which the cropping bounding boxes are predicted by the proposed CCR method.
no code implementations • 28 Apr 2017 • Guanjun Guo, Hanzi Wang, Wan-Lei Zhao, Yan Yan, Xuelong Li
Based on the new Cohesion Measurement, a novel object discovery method is proposed to discover objects latent in an image by utilizing the eigenvectors of the affinity matrix.
no code implementations • 18 Feb 2017 • Zizhao Zhang, Fuyong Xing, Hanzi Wang, Yan Yan, Ying Huang, Xiaoshuang Shi, Lin Yang
In this paper, we propose a simple but effective method for fast image segmentation.
no code implementations • 20 Jul 2016 • Guobao Xiao, Hanzi Wang, Yan Yan, David Suter
The feature appearances are beneficial to reduce the computational complexity for deterministic fitting methods.
no code implementations • 11 Jul 2016 • Guobao Xiao, Hanzi Wang, Taotao Lai, David Suter
The hypergraph, with large and "data-determined" degrees of hyperedges, can express the complex relationships between model hypotheses and data points.
no code implementations • 26 Mar 2016 • Jianyu Tang, Hanzi Wang, Yan Yan
The appropriate value of the only parameter used in PLS (i.e., the number of latent components) can be determined by using a cross-validation procedure.
no code implementations • 25 Mar 2016 • Yan Yan, Hanzi Wang, Cuihua Li, Chenhui Yang, Bineng Zhong
In this paper, an effective unconstrained correlation filter called Unconstrained Optimal Origin Tradeoff Filter (UOOTF) is presented and applied to robust face recognition.
no code implementations • 25 Mar 2016 • Yan Yan, Hanzi Wang, Si Chen, Xiaochun Cao, David Zhang
This paper presents a novel quadratic projection based feature extraction framework, where a set of quadratic matrices is learned to distinguish each class from all other classes.
no code implementations • ICCV 2015 • Hanzi Wang, Guobao Xiao, Yan Yan, David Suter
In addition to the mode seeking algorithm, MSH includes a similarity measure between vertices on the hypergraph and a weight-aware sampling technique.
no code implementations • 24 Mar 2016 • Yan Yan, Hanzi Wang, David Suter
In this paper, we propose an effective feature extraction algorithm, called Multi-Subregion based Correlation Filter Bank (MS-CFB), for robust face recognition.
no code implementations • 29 Dec 2015 • Da-Han Wang, Hanzi Wang, Dong Zhang, Jonathan Li, David Zhang
For character detection, we use the HSC features instead of using the Histograms of Oriented Gradients (HOG) features.
no code implementations • 22 Feb 2014 • Yan Yan, Chunhua Shen, Hanzi Wang
constraint for spectral clustering.
no code implementations • 17 Jan 2014 • Yuan Xie, Wensheng Zhang, DaCheng Tao, Wenrui Hu, Yanyun Qu, Hanzi Wang
To solve, or at least reduce these effects, we propose a new scheme to recover a latent image from observed frames by integrating a new variational model and distortion-driven spatial-temporal kernel regression.
no code implementations • NeurIPS 2009 • Tat-Jun Chin, Hanzi Wang, David Suter
The kernel permits the application of well-established statistical learning methods for effective outlier rejection, automatic recovery of the number of motions and accurate segmentation of the point trajectories.