Search Results for author: Nannan Wang

Found 58 papers, 18 papers with code

Unsupervised Visible-Infrared Person ReID by Collaborative Learning with Neighbor-Guided Label Refinement

no code implementations22 May 2023 De Cheng, Xiaojian Huang, Nannan Wang, Lingfeng He, Zhihui Li, Xinbo Gao

Unsupervised learning visible-infrared person re-identification (USL-VI-ReID) aims at learning modality-invariant features from unlabeled cross-modality dataset, which is crucial for practical applications in video surveillance systems.

Person Re-Identification

Efficient Bilateral Cross-Modality Cluster Matching for Unsupervised Visible-Infrared Person ReID

no code implementations22 May 2023 De Cheng, Lingfeng He, Nannan Wang, Shizhou Zhang, Zhen Wang, Xinbo Gao

To this end, we propose a novel bilateral cluster matching-based learning framework to reduce the modality gap by matching cross-modality clusters.

Contrastive Learning Person Re-Identification

Adapt and Align to Improve Zero-Shot Sketch-Based Image Retrieval

no code implementations9 May 2023 Shiyin Dong, Mingrui Zhu, Nannan Wang, Heng Yang, Xinbo Gao

Zero-shot sketch-based image retrieval (ZS-SBIR) is challenging due to the cross-domain nature of sketches and photos, as well as the semantic gap between seen and unseen image distributions.

Retrieval Sketch-Based Image Retrieval +1

Semantic-aware Generation of Multi-view Portrait Drawings

1 code implementation4 May 2023 Biao Ma, Fei Gao, Chang Jiang, Nannan Wang, Gang Xu

Our motivation is that facial semantic labels are view-consistent and correlate with drawing techniques.

3D-Aware Image Synthesis Data Augmentation

Boosting Weakly-Supervised Temporal Action Localization with Text Information

1 code implementation CVPR 2023 Guozhang Li, De Cheng, Xinpeng Ding, Nannan Wang, Xiaoyu Wang, Xinbo Gao

For the discriminative objective, we propose a Text-Segment Mining (TSM) mechanism, which constructs a text description based on the action class label, and regards the text as the query to mine all class-related segments.

Weakly-supervised Temporal Action Localization Weakly Supervised Temporal Action Localization

Weakly-Supervised Temporal Action Localization with Bidirectional Semantic Consistency Constraint

1 code implementation25 Apr 2023 Guozhang Li, De Cheng, Xinpeng Ding, Nannan Wang, Jie Li, Xinbo Gao

The proposed Bi-SCC firstly adopts a temporal context augmentation to generate an augmented video that breaks the correlation between positive actions and their co-scene actions in the inter-video; Then, a semantic consistency constraint (SCC) is used to enforce the predictions of the original video and augmented video to be consistent, hence suppressing the co-scene actions.

Weakly-supervised Temporal Action Localization Weakly Supervised Temporal Action Localization

Masked and Adaptive Transformer for Exemplar Based Image Translation

1 code implementation CVPR 2023 Chang Jiang, Fei Gao, Biao Ma, YuHao Lin, Nannan Wang, Gang Xu

To overcome this challenge, we improve the accuracy of matching on the one hand, and diminish the role of matching in image generation on the other hand.

Image Generation Semantic correspondence +1

Few-shot Face Image Translation via GAN Prior Distillation

no code implementations28 Jan 2023 Ruoyu Zhao, Mingrui Zhu, Xiaoyu Wang, Nannan Wang

GPD contains two models: a teacher network with GAN Prior and a student network that fulfills end-to-end translation.

Knowledge Distillation Translation

Few-shot Font Generation by Learning Style Difference and Similarity

no code implementations24 Jan 2023 Xiao He, Mingrui Zhu, Nannan Wang, Xinbo Gao, Heng Yang

To address this issue, we propose a novel font generation approach by learning the Difference between different styles and the Similarity of the same style (DS-Font).

Contrastive Learning Font Generation

All-to-key Attention for Arbitrary Style Transfer

no code implementations8 Dec 2022 Mingrui Zhu, Xiao He, Nannan Wang, Xiaoyu Wang, Xinbo Gao

In this paper, we propose a novel all-to-key attention mechanism -- each position of content features is matched to stable key positions of style features -- that is more in line with the characteristics of style transfer.

Style Transfer

Neighbour Consistency Guided Pseudo-Label Refinement for Unsupervised Person Re-Identification

no code implementations30 Nov 2022 De Cheng, Haichun Tai, Nannan Wang, Zhen Wang, Xinbo Gao

In this paper, we propose a Neighbour Consistency guided Pseudo Label Refinement (NCPLR) framework, which can be regarded as a transductive form of label propagation under the assumption that the prediction of each example should be similar to its nearest neighbours'.

Person Retrieval Pseudo Label +2

VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

1 code implementation27 Nov 2022 Kun Cheng, Xiaodong Cun, Yong Zhang, Menghan Xia, Fei Yin, Mingrui Zhu, Xuan Wang, Jue Wang, Nannan Wang

Our system disentangles this objective into three sequential tasks: (1) face video generation with a canonical expression; (2) audio-driven lip-sync; and (3) face enhancement for improving photo-realism.

Video Editing Video Generation

NAR-Former: Neural Architecture Representation Learning towards Holistic Attributes Prediction

1 code implementation CVPR 2023 Yun Yi, Haokui Zhang, Wenze Hu, Nannan Wang, Xiaoyu Wang

In this paper, we propose a neural architecture representation model that can be used to estimate these attributes holistically.

Representation Learning

Class-Dependent Label-Noise Learning with Cycle-Consistency Regularization Feature Space

1 code implementation NIPS 2022 De Cheng, Yixiong Ning, Nannan Wang, Xinbo Gao, Heng Yang, Yuxuan Du, Bo Han, Tongliang Liu

We show that the cycle-consistency regularization helps to minimize the volume of the transition matrix T indirectly without exploiting the estimated noisy class posterior, which could further encourage the estimated transition matrix T to converge to its optimal solution.

FedForgery: Generalized Face Forgery Detection with Residual Federated Learning

1 code implementation18 Oct 2022 Decheng Liu, Zhan Dang, Chunlei Peng, Yu Zheng, Shuang Li, Nannan Wang, Xinbo Gao

Experiments conducted on publicly available face forgery detection datasets prove the superior performance of the proposed FedForgery.

Federated Learning Image Generation

Strength-Adaptive Adversarial Training

no code implementations4 Oct 2022 Chaojian Yu, Dawei Zhou, Li Shen, Jun Yu, Bo Han, Mingming Gong, Nannan Wang, Tongliang Liu

Firstly, applying a pre-specified perturbation budget on networks of various model capacities will yield divergent degree of robustness disparity between natural and robust accuracies, which deviates from robust network's desideratum.

Adversarial Robustness Scheduling

Visual Information Hiding Based on Obfuscating Adversarial Perturbations

no code implementations30 Sep 2022 Zhigang Su, Dawei Zhou, Decheng Liu, Nannan Wang, Zhen Wang, Xinbo Gao

Growing leakage and misuse of visual information raise security and privacy concerns, which promotes the development of information protection.

Adversarial Attack De-identification +1

Improving Adversarial Robustness via Mutual Information Estimation

1 code implementation25 Jul 2022 Dawei Zhou, Nannan Wang, Xinbo Gao, Bo Han, Xiaoyu Wang, Yibing Zhan, Tongliang Liu

To alleviate this negative effect, in this paper, we investigate the dependence between outputs of the target model and input adversarial samples from the perspective of information theory, and propose an adversarial defense method.

Adversarial Defense Adversarial Robustness +1

TransFA: Transformer-based Representation for Face Attribute Evaluation

1 code implementation12 Jul 2022 Decheng Liu, Weijie He, Chunlei Peng, Nannan Wang, Jie Li, Xinbo Gao

The multiple branches transformer is employed to explore the inter-correlation between different attributes in similar semantic regions for attribute feature learning.

Multi-Label Classification Representation Learning

Spatial-Temporal Frequency Forgery Clue for Video Forgery Detection in VIS and NIR Scenario

no code implementations5 Jul 2022 Yukai Wang, Chunlei Peng, Decheng Liu, Nannan Wang, Xinbo Gao

In recent years, with the rapid development of face editing and generation, more and more fake videos are circulating on social media, which has caused extreme public concerns.

Instance-Dependent Label-Noise Learning with Manifold-Regularized Transition Matrix Estimation

no code implementations CVPR 2022 De Cheng, Tongliang Liu, Yixiong Ning, Nannan Wang, Bo Han, Gang Niu, Xinbo Gao, Masashi Sugiyama

In label-noise learning, estimating the transition matrix has attracted more and more attention as the matrix plays an important role in building statistically consistent classifiers.

Robust Single Image Dehazing Based on Consistent and Contrast-Assisted Reconstruction

no code implementations29 Mar 2022 De Cheng, Yan Li, Dingwen Zhang, Nannan Wang, Xinbo Gao, Jiande Sun

To properly address this problem, we propose a novel density-variational learning framework to improve the robustness of the image dehzing model assisted by a variety of negative hazy images, to better deal with various complex hazy scenarios.

Image Dehazing Single Image Dehazing

Towards Semi-Supervised Deep Facial Expression Recognition with An Adaptive Confidence Margin

1 code implementation CVPR 2022 Hangyu Li, Nannan Wang, Xi Yang, Xiaoyu Wang, Xinbo Gao

In this paper, we learn an Adaptive Confidence Margin (Ada-CM) to fully leverage all unlabeled data for semi-supervised deep facial expression recognition.

Facial Expression Recognition (FER)

Semi-parametric Makeup Transfer via Semantic-aware Correspondence

1 code implementation4 Mar 2022 Mingrui Zhu, Yun Yi, Nannan Wang, Xiaoyu Wang, Xinbo Gao

The large discrepancy between the source non-makeup image and the reference makeup image is one of the key challenges in makeup transfer.

Hybrid Dynamic Contrast and Probability Distillation for Unsupervised Person Re-Id

no code implementations29 Sep 2021 De Cheng, Jingyu Zhou, Nannan Wang, Xinbo Gao

However, since person Re-Id is an open-set problem, the clustering based methods often leave out lots of outlier instances or group the instances into the wrong clusters, thus they can not make full use of the training samples as a whole.

Contrastive Learning Metric Learning +2

Modeling Adversarial Noise for Adversarial Defense

no code implementations29 Sep 2021 Dawei Zhou, Nannan Wang, Bo Han, Tongliang Liu

Deep neural networks have been demonstrated to be vulnerable to adversarial noise, promoting the development of defense against adversarial attacks.

Adversarial Defense

Single Image Dehazing with An Independent Detail-Recovery Network

1 code implementation22 Sep 2021 Yan Li, De Cheng, Jiande Sun, Dingwen Zhang, Nannan Wang, Xinbo Gao

In this paper, we propose a single image dehazing method with an independent Detail Recovery Network (DRN), which considers capturing the details from the input image over a separate network and then integrates them into a coarse dehazed image.

Image Dehazing Single Image Dehazing

Modeling Adversarial Noise for Adversarial Training

1 code implementation21 Sep 2021 Dawei Zhou, Nannan Wang, Bo Han, Tongliang Liu

Deep neural networks have been demonstrated to be vulnerable to adversarial noise, promoting the development of defense against adversarial attacks.

Adversarial Defense

Support-Set Based Cross-Supervision for Video Grounding

no code implementations ICCV 2021 Xinpeng Ding, Nannan Wang, Shiwei Zhang, De Cheng, Xiaomeng Li, Ziyuan Huang, Mingqian Tang, Xinbo Gao

The contrastive objective aims to learn effective representations by contrastive learning, while the caption objective can train a powerful video encoder supervised by texts.

Contrastive Learning Video Grounding

Exploring Set Similarity for Dense Self-supervised Representation Learning

no code implementations CVPR 2022 Zhaoqing Wang, Qiang Li, Guoxin Zhang, Pengfei Wan, Wen Zheng, Nannan Wang, Mingming Gong, Tongliang Liu

By considering the spatial correspondence, dense self-supervised representation learning has achieved superior performance on various dense prediction tasks.

Instance Segmentation Keypoint Detection +4

Kernel Mean Estimation by Marginalized Corrupted Distributions

no code implementations10 Jul 2021 Xiaobo Xia, Shuo Shan, Mingming Gong, Nannan Wang, Fei Gao, Haikun Wei, Tongliang Liu

Estimating the kernel mean in a reproducing kernel Hilbert space is a critical component in many kernel learning algorithms.

Improving White-box Robustness of Pre-processing Defenses via Joint Adversarial Training

no code implementations10 Jun 2021 Dawei Zhou, Nannan Wang, Xinbo Gao, Bo Han, Jun Yu, Xiaoyu Wang, Tongliang Liu

However, pre-processing methods may suffer from the robustness degradation effect, in which the defense reduces rather than improving the adversarial robustness of a target model in a white-box setting.

Adversarial Defense Adversarial Robustness

Towards Defending against Adversarial Examples via Attack-Invariant Features

no code implementations9 Jun 2021 Dawei Zhou, Tongliang Liu, Bo Han, Nannan Wang, Chunlei Peng, Xinbo Gao

However, given the continuously evolving attacks, models trained on seen types of adversarial examples generally cannot generalize well to unseen types of adversarial examples.

Adversarial Robustness

Removing Adversarial Noise in Class Activation Feature Space

no code implementations ICCV 2021 Dawei Zhou, Nannan Wang, Chunlei Peng, Xinbo Gao, Xiaoyu Wang, Jun Yu, Tongliang Liu

Then, we train a denoising model to minimize the distances between the adversarial examples and the natural examples in the class activation feature space.

Adversarial Robustness Denoising

Drafting and Revision: Laplacian Pyramid Network for Fast High-Quality Artistic Style Transfer

2 code implementations CVPR 2021 Tianwei Lin, Zhuoqi Ma, Fu Li, Dongliang He, Xin Li, Errui Ding, Nannan Wang, Jie Li, Xinbo Gao

Inspired by the common painting process of drawing a draft and revising the details, we introduce a novel feed-forward method named Laplacian Pyramid Network (LapStyle).

Style Transfer

Syncretic Modality Collaborative Learning for Visible Infrared Person Re-Identification

no code implementations ICCV 2021 Ziyu Wei, Xi Yang, Nannan Wang, Xinbo Gao

Visible infrared person re-identification (VI-REID) aims to match pedestrian images between the daytime visible and nighttime infrared camera views.

Person Re-Identification

ADD-Defense: Towards Defending Widespread Adversarial Examples via Perturbation-Invariant Representation

no code implementations1 Jan 2021 Dawei Zhou, Tongliang Liu, Bo Han, Nannan Wang, Xinbo Gao

Motivated by this observation, we propose a defense framework ADD-Defense, which extracts the invariant information called \textit{perturbation-invariant representation} (PIR) to defend against widespread adversarial examples.

Extended T: Learning with Mixed Closed-set and Open-set Noisy Labels

no code implementations2 Dec 2020 Xiaobo Xia, Tongliang Liu, Bo Han, Nannan Wang, Jiankang Deng, Jiatong Li, Yinian Mao

The traditional transition matrix is limited to model closed-set label noise, where noisy training data has true class labels within the noisy label set.

Class2Simi: A New Perspective on Learning with Label Noise

no code implementations28 Sep 2020 Songhua Wu, Xiaobo Xia, Tongliang Liu, Bo Han, Mingming Gong, Nannan Wang, Haifeng Liu, Gang Niu

It is worthwhile to perform the transformation: We prove that the noise rate for the noisy similarity labels is lower than that of the noisy class labels, because similarity labels themselves are robust to noise.

CoFF: Cooperative Spatial Feature Fusion for 3D Object Detection on Autonomous Vehicles

no code implementations24 Sep 2020 Jingda Guo, Dominic Carrillo, Sihai Tang, Qi Chen, Qing Yang, Song Fu, Xi Wang, Nannan Wang, Paparao Palacharla

To reduce the amount of transmitted data, feature map based fusion is recently proposed as a practical solution to cooperative 3D object detection by autonomous vehicles.

3D Object Detection Autonomous Vehicles +1

Class2Simi: A Noise Reduction Perspective on Learning with Noisy Labels

no code implementations14 Jun 2020 Songhua Wu, Xiaobo Xia, Tongliang Liu, Bo Han, Mingming Gong, Nannan Wang, Haifeng Liu, Gang Niu

To give an affirmative answer, in this paper, we propose a framework called Class2Simi: it transforms data points with noisy class labels to data pairs with noisy similarity labels, where a similarity label denotes whether a pair shares the class label or not.

Contrastive Learning Learning with noisy labels +1

Part-dependent Label Noise: Towards Instance-dependent Label Noise

1 code implementation NeurIPS 2020 Xiaobo Xia, Tongliang Liu, Bo Han, Nannan Wang, Mingming Gong, Haifeng Liu, Gang Niu, DaCheng Tao, Masashi Sugiyama

Learning with the \textit{instance-dependent} label noise is challenging, because it is hard to model such real-world noise.

Multi-Margin based Decorrelation Learning for Heterogeneous Face Recognition

no code implementations25 May 2020 Bing Cao, Nannan Wang, Xinbo Gao, Jie Li, Zhifeng Li

Heterogeneous face recognition (HFR) refers to matching face images acquired from different domains with wide applications in security scenarios.

Face Recognition Heterogeneous Face Recognition +1

Facial Attribute Capsules for Noise Face Super Resolution

no code implementations16 Feb 2020 Jingwei Xin, Nannan Wang, Xinrui Jiang, Jie Li, Xinbo Gao, Zhifeng Li

In the SR processing, we first generated a group of FACs from the input LR face, and then reconstructed the HR face from this group of FACs.

Image Super-Resolution

Multi-Class Classification from Noisy-Similarity-Labeled Data

no code implementations16 Feb 2020 Songhua Wu, Xiaobo Xia, Tongliang Liu, Bo Han, Mingming Gong, Nannan Wang, Haifeng Liu, Gang Niu

We further estimate the transition matrix from only noisy data and build a novel learning system to learn a classifier which can assign noise-free class labels for instances.

Classification General Classification +1

Video Face Super-Resolution with Motion-Adaptive Feedback Cell

no code implementations15 Feb 2020 Jingwei Xin, Nannan Wang, Jie Li, Xinbo Gao, Zhifeng Li

Current state-of-the-art CNN methods usually treat the VSR problem as a large number of separate multi-frame super-resolution tasks, at which a batch of low resolution (LR) frames is utilized to generate a single high resolution (HR) frame, and running a slide window to select LR frames over the entire video would obtain a series of HR frames.

Motion Compensation Motion Estimation +2

Are Anchor Points Really Indispensable in Label-Noise Learning?

1 code implementation NeurIPS 2019 Xiaobo Xia, Tongliang Liu, Nannan Wang, Bo Han, Chen Gong, Gang Niu, Masashi Sugiyama

Existing theories have shown that the transition matrix can be learned by exploiting \textit{anchor points} (i. e., data points that belong to a specific class almost surely).

Learning with noisy labels

Saliency deep embedding for aurora image search

no code implementations23 May 2018 Xi Yang, Xinbo Gao, Bin Song, Nannan Wang, Dong Yang

In this paper, we aim to explore a new search method for images captured with circular fisheye lens, especially the aurora images.

Image Retrieval Region Proposal

Random Sampling for Fast Face Sketch Synthesis

no code implementations8 Jan 2017 Nannan Wang, Xinbo Gao, Jie Li

The most time-consuming or main computation complexity for exemplar-based face sketch synthesis methods lies in the neighbor selection process.

Face Hallucination Face Sketch Synthesis

Sparse Graphical Representation based Discriminant Analysis for Heterogeneous Face Recognition

no code implementations1 Jul 2016 Chunlei Peng, Xinbo Gao, Nannan Wang, Jie Li

An adaptive sparse graphical representation scheme is designed to represent heterogeneous face images, where a Markov networks model is constructed to generate adaptive sparse vectors.

Face Recognition Heterogeneous Face Recognition

Training-Free Synthesized Face Sketch Recognition Using Image Quality Assessment Metrics

no code implementations25 Mar 2016 Nannan Wang, Jie Li, Leiyu Sun, Bin Song, Xinbo Gao

In this paper, we proposed a synthesized face sketch recognition framework based on full-reference image quality assessment metrics.

Face Recognition Face Sketch Synthesis +2

Graphical Representation for Heterogeneous Face Recognition

no code implementations2 Mar 2015 Chunlei Peng, Xinbo Gao, Nannan Wang, Jie Li

Heterogeneous face recognition (HFR) refers to matching face images acquired from different sources (i. e., different sensors or different wavelengths) for identification.

Face Recognition Heterogeneous Face Recognition

Facial Feature Point Detection: A Comprehensive Survey

no code implementations4 Oct 2014 Nannan Wang, Xinbo Gao, DaCheng Tao, Xuelong. Li

CLM-based methods consist of a shape model and a number of local experts, each of which is utilized to detect a facial feature point.

3D Face Modelling Face Alignment +3

Cannot find the paper you are looking for? You can Submit a new open access paper.