Search Results for author: Xiaohua Xie

Found 33 papers, 17 papers with code

View-decoupled Transformer for Person Re-identification under Aerial-ground Camera Network

2 code implementations21 Mar 2024 Quan Zhang, Lei Wang, Vishal M. Patel, Xiaohua Xie, JianHuang Lai

Experiments on two datasets show that VDT is a feasible and effective solution for AGPReID, surpassing the previous method on mAP/Rank1 by up to 5. 0%/2. 7% on CARGO and 3. 7%/5. 2% on AG-ReID, keeping the same magnitude of computational complexity.

Person Re-Identification

Tackling the Singularities at the Endpoints of Time Intervals in Diffusion Models

1 code implementation13 Mar 2024 Pengze Zhang, Hubery Yin, Chen Li, Xiaohua Xie

Most diffusion models assume that the reverse process adheres to a Gaussian distribution.

Coarse-to-Fine Latent Diffusion for Pose-Guided Person Image Synthesis

1 code implementation28 Feb 2024 Yanzuo Lu, Manlin Zhang, Andy J Ma, Xiaohua Xie, Jian-Huang Lai

While existing methods simply align the person appearance to the target pose, they are prone to overfitting due to the lack of a high-level semantic understanding on the source person image.

Pose-Guided Image Generation

Exploring Adversarial Attacks against Latent Diffusion Model from the Perspective of Adversarial Transferability

no code implementations13 Jan 2024 Junxi Chen, Junhao Dong, Xiaohua Xie

Recently, many studies utilized adversarial examples (AEs) to raise the cost of malicious image editing and copyright violation powered by latent diffusion models (LDMs).

Adversarial Attack Image Classification

Attention-based Interactive Disentangling Network for Instance-level Emotional Voice Conversion

no code implementations29 Dec 2023 Yun Chen, Lingxiao Yang, Qi Chen, Jian-Huang Lai, Xiaohua Xie

We introduce a two-stage pipeline to effectively train our network: Stage I utilizes inter-speech contrastive learning to model fine-grained emotion and intra-speech disentanglement learning to better separate emotion and content.

Contrastive Learning Disentanglement +1

MLNet: Mutual Learning Network with Neighborhood Invariance for Universal Domain Adaptation

1 code implementation13 Dec 2023 Yanzuo Lu, Meng Shen, Andy J Ma, Xiaohua Xie, Jian-Huang Lai

Universal domain adaptation (UniDA) is a practical but challenging problem, in which information about the relation between the source and the target domains is not given for knowledge transfer.

Transfer Learning Universal Domain Adaptation

ASAG: Building Strong One-Decoder-Layer Sparse Detectors via Adaptive Sparse Anchor Generation

1 code implementation ICCV 2023 Shenghao Fu, Junkai Yan, Yipeng Gao, Xiaohua Xie, Wei-Shi Zheng

We find that the architecture discrepancy between dense and sparse detectors leads to feature conflict, hampering the performance of one-decoder-layer detectors.

APRF: Anti-Aliasing Projection Representation Field for Inverse Problem in Imaging

no code implementations11 Jul 2023 Zixuan Chen, Lingxiao Yang, JianHuang Lai, Xiaohua Xie

However, these methods have not considered the correlation between adjacent projection views, resulting in aliasing artifacts on SV sinograms.

Releasing Inequality Phenomena in $L_{\infty}$-Adversarial Training via Input Gradient Distillation

no code implementations16 May 2023 Junxi Chen, Junhao Dong, Xiaohua Xie

However, a recent work showed the inequality phenomena in $l_{\infty}$-adversarial training and revealed that the $l_{\infty}$-adversarially trained model is vulnerable when a few important pixels are perturbed by i. i. d.

Adversarial Defense Adversarial Robustness

Hard Nominal Example-aware Template Mutual Matching for Industrial Anomaly Detection

no code implementations28 Mar 2023 Zixuan Chen, Xiaohua Xie, Lingxiao Yang, JianHuang Lai

Additionally, to meet the speed-accuracy demands, we further propose \textbf{P}ixel-level \textbf{T}emplate \textbf{S}election (PTS) to streamline the original template set.

Anomaly Detection Incremental Learning

CuNeRF: Cube-Based Neural Radiance Field for Zero-Shot Medical Image Arbitrary-Scale Super Resolution

1 code implementation ICCV 2023 Zixuan Chen, Jian-Huang Lai, Lingxiao Yang, Xiaohua Xie

Medical image arbitrary-scale super-resolution (MIASSR) has recently gained widespread attention, aiming to super sample medical volumes at arbitrary scales via a single model.

Computed Tomography (CT) Super-Resolution

Adversarial Attack and Defense for Medical Image Analysis: Methods and Applications

no code implementations24 Mar 2023 Junhao Dong, Junxi Chen, Xiaohua Xie, JianHuang Lai, Hao Chen

In this exposition, we present a comprehensive survey on recent advances in adversarial attack and defense for medical image analysis with a novel taxonomy in terms of the application scenario.

Adversarial Attack Medical Diagnosis

The Enemy of My Enemy is My Friend: Exploring Inverse Adversaries for Improving Adversarial Training

no code implementations CVPR 2023 Junhao Dong, Seyed-Mohsen Moosavi-Dezfooli, JianHuang Lai, Xiaohua Xie

To circumvent this issue, we propose a novel adversarial training scheme that encourages the model to produce similar outputs for an adversarial example and its ``inverse adversarial'' counterpart.

SimAM: A Simple, Parameter-Free Attention Module for Convolutional Neural Networks

2 code implementations Proceedings of Machine Learning Research 2022 Lingxiao Yang, Ru-Yuan Zhang, Lida Li, Xiaohua Xie

Another advantage of the module is that most of the operators are selected based on the solution to the defined energy function, avoiding too many efforts for structure tuning.

Texture-guided Saliency Distilling for Unsupervised Salient Object Detection

1 code implementation CVPR 2023 Huajun Zhou, Bo Qiao, Lingxiao Yang, JianHuang Lai, Xiaohua Xie

In this paper, we propose a novel USOD method to mine rich and accurate saliency knowledge from both easy and hard samples.

Object object-detection +4

SNN2ANN: A Fast and Memory-Efficient Training Framework for Spiking Neural Networks

1 code implementation19 Jun 2022 Jianxiong Tang, JianHuang Lai, Xiaohua Xie, Lingxiao Yang, Wei-Shi Zheng

The SNN2ANN consists of 2 components: a) a weight sharing architecture between ANN and SNN and b) spiking mapping units.

3D-VFD: A Victim-free Detector against 3D Adversarial Point Clouds

no code implementations18 May 2022 Jiahao Zhu, Huajun Zhou, Zixuan Chen, Yi Zhou, Xiaohua Xie

3D deep models consuming point clouds have achieved sound application effects in computer vision.

Adversarial Attack Steganalysis

Cross-Camera Trajectories Help Person Retrieval in a Camera Network

1 code implementation27 Apr 2022 Xin Zhang, Xiaohua Xie, JianHuang Lai, Wei-Shi Zheng

To address this issue, we propose a pedestrian retrieval framework based on cross-camera trajectory generation, which integrates both temporal and spatial information.

Person Retrieval Re-Ranking +1

Restricted Black-box Adversarial Attack Against DeepFake Face Swapping

no code implementations26 Apr 2022 Junhao Dong, YuAn Wang, JianHuang Lai, Xiaohua Xie

DeepFake face swapping presents a significant threat to online security and social media, which can replace the source face in an arbitrary photo/video with the target face of an entirely different person.

Adversarial Attack Face Reconstruction +2

Exploring Dual-task Correlation for Pose Guided Person Image Generation

1 code implementation CVPR 2022 Pengze Zhang, Lingxiao Yang, JianHuang Lai, Xiaohua Xie

Pose Guided Person Image Generation (PGPIG) is the task of transforming a person image from the source pose to a given target pose.

Image Generation

Benchmarking Deep Models for Salient Object Detection

1 code implementation7 Feb 2022 Huajun Zhou, Yang Lin, Lingxiao Yang, JianHuang Lai, Xiaohua Xie

In recent years, deep network-based methods have continuously refreshed state-of-the-art performance on Salient Object Detection (SOD) task.

Benchmarking Object +3

Modeling 3D Layout for Group Re-Identification

1 code implementation CVPR 2022 Quan Zhang, Kaiheng Dang, Jian-Huang Lai, Zhanxiang Feng, Xiaohua Xie

To the best of our knowledge, 3DT is the first work to address GReID with 3D perspective, and the City1M is the currently largest dataset.

Improving Adversarially Robust Few-Shot Image Classification With Generalizable Representations

no code implementations CVPR 2022 Junhao Dong, YuAn Wang, Jian-Huang Lai, Xiaohua Xie

Extensive experiments show that our method can significantly outperform state-of-the-art adversarially robust FSIC methods on two standard benchmarks.

Classification Few-Shot Image Classification +1

Edge Prior Augmented Networks for Motion Deblurring on Naturally Blurry Images

no code implementations18 Sep 2021 Yuedong Chen, Junjia Huang, JianFeng Wang, Xiaohua Xie

Motion deblurring has witnessed rapid development in recent years, and most of the recent methods address it by using deep learning techniques, with the help of different kinds of prior knowledge.

Deblurring

Batch Face Alignment using a Low-rank GAN

no code implementations21 Oct 2019 Jiabo Huang, Xiaohua Xie, Wei-Shi Zheng

This paper studies the problem of aligning a set of face images of the same individual into a normalized image while removing the outliers like partial occlusion, extreme facial expression as well as significant illumination variation.

Face Alignment Generative Adversarial Network

Discovering Underlying Person Structure Pattern with Relative Local Distance for Person Re-identification

1 code implementation29 Jan 2019 Guangcong Wang, Jian-Huang Lai, Zhenyu Xie, Xiaohua Xie

With the discovered underlying person structure, the RLD method builds a bridge between the global and local feature representation and thus improves the capacity of feature representation for person re-ID.

Person Re-Identification Representation Learning

Spatial-Temporal Person Re-identification

3 code implementations8 Dec 2018 Guangcong Wang, Jian-Huang Lai, Peigen Huang, Xiaohua Xie

In this paper, we propose a novel two-stream spatial-temporal person ReID (st-ReID) framework that mines both visual semantic information and spatial-temporal information.

Person Re-Identification

Learning View-Specific Deep Networks for Person Re-Identification

no code implementations30 Mar 2018 Zhanxiang Feng, Jian-Huang Lai, Xiaohua Xie

In recent years, a growing body of research has focused on the problem of person re-identification (re-id).

Person Re-Identification

Deep Growing Learning

no code implementations ICCV 2017 Guangcong Wang, Xiaohua Xie, Jian-Huang Lai, Jiaxuan Zhuo

A bottleneck of SSL is the overfitting problem when training over the limited labeled data, especially on a complex model like a deep neural network.

Motion-Appearance Interactive Encoding for Object Segmentation in Unconstrained Videos

no code implementations25 Jul 2017 Chunchao Guo, Jian-Huang Lai, Xiaohua Xie

We present a novel method of integrating motion and appearance cues for foreground object segmentation in unconstrained videos.

Graph Matching Object +3

Cannot find the paper you are looking for? You can Submit a new open access paper.