Search Results for author: Shizhou Zhang

Found 15 papers, 5 papers with code

DMAT: A Dynamic Mask-Aware Transformer for Human De-occlusion

no code implementations • 7 Feb 2024 • Guoqiang Liang, Jiahao Hu, Qingyue Wang, Shizhou Zhang

Human de-occlusion, which aims to infer the appearance of invisible human parts from an occluded image, has great value in many human-related tasks, such as person re-id, and intention inference.

valid

Paper
Add Code

CrossDiff: Exploring Self-Supervised Representation of Pansharpening via Cross-Predictive Diffusion Model

no code implementations • 10 Jan 2024 • Yinghui Xing, Litao Qu, Shizhou Zhang, Kai Zhang, Yanning Zhang

Fusion of a panchromatic (PAN) image and corresponding multispectral (MS) image is also known as pansharpening, which aims to combine abundant spatial details of PAN and spectral information of MS. Due to the absence of high-resolution MS images, available deep-learning-based methods usually follow the paradigm of training at reduced resolution and testing at both reduced and full resolution.

Pansharpening

Paper
Add Code

Ground-to-Aerial Person Search: Benchmark Dataset and Approach

1 code implementation • 24 Aug 2023 • Shizhou Zhang, Qingchun Yang, De Cheng, Yinghui Xing, Guoqiang Liang, Peng Wang, Yanning Zhang

In this work, we construct a large-scale dataset for Ground-to-Aerial Person Search, named G2APS, which contains 31, 770 images of 260, 559 annotated bounding boxes for 2, 644 identities appearing in both of the UAVs and ground surveillance cameras.

Knowledge Distillation Person Search

Paper
Code

Pre-train, Adapt and Detect: Multi-Task Adapter Tuning for Camouflaged Object Detection

no code implementations • 20 Jul 2023 • Yinghui Xing, Dexuan Kong, Shizhou Zhang, Geng Chen, Lingyan Ran, Peng Wang, Yanning Zhang

Camouflaged object detection (COD), aiming to segment camouflaged objects which exhibit similar patterns with the background, is a challenging task.

Multi-Task Learning object-detection +1

Paper
Add Code

Efficient Bilateral Cross-Modality Cluster Matching for Unsupervised Visible-Infrared Person ReID

no code implementations • 22 May 2023 • De Cheng, Lingfeng He, Nannan Wang, Shizhou Zhang, Zhen Wang, Xinbo Gao

To this end, we propose a novel bilateral cluster matching-based learning framework to reduce the modality gap by matching cross-modality clusters.

Contrastive Learning Person Re-Identification

Paper
Add Code

MS-DETR: Multispectral Pedestrian Detection Transformer with Loosely Coupled Fusion and Modality-Balanced Optimization

1 code implementation • 1 Feb 2023 • Yinghui Xing, Song Wang, Shizhou Zhang, Guoqiang Liang, Xiuwei Zhang, Yanning Zhang

Most of the available multispectral pedestrian detectors are based on non-end-to-end detectors, while in this paper, we propose MultiSpectral pedestrian DEtection TRansformer (MS-DETR), an end-to-end multispectral pedestrian detector, which extends DETR into the field of multi-modal detection.

Pedestrian Detection

Paper
Code

Weakly Supervised Video Anomaly Detection Based on Cross-Batch Clustering Guidance

no code implementations • 16 Dec 2022 • Congqi Cao, Xin Zhang, Shizhou Zhang, Peng Wang, Yanning Zhang

To enhance the discriminative power of features, we propose a batch clustering based loss to encourage a clustering branch to generate distinct normal and abnormal clusters based on a batch of data.

Anomaly Detection Clustering +1

Paper
Add Code

Generalizable Person Re-Identification via Viewpoint Alignment and Fusion

no code implementations • 5 Dec 2022 • Bingliang Jiao, Lingqiao Liu, Liying Gao, Guosheng Lin, Ruiqi Wu, Shizhou Zhang, Peng Wang, Yanning Zhang

The key insight of this design is that the cross-attention mechanism in the transformer could be an ideal solution to align the discriminative texture clues from the original image with the canonical view image, which could compensate for the low-quality texture information of the canonical view image.

Domain Generalization Generalizable Person Re-identification +1

Paper
Add Code

Dual Modality Prompt Tuning for Vision-Language Pre-Trained Model

1 code implementation • 17 Aug 2022 • Yinghui Xing, Qirui Wu, De Cheng, Shizhou Zhang, Guoqiang Liang, Peng Wang, Yanning Zhang

To make the final image feature concentrate more on the target visual concept, a Class-Aware Visual Prompt Tuning (CAVPT) scheme is further proposed in our DPT, where the class-aware visual prompt is generated dynamically by performing the cross attention between text prompts features and image patch token embeddings to encode both the downstream task-related information and visual instance information.

General Knowledge Language Modelling +1

Paper
Code

Adaptive Graph Convolutional Networks for Weakly Supervised Anomaly Detection in Videos

no code implementations • 14 Feb 2022 • Congqi Cao, Xin Zhang, Shizhou Zhang, Peng Wang, Yanning Zhang

For weakly supervised anomaly detection, most existing work is limited to the problem of inadequate video representation due to the inability of modeling long-term contextual information.

Graph Learning Supervised Anomaly Detection +1

Paper
Add Code

Text-based Person Search in Full Images via Semantic-Driven Proposal Generation

1 code implementation • 27 Sep 2021 • Shizhou Zhang, De Cheng, Wenlong Luo, Yinghui Xing, Duo Long, Hao Li, Kai Niu, Guoqiang Liang, Yanning Zhang

Finding target persons in full scene images with a query of text description has important practical applications in intelligent video surveillance. However, different from the real-world scenarios where the bounding boxes are not available, existing text-based person retrieval methods mainly focus on the cross modal matching between the query text descriptions and the gallery of cropped pedestrian images.

Person Search Retrieval +3

Paper
Code

Unsupervised Video Summarization with a Convolutional Attentive Adversarial Network

no code implementations • 24 May 2021 • Guoqiang Liang, Yanbing Lv, Shucheng Li, Shizhou Zhang, Yanning Zhang

Specifically, the generator employs a fully convolutional sequence network to extract global representation of a video, and an attention-based network to output normalized importance scores.

Generative Adversarial Network Unsupervised Video Summarization

Paper
Add Code

Attend to the Difference: Cross-Modality Person Re-identification via Contrastive Correlation

no code implementations • 25 Oct 2019 • Shizhou Zhang, Yifei Yang, Peng Wang, Guoqiang Liang, Xiuwei Zhang, Yanning Zhang

The problem of cross-modality person re-identification has been receiving increasing attention recently, due to its practical significance.

Cross-Modality Person Re-identification Person Re-Identification

Paper
Add Code

Person Re-identification in Aerial Imagery

1 code implementation • 14 Aug 2019 • Shizhou Zhang, Qi Zhang, Yifei Yang, Xing Wei, Peng Wang, Bingliang Jiao, Yanning Zhang

Our method can learn a discriminative and compact feature representation for ReID in aerial imagery and can be trained in an end-to-end fashion efficiently.

object-detection Object Detection +1

Paper
Code

Vehicle Re-identification in Aerial Imagery: Dataset and Approach

no code implementations • ICCV 2019 • Peng Wang, Bingliang Jiao, Lu Yang, Yifei Yang, Shizhou Zhang, Wei Wei, Yanning Zhang

It is capable of explicitly detecting discriminative parts for each specific vehicle and significantly outperforms the evaluated baselines and state-of-the-art vehicle ReID approaches.

Vehicle Re-Identification

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.