Search Results for author: Guoqiang Liang

Found 17 papers, 8 papers with code

Dual Modality Prompt Tuning for Vision-Language Pre-Trained Model

1 code implementation • 17 Aug 2022 • Yinghui Xing, Qirui Wu, De Cheng, Shizhou Zhang, Guoqiang Liang, Peng Wang, Yanning Zhang

To make the final image feature concentrate more on the target visual concept, a Class-Aware Visual Prompt Tuning (CAVPT) scheme is further proposed in our DPT, where the class-aware visual prompt is generated dynamically by performing the cross attention between text prompts features and image patch token embeddings to encode both the downstream task-related information and visual instance information.

General Knowledge Language Modelling +1

Paper
Code

Learning Conditional Attributes for Compositional Zero-Shot Learning

1 code implementation • CVPR 2023 • Qingsheng Wang, Lingqiao Liu, Chenchen Jing, Hao Chen, Guoqiang Liang, Peng Wang, Chunhua Shen

Compositional Zero-Shot Learning (CZSL) aims to train models to recognize novel compositional concepts based on learned concepts such as attribute-object combinations.

Ranked #1 on Compositional Zero-Shot Learning on MIT-States

Attribute Compositional Zero-Shot Learning

Paper
Code

Ground-to-Aerial Person Search: Benchmark Dataset and Approach

1 code implementation • 24 Aug 2023 • Shizhou Zhang, Qingchun Yang, De Cheng, Yinghui Xing, Guoqiang Liang, Peng Wang, Yanning Zhang

In this work, we construct a large-scale dataset for Ground-to-Aerial Person Search, named G2APS, which contains 31, 770 images of 260, 559 annotated bounding boxes for 2, 644 identities appearing in both of the UAVs and ground surveillance cameras.

Knowledge Distillation Person Search

Paper
Code

New Insights on Relieving Task-Recency Bias for Online Class Incremental Learning

1 code implementation • 16 Feb 2023 • Guoqiang Liang, Zhaojie Chen, Zhaoqiang Chen, Shiyu Ji, Yanning Zhang

In all settings, the online class incremental learning (OCIL), where incoming samples from data stream can be used only once, is more challenging and can be encountered more frequently in real world.

Class Incremental Learning Incremental Learning +1

Paper
Code

MS-DETR: Multispectral Pedestrian Detection Transformer with Loosely Coupled Fusion and Modality-Balanced Optimization

1 code implementation • 1 Feb 2023 • Yinghui Xing, Song Wang, Shizhou Zhang, Guoqiang Liang, Xiuwei Zhang, Yanning Zhang

Most of the available multispectral pedestrian detectors are based on non-end-to-end detectors, while in this paper, we propose MultiSpectral pedestrian DEtection TRansformer (MS-DETR), an end-to-end multispectral pedestrian detector, which extends DETR into the field of multi-modal detection.

Pedestrian Detection

Paper
Code

Learning INR for Event-guided Rolling Shutter Frame Correction, Deblur, and Interpolation

1 code implementation • 24 May 2023 • Yunfan Lu, Guoqiang Liang, Lin Wang

Images captured by rolling shutter (RS) cameras under fast camera motion often contain obvious image distortions and blur, which can be modeled as a row-wise combination of a sequence of global shutter (GS) frames within the exposure time naturally, recovering high-frame-rate GS sharp frames from an RS blur image needs to simultaneously consider RS correction, deblur, and frame interpolation Taking this task is nontrivial, and to our knowledge, no feasible solutions exist by far.

Image Restoration

Paper
Code

CoLeCLIP: Open-Domain Continual Learning via Joint Task Prompt and Vocabulary Learning

1 code implementation • 15 Mar 2024 • Yukun Li, Guansong Pang, Wei Suo, Chenchen Jing, Yuling Xi, Lingqiao Liu, Hao Chen, Guoqiang Liang, Peng Wang

Large pre-trained VLMs like CLIP have demonstrated superior zero-shot recognition ability, and a number of recent studies leverage this ability to mitigate catastrophic forgetting in CL, but they focus on closed-set CL in a single domain dataset.

Class Incremental Learning Incremental Learning +1

Paper
Code

Text-based Person Search in Full Images via Semantic-Driven Proposal Generation

1 code implementation • 27 Sep 2021 • Shizhou Zhang, De Cheng, Wenlong Luo, Yinghui Xing, Duo Long, Hao Li, Kai Niu, Guoqiang Liang, Yanning Zhang

Finding target persons in full scene images with a query of text description has important practical applications in intelligent video surveillance. However, different from the real-world scenarios where the bounding boxes are not available, existing text-based person retrieval methods mainly focus on the cross modal matching between the query text descriptions and the gallery of cropped pedestrian images.

Person Search Retrieval +3

Paper
Code

Attend to the Difference: Cross-Modality Person Re-identification via Contrastive Correlation

no code implementations • 25 Oct 2019 • Shizhou Zhang, Yifei Yang, Peng Wang, Guoqiang Liang, Xiuwei Zhang, Yanning Zhang

The problem of cross-modality person re-identification has been receiving increasing attention recently, due to its practical significance.

Cross-Modality Person Re-identification Person Re-Identification

Paper
Add Code

Same data may bring conflict results: a caution to use the disruptive index

no code implementations • 15 Sep 2020 • Guoqiang Liang, Yi Jiang, Haiyan Hou

In the last two decades, scholars have designed various types of bibliographic related indicators to identify breakthrough-class academic achievements.

Paper
Add Code

An Adversarial Human Pose Estimation Network Injected with Graph Structure

no code implementations • 29 Mar 2021 • Lei Tian, Guoqiang Liang, Peng Wang, Chunhua Shen

Because of the invisible human keypoints in images caused by illumination, occlusion and overlap, it is likely to produce unreasonable human pose prediction for most of the current human pose estimation methods.

Generative Adversarial Network Pose Estimation +1

Paper
Add Code

Unsupervised Video Summarization with a Convolutional Attentive Adversarial Network

no code implementations • 24 May 2021 • Guoqiang Liang, Yanbing Lv, Shucheng Li, Shizhou Zhang, Yanning Zhang

Specifically, the generator employs a fully convolutional sequence network to extract global representation of a video, and an attention-based network to output normalized importance scores.

Generative Adversarial Network Unsupervised Video Summarization

Paper
Add Code

Non-exemplar Class-incremental Learning by Random Auxiliary Classes Augmentation and Mixed Features

no code implementations • 16 Apr 2023 • Ke Song, Quan Xia, Guoqiang Liang, Zhaojie Chen, Yanning Zhang

Instead, by mixing new and old features, old knowledge can be retained without increasing the computational complexity.

Class Incremental Learning Incremental Learning

Paper
Add Code

Self-supervised Learning of Event-guided Video Frame Interpolation for Rolling Shutter Frames

no code implementations • 27 Jun 2023 • Yunfan Lu, Guoqiang Liang, Lin Wang

Although events possess high temporal resolution, beneficial for video frame interpolation (VFI), a hurdle in tackling this task is the lack of paired GS frames.

Self-Supervised Learning Video Frame Interpolation

Paper
Add Code

DMAT: A Dynamic Mask-Aware Transformer for Human De-occlusion

no code implementations • 7 Feb 2024 • Guoqiang Liang, Jiahao Hu, Qingyue Wang, Shizhou Zhang

Human de-occlusion, which aims to infer the appearance of invisible human parts from an occluded image, has great value in many human-related tasks, such as person re-id, and intention inference.

valid

Paper
Add Code

Semi-Supervised Semantic Segmentation Based on Pseudo-Labels: A Survey

no code implementations • 4 Mar 2024 • Lingyan Ran, YaLi Li, Guoqiang Liang, Yanning Zhang

Semantic segmentation is an important and popular research area in computer vision that focuses on classifying pixels in an image based on their semantics.

Image Segmentation Pseudo Label +2

Paper
Add Code

Towards Robust Event-guided Low-Light Image Enhancement: A Large-Scale Real-World Event-Image Dataset and Novel Approach

no code implementations • 1 Apr 2024 • Guoqiang Liang, Kanghao Chen, Hangyu Li, Yunfan Lu, Lin Wang

To this end, we propose a real-world (indoor and outdoor) dataset comprising over 30K pairs of images and events under both low and normal illumination conditions.

feature selection Low-Light Image Enhancement

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.