Search Results for author: Zhizhong Zhang

Found 25 papers, 12 papers with code

PromptAD: Learning Prompts with only Normal Samples for Few-Shot Anomaly Detection

2 code implementations • 8 Apr 2024 • Xiaofan Li, Zhizhong Zhang, Xin Tan, Chengwei Chen, Yanyun Qu, Yuan Xie, Lizhuang Ma

Vision-language models have brought great improvement to few-shot industrial anomaly detection, but they usually require designing hundreds of prompts through prompt engineering.

Anomaly Detection Language Modelling +1

131

DEMOS: Dynamic Environment Motion Synthesis in 3D Scenes via Local Spherical-BEV Perception

no code implementations • 4 Mar 2024 • Jingyu Gong, Min Wang, Wentao Liu, Chen Qian, Zhizhong Zhang, Yuan Xie, Lizhuang Ma

To handle this problem, we propose the first Dynamic Environment MOtion Synthesis framework (DEMOS) to predict future motion instantly according to the current scene, and use it to dynamically update the latent motion for final motion synthesis.

Motion Prediction Motion Synthesis

Multi-Memory Matching for Unsupervised Visible-Infrared Person Re-Identification

no code implementations • 12 Jan 2024 • Jiangming Shi, Xiangbo Yin, Yeyun Chen, Yachao Zhang, Zhizhong Zhang, Yuan Xie, Yanyun Qu

To associate cross-modality clustered pseudo-labels, we design a Multi-Memory Learning and Matching (MMLM) module, ensuring that optimization explicitly focuses on the nuances of individual perspectives and establishes reliable cross-modality correspondences.

Clustering Person Re-Identification +1

Beyond the Label Itself: Latent Labels Enhance Semi-supervised Point Cloud Panoptic Segmentation

no code implementations • 13 Dec 2023 • Yujun Chen, Xin Tan, Zhizhong Zhang, Yanyun Qu, Yuan Xie

Second, in the Image Branch, we propose the Instance Position-scale Learning (IPSL) Module to learn and fuse the information of instance position and scale, which is from a 2D pre-trained detector and a type of latent label obtained from 3D to 2D projection.

Panoptic Segmentation Position

COTR: Compact Occupancy TRansformer for Vision-based 3D Occupancy Prediction

1 code implementation • 4 Dec 2023 • Qihang Ma, Xin Tan, Yanyun Qu, Lizhuang Ma, Zhizhong Zhang, Yuan Xie

The autonomous driving community has shown significant interest in 3D occupancy prediction, driven by its exceptional geometric perception and general object recognition capabilities.

Autonomous Driving Object Recognition

Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer

1 code implementation • 22 Nov 2023 • Zhen Zhao, Jingqun Tang, Chunhui Lin, Binghong Wu, Can Huang, Hao Liu, Xin Tan, Zhizhong Zhang, Yuan Xie

A straightforward solution is performing model fine-tuning tailored to a specific scenario, but it is computationally intensive and requires multiple model copies for various scenarios.

In-Context Learning Scene Text Recognition

LiDAR-Camera Panoptic Segmentation via Geometry-Consistent and Semantic-Aware Alignment

1 code implementation • ICCV 2023 • Zhiwei Zhang, Zhizhong Zhang, Qian Yu, Ran Yi, Yuan Xie, Lizhuang Ma

3D panoptic segmentation is a challenging perception task that requires both semantic segmentation and instance segmentation.

Instance Segmentation Panoptic Segmentation +1

Cross-Stream Contrastive Learning for Self-Supervised Skeleton-Based Action Recognition

no code implementations • 3 May 2023 • Ding Li, Yongqiang Tang, Zhizhong Zhang, Wensheng Zhang

Besides, to further exploit the potential of positive pairs and increase the robustness of self-supervised representation learning, we propose a Positive Feature Transformation (PFT) strategy which adopts feature-level manipulation to increase the variance of positive pairs.

Action Recognition Contrastive Learning +2

High-Resolution GAN Inversion for Degraded Images in Large Diverse Datasets

1 code implementation • 7 Feb 2023 • Yanbo Wang, Chuming Lin, Donghao Luo, Ying Tai, Zhizhong Zhang, Yuan Xie

A generic method for generating a high-quality image from the degraded one is in demand.

Clustering Colorization +2

Instance and Category Supervision are Alternate Learners for Continual Learning

no code implementations • ICCV 2023 • Xudong Tian, Zhizhong Zhang, Xin Tan, Jun Liu, Chengjie Wang, Yanyun Qu, Guannan Jiang, Yuan Xie

Continual Learning (CL) is the constant development of complex behaviors by building upon previously acquired skills.

Continual Learning Self-Supervised Learning

Multi-Centroid Task Descriptor for Dynamic Class Incremental Inference

no code implementations • CVPR 2023 • Tenghao Cai, Zhizhong Zhang, Xin Tan, Yanyun Qu, Guannan Jiang, Chengjie Wang, Yuan Xie

As a result, our dynamic inference network is trained independently of the baseline and provides a flexible, efficient solution to distinguish between tasks.

Class Incremental Learning Incremental Learning

Dual Pseudo-Labels Interactive Self-Training for Semi-Supervised Visible-Infrared Person Re-Identification

1 code implementation • ICCV 2023 • Jiangming Shi, Yachao Zhang, Xiangbo Yin, Yuan Xie, Zhizhong Zhang, Jianping Fan, Zhongchao Shi, Yanyun Qu

Visible-infrared person re-identification (VI-ReID) aims to match a specific person from a gallery of images captured from non-overlapping visible and infrared cameras.

Person Re-Identification Pseudo Label

Rethinking Gradient Projection Continual Learning: Stability / Plasticity Feature Space Decoupling

no code implementations • CVPR 2023 • Zhen Zhao, Zhizhong Zhang, Xin Tan, Jun Liu, Yanyun Qu, Yuan Xie, Lizhuang Ma

In this paper, we propose a space decoupling (SD) algorithm to decouple the feature space into a pair of complementary subspaces, i.e., the stability space I and the plasticity space R. I is established by conducting space intersection between the historic and current feature spaces, and thus I contains more task-shared bases.

Continual Learning

Image Understands Point Cloud: Weakly Supervised 3D Semantic Segmentation via Association Learning

no code implementations • 16 Sep 2022 • Tianfang Sun, Zhizhong Zhang, Xin Tan, Yanyun Qu, Yuan Xie, Lizhuang Ma

In this paper, we propose a novel cross-modality weakly supervised method for 3D segmentation, incorporating complementary information from unlabeled images.

3D Semantic Segmentation Pseudo Label +2

Attentive pooling for Group Activity Recognition

no code implementations • 31 Aug 2022 • Ding Li, Yuan Xie, Wensheng Zhang, Yongqiang Tang, Zhizhong Zhang

However, the existing methods simply employed max/average pooling in this framework, which ignored the distinct contributions of different individuals to the group activity recognition.

Group Activity Recognition

Boosting Night-time Scene Parsing with Learnable Frequency

1 code implementation • 30 Aug 2022 • Zhifeng Xie, Sen Wang, Ke Xu, Zhizhong Zhang, Xin Tan, Yuan Xie, Lizhuang Ma

Based on this, we propose to exploit the image frequency distributions for night-time scene parsing.

Autonomous Driving Scene Parsing

Variational Distillation for Multi-View Learning

3 code implementations • 20 Jun 2022 • Xudong Tian, Zhizhong Zhang, Cong Wang, Wensheng Zhang, Yanyun Qu, Lizhuang Ma, Zongze Wu, Yuan Xie, Dacheng Tao

Information Bottleneck (IB) based multi-view learning provides an information theoretic principle for seeking shared information contained in heterogeneous data descriptions.

Multi-View Learning Representation Learning

NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results

2 code implementations • 11 May 2022 • Yawei Li, Kai Zhang, Radu Timofte, Luc van Gool, Fangyuan Kong, Mingxi Li, Songwei Liu, Zongcai Du, Ding Liu, Chenhui Zhou, Jingyi Chen, Qingrui Han, Zheyuan Li, Yingqi Liu, Xiangyu Chen, Haoming Cai, Yu Qiao, Chao Dong, Long Sun, Jinshan Pan, Yi Zhu, Zhikai Zong, Xiaoxiao Liu, Zheng Hui, Tao Yang, Peiran Ren, Xuansong Xie, Xian-Sheng Hua, Yanbo Wang, Xiaozhong Ji, Chuming Lin, Donghao Luo, Ying Tai, Chengjie Wang, Zhizhong Zhang, Yuan Xie, Shen Cheng, Ziwei Luo, Lei Yu, Zhihong Wen, Qi Wu, Youwei Li, Haoqiang Fan, Jian Sun, Shuaicheng Liu, Yuanfei Huang, Meiguang Jin, Hua Huang, Jing Liu, Xinjian Zhang, Yan Wang, Lingshun Long, Gen Li, Yuanfan Zhang, Zuowei Cao, Lei Sun, Panaetov Alexander, Yucong Wang, Minjie Cai, Li Wang, Lu Tian, Zheyuan Wang, Hongbing Ma, Jie Liu, Chao Chen, Yidong Cai, Jie Tang, Gangshan Wu, Weiran Wang, Shirui Huang, Honglei Lu, Huan Liu, Keyan Wang, Jun Chen, Shi Chen, Yuchun Miao, Zimo Huang, Lefei Zhang, Mustafa Ayazoğlu, Wei Xiong, Chengyi Xiong, Fei Wang, Hao Li, Ruimian Wen, Zhijing Yang, Wenbin Zou, Weixin Zheng, Tian Ye, Yuncheng Zhang, Xiangzhen Kong, Aditya Arora, Syed Waqas Zamir, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Dandan Gao, Dengwen Zhou, Qian Ning, Jingzhu Tang, Han Huang, YuFei Wang, Zhangheng Peng, Haobo Li, Wenxue Guan, Shenghua Gong, Xin Li, Jun Liu, Wanjun Wang, Dengwen Zhou, Kun Zeng, Hanjiang Lin, Xinyu Chen, Jinsheng Fang

The aim was to design a network for single image super-resolution that improved efficiency, measured according to several metrics including runtime, parameters, FLOPs, activations, and memory consumption, while at least maintaining a PSNR of 29.00 dB on the DIV2K validation set.

Image Super-Resolution

116

Multi-scale 2D Representation Learning for weakly-supervised moment retrieval

no code implementations • 4 Nov 2021 • Ding Li, Rui Wu, Yongqiang Tang, Zhizhong Zhang, Wensheng Zhang

Specifically, we first construct a two-dimensional map for each temporal scale to capture the temporal dependencies between candidates.

Moment Retrieval Representation Learning +1

Towards Compact Single Image Super-Resolution via Contrastive Self-distillation

8 code implementations • 25 May 2021 • Yanbo Wang, Shaohui Lin, Yanyun Qu, Haiyan Wu, Zhizhong Zhang, Yuan Xie, Angela Yao

Convolutional neural networks (CNNs) are highly successful for super-resolution (SR) but often require sophisticated architectures with heavy memory cost and computational overhead, which significantly restricts their practical deployment on resource-limited devices.

Image Super-Resolution SSIM +1

334

Contrastive Learning for Compact Single Image Dehazing

7 code implementations • CVPR 2021 • Haiyan Wu, Yanyun Qu, Shaohui Lin, Jian Zhou, Ruizhi Qiao, Zhizhong Zhang, Yuan Xie, Lizhuang Ma

In this paper, we propose a novel contrastive regularization (CR) built upon contrastive learning to exploit both the information of hazy images and clear images as negative and positive samples, respectively.

Ranked #5 on Image Dehazing on RS-Haze

Contrastive Learning Image Dehazing +1

327

Farewell to Mutual Information: Variational Distillation for Cross-Modal Person Re-Identification

3 code implementations • CVPR 2021 • Xudong Tian, Zhizhong Zhang, Shaohui Lin, Yanyun Qu, Yuan Xie, Lizhuang Ma

The Information Bottleneck (IB) provides an information theoretic principle for representation learning, by retaining all information relevant for predicting the label while minimizing the redundancy.

Cross-Modality Person Re-identification Cross-Modal Person Re-Identification +3

334

Learn from Concepts: Towards the Purified Memory for Few-shot Learning

no code implementations • Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence 2021 • Xuncheng Liu, Xudong Tian, Shaohui Lin, Yanyun Qu, Lizhuang Ma, Wang Yuan, Zhizhong Zhang, Yuan Xie

In this paper, we present a novel purified memory mechanism that simulates the recognition process of human beings.

Few-Shot Learning

Field-free spin-orbit torque-induced switching of perpendicular magnetization in a ferrimagnetic layer with vertical composition gradient

no code implementations • 21 Jan 2021 • Zhenyi Zheng, Yue Zhang, Victor Lopez-Dominguez, Luis Sánchez-Tejerina, Jiacheng Shi, Xueqiang Feng, Lei Chen, Zilu Wang, Zhizhong Zhang, Kun Zhang, Bin Hong, Yong Xu, Youguang Zhang, Mario Carpentieri, Albert Fert, Giovanni Finocchio, Weisheng Zhao, Pedram Khalili Amiri

Existing methods to do so involve the application of an in-plane bias magnetic field, or incorporation of in-plane structural asymmetry in the device, both of which can be difficult to implement in practical applications.

Mesoscale and Nanoscale Physics

Effective Image Retrieval via Multilinear Multi-index Fusion

no code implementations • 27 Sep 2017 • Zhizhong Zhang, Yuan Xie, Wensheng Zhang, Qi Tian

In this paper, we propose a new multi-index fusion scheme for image retrieval.

Image Retrieval Retrieval
