Search Results for author: Guozhen Zhang

Found 6 papers, 2 papers with code

Dual DETRs for Multi-Label Temporal Action Detection

no code implementations31 Mar 2024 Yuhan Zhu, Guozhen Zhang, Jing Tan, Gangshan Wu, LiMin Wang

To address this issue, we propose a new Dual-level query-based TAD framework, namely DualDETR, to detect actions from both instance-level and boundary-level.

Action Detection object-detection +1

StableDrag: Stable Dragging for Point-based Image Editing

no code implementations7 Mar 2024 Yutao Cui, Xiaotong Zhao, Guozhen Zhang, Shengming Cao, Kai Ma, LiMin Wang

Point-based image editing has attracted remarkable attention since the emergence of DragGAN.

Point Tracking

MGMAE: Motion Guided Masking for Video Masked Autoencoding

1 code implementation ICCV 2023 Bingkun Huang, Zhiyu Zhao, Guozhen Zhang, Yu Qiao, LiMin Wang

Based on this masking volume, we can track the unmasked tokens in time and sample a set of temporal consistent cubes from videos.

Optical Flow Estimation Representation Learning

DPL: Decoupled Prompt Learning for Vision-Language Models

no code implementations19 Aug 2023 Chen Xu, Yuhan Zhu, Guozhen Zhang, Haocheng Shen, Yixuan Liao, Xiaoxin Chen, Gangshan Wu, LiMin Wang

Prompt learning has emerged as an efficient and effective approach for transferring foundational Vision-Language Models (e. g., CLIP) to downstream tasks.

Cannot find the paper you are looking for? You can Submit a new open access paper.