Search Results for author: Chunluan Zhou

Found 12 papers, 4 papers with code

M2-RAAP: A Multi-Modal Recipe for Advancing Adaptation-based Pre-training towards Effective and Efficient Zero-shot Video-text Retrieval

1 code implementation • 31 Jan 2024 • Xingning Dong, Zipeng Feng, Chunluan Zhou, Xuzheng Yu, Ming Yang, Qingpei Guo

We then summarize this empirical study into the M2-RAAP recipe, where our technical contributions lie in 1) the data filtering and text re-writing pipeline resulting in 1M high-quality bilingual video-text pairs, 2) the replacement of video inputs with key-frames to accelerate pre-training, and 3) the Auxiliary-Caption-Guided (ACG) strategy to enhance video features.

Retrieval Text Retrieval +1

Paper
Code

EVE: Efficient zero-shot text-based Video Editing with Depth Map Guidance and Temporal Consistency Constraints

1 code implementation • 21 Aug 2023 • Yutao Chen, Xingning Dong, Tian Gan, Chunluan Zhou, Ming Yang, Qingpei Guo

Compared with images, we conjecture that videos necessitate more constraints to preserve the temporal consistency during editing.

Video Editing

Paper
Code

Generalized Relation Modeling for Transformer Tracking

1 code implementation • CVPR 2023 • Shenyuan Gao, Chunluan Zhou, Jun Zhang

Compared with previous two-stream trackers, the recent one-stream tracking pipeline, which allows earlier interaction between the template and search region, has achieved a remarkable performance gain.

Relation

Paper
Code

SOAR: Scene-debiasing Open-set Action Recognition

no code implementations • ICCV 2023 • Yuanhao Zhai, Ziyi Liu, Zhenyu Wu, Yi Wu, Chunluan Zhou, David Doermann, Junsong Yuan, Gang Hua

Deep models have the risk of utilizing spurious clues to make predictions, e. g., recognizing actions via classifying the background scene.

Open Set Action Recognition Scene Classification

Paper
Add Code

AiATrack: Attention in Attention for Transformer Visual Tracking

1 code implementation • 20 Jul 2022 • Shenyuan Gao, Chunluan Zhou, Chao Ma, Xinggang Wang, Junsong Yuan

However, the independent correlation computation in the attention mechanism could result in noisy and ambiguous attention weights, which inhibits further performance improvement.

Ranked #2 on Visual Object Tracking on NeedForSpeed

Visual Object Tracking Visual Tracking

102

Paper
Code

Distilling Inter-Class Distance for Semantic Segmentation

no code implementations • 7 May 2022 • Zhengbo Zhang, Chunluan Zhou, Zhigang Tu

Knowledge distillation is widely adopted in semantic segmentation to reduce the computation cost. The previous knowledge distillation methods for semantic segmentation focus on pixel-wise feature alignment and intra-class feature variation distillation, neglecting to transfer the knowledge of the inter-class distance in the feature space, which is important for semantic segmentation.

Knowledge Distillation Position +2

Paper
Add Code

Learning Dynamics via Graph Neural Networks for Human Pose Estimation and Tracking

no code implementations • CVPR 2021 • Yiding Yang, Zhou Ren, Haoxiang Li, Chunluan Zhou, Xinchao Wang, Gang Hua

In this paper, we propose a novel online approach to learning the pose dynamics, which are independent of pose detections in current fame, and hence may serve as a robust estimation even in challenging scenarios including occlusion.

Multi-Person Pose Estimation Multi-Person Pose Estimation and Tracking +1

Paper
Add Code

Discriminative Feature Transformation for Occluded Pedestrian Detection

no code implementations • ICCV 2019 • Chunluan Zhou, Ming Yang, Junsong Yuan

Such a feature transformation partially compen- sates the missing contribution of occluded parts in feature space, therefore improving the performance for occluded pedestrian detection.

Pedestrian Detection

Paper
Add Code

Bi-box Regression for Pedestrian Detection and Occlusion Estimation

no code implementations • ECCV 2018 • Chunluan Zhou, Junsong Yuan

The full body estimation branch is trained to regress full body regions for positive pedestrian proposals, while the visible part estimation branch is trained to regress visible part regions for both positive and negative pedestrian proposals.

Occlusion Estimation Pedestrian Detection +1

Paper
Add Code

Actor-Action Semantic Segmentation with Region Masks

no code implementations • 23 Jul 2018 • Kang Dang, Chunluan Zhou, Zhigang Tu, Michael Hoy, Justin Dauwels, Junsong Yuan

One major challenge for this task is that when an actor performs an action, different body parts of the actor provide different types of cues for the action category and may receive inconsistent action labeling when they are labeled independently.

Action Segmentation Instance Segmentation +2

Paper
Add Code

Attention to Head Locations for Crowd Counting

no code implementations • 27 Jun 2018 • Youmei Zhang, Chunluan Zhou, Faliang Chang, Alex C. Kot

Occlusions, complex backgrounds, scale variations and non-uniform distributions present great challenges for crowd counting in practical applications.

Crowd Counting Density Estimation

Paper
Add Code

Multi-Label Learning of Part Detectors for Heavily Occluded Pedestrian Detection

no code implementations • ICCV 2017 • Chunluan Zhou, Junsong Yuan

Detecting pedestrians that are partially occluded remains a challenging problem due to variations and uncertainties of partial occlusion patterns.

Multi-Label Learning Pedestrian Detection

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.