no code implementations • 4 Dec 2023 • Dixuan Lin, Yixing Peng, Jingke Meng, Wei-Shi Zheng
In this work, we show the discrepancy between image-to-text association and text-to-image association and propose CADA (Cross-Modal Adaptive Dual Association), which builds fine-grained bidirectional image-text associations.
Ranked #1 on Text based Person Retrieval on RSTPReid (mAP metric)
no code implementations • 30 Aug 2023 • Dian Zheng, Xiao-Ming Wu, Zuhao Liu, Jingke Meng, Wei-Shi Zheng
Our method, termed DiffuVolume, treats the diffusion model as a cost volume filter that recurrently removes redundant information from the cost volume.
no code implementations • ICCV 2023 • An-Lan Wang, Kun-Yu Lin, Jia-Run Du, Jingke Meng, Wei-Shi Zheng
In this work, we focus on the task of procedure planning from instructional videos with text supervision, where a model aims to predict an action sequence to transform the initial visual state into the goal visual state.
no code implementations • CVPR 2019 • Jingke Meng, Sheng Wu, Wei-Shi Zheng
In the conventional person re-id setting, the labeled images are assumed to be the person images within the bounding box for each individual; producing this labeling across multiple non-overlapping camera views from raw surveillance video is costly and time-consuming.