no code implementations • 18 Apr 2024 • Han Fang, Xianghao Zang, Chao Ban, Zerun Feng, Lanxiang Zhou, Zhongjiang He, Yongxiang Li, Hao Sun
Text-video retrieval aims to find the most relevant cross-modal samples for a given query.
no code implementations • 13 May 2023 • Han Fang, Zhifei Yang, Xianghao Zang, Chao Ban, Hao Sun
Specifically, after applying attention-based video masking to generate high-informed and low-informed masks, we propose Informed Semantics Completion to recover masked semantics information.
no code implementations • 18 Jun 2021 • Baoming Yan, Lin Wang, Ke Gao, Bo Gao, Xiao Liu, Chao Ban, Jiang Yang, Xiaobo Li
Video affective understanding, which aims to predict the evoked expressions by the video content, is desired for video creation and recommendation.