no code implementations • 7 Jan 2022 • Li Haopeng, Ke Qiuhong, Gong Mingming, Tom Drummond
Because annotating large-scale datasets is time-consuming, we propose a multimodal self-supervised learning framework that learns semantic representations of videos to benefit the video summarization task.
1 code implementation • 27 Dec 2021 • Li Haopeng, Ke Qiuhong, Gong Mingming, Zhang Rui
Video summarization aims to automatically generate a summary (storyboard or video skim) of a video, which can facilitate large-scale video retrieval and browsing.
1 code implementation • 19 Dec 2021 • Yoo Hongsang, Li Haopeng, Ke Qiuhong, Liu Liangchen, Zhang Rui
In this paper, we propose to model causal relationships based on preconditions and effects to improve the performance of action recognition.