2 code implementations • 29 Oct 2021 • Ning Han, Jingjing Chen, Chuhao Shi, Yawen Zeng, Guangyi Xiao, Hao Chen
The task of text-video retrieval aims to understand the correspondence between language and vision, has gained increasing attention in recent years.