no code implementations • 16 Mar 2020 • Yijun Song, Jingwen Wang, Lin Ma, Zhou Yu, Jun Yu
The task of temporally grounding textual queries in videos is to localize one video segment that semantically corresponds to the given query.
Sentence