Language-Based Temporal Localization
5 papers with code • 1 benchmarks • 1 datasets
Latest papers
Can Shuffling Video Benefit Temporal Bias Problem: A Novel Training Framework for Temporal Grounding
Our framework introduces two auxiliary tasks, cross-modal matching and temporal order discrimination, to promote the grounding model training.
TubeDETR: Spatio-Temporal Video Grounding with Transformers
We consider the problem of localizing a spatio-temporal tube in a video corresponding to a given text query.
Hierarchical Deep Residual Reasoning for Temporal Moment Localization
Temporal Moment Localization (TML) in untrimmed videos is a challenging task in the field of multimedia, which aims at localizing the start and end points of the activity in the video, described by a sentence query.
Video Moment Localization using Object Evidence and Reverse Captioning
We address the problem of language-based temporal localization of moments in untrimmed videos.
MAC: Mining Activity Concepts for Language-based Temporal Localization
Previous methods address the problem by considering features from video sliding windows and language queries and learning a subspace to encode their correlation, which ignore rich semantic cues about activities in videos and queries.