HC-STVG1 (Human-centric Spatio-Temporal Video Grounding)

Introduced by Tang et al. in Human-centric Spatio-Temporal Video Grounding With Visual Transformers

The newly proposed HC-STVG task aims to localize the target person spatio-temporally in an untrimmed video. For this task, we collect a new benchmark dataset, which has spatio temporal annotations related to the target persons in complex multi-person scenes, together with full interaction and rich action information.

Papers


Paper Code Results Date Stars

Dataset Loaders


No data loaders found. You can submit your data loader here.

Tasks


Similar Datasets


License


  • Unknown

Modalities


Languages