VTC (Videos, Titles and Comments)

Introduced by Hanu et al. in VTC: Improving Video-Text Retrieval with User Comments

VTC is a large-scale multimodal dataset of roughly 300k video-caption pairs, together with user comments, intended for multimodal representation learning.
