M-VAD Names (M-VAD Names Dataset)

Introduced by Pini et al. in M-VAD Names: a Dataset for Video Captioning with Naming

The dataset annotates characters' visual appearances as tracks of face bounding boxes and, where available, associates those tracks with the characters' textual mentions. The visual appearances of characters in every video clip of every movie were detected and annotated with a semi-automatic approach. The released dataset contains more than 24k annotated video clips, including 63k visual tracks and 34k textual mentions, all associated with their character identities.
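
To make the annotation structure concrete, here is a minimal Python sketch of how one annotated clip could be represented in memory. It is not the dataset's actual file format or loader: the class names (`FaceBox`, `FaceTrack`, `Mention`, `Clip`), their fields (including the `caption` text field), and the helper `link_mentions_to_tracks` are all hypothetical and chosen only for illustration. It assumes that each track and each mention carries a character identity, as the description states.

```python
from collections import defaultdict
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class FaceBox:
    """One face bounding box in one frame (hypothetical layout)."""
    frame: int   # frame index within the clip
    x: float     # top-left corner
    y: float
    w: float     # box width
    h: float     # box height


@dataclass
class FaceTrack:
    """A sequence of face boxes belonging to one character."""
    character: str
    boxes: List[FaceBox] = field(default_factory=list)


@dataclass
class Mention:
    """A textual mention of a character."""
    character: str
    text: str    # surface form as it appears in the text


@dataclass
class Clip:
    """One annotated video clip with its tracks and mentions."""
    clip_id: str
    caption: str  # assumed text field that the mentions come from
    tracks: List[FaceTrack] = field(default_factory=list)
    mentions: List[Mention] = field(default_factory=list)


def link_mentions_to_tracks(clip: Clip) -> Dict[str, List[FaceTrack]]:
    """Map each mention's text to the tracks sharing its character identity."""
    by_character = defaultdict(list)
    for track in clip.tracks:
        by_character[track.character].append(track)
    # A mention with no matching visual track maps to an empty list.
    return {m.text: by_character.get(m.character, []) for m in clip.mentions}
```

Keying both tracks and mentions on a shared character identity reflects the "when available" wording above: a character may appear on screen without being mentioned, and a mention may have no matching track in the same clip.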

Source: M-VAD Names Dataset


