STAIR Actions Captions

Introduced by Shigeto et al. in Video Caption Dataset for Describing Human Actions in Japanese

A large-scale Japanese video caption dataset consisting of 79,822 videos and 399,233 captions. Each caption in the dataset describes a video in the form of "who does what and where."

Source: Video Caption Dataset for Describing Human Actions in Japanese

Homepage

Benchmarks

Add a new result Link an existing benchmark

No benchmarks yet. Start a new benchmark or link an existing one.

Papers

Paper	Code	Results	Date	Stars

Dataset Loaders

Add Remove

No data loaders found. You can submit your data loader here.

Tasks

Similar Datasets

MSVD

Usage

License

Unknown

Modalities

Videos
Texts

Languages

Japanese

STAIR Actions Captions

Benchmarks Edit Add a new result Link an existing benchmark

Papers

Dataset Loaders Edit Add Remove

Tasks Edit