ActivityNet Entities

Introduced by Zhou et al. in Grounded Video Description

ActivityNet-Entities augments the challenging ActivityNet Captions dataset with 158k bounding box annotations, each grounding a noun phrase. This allows video description models to be trained on this data and, importantly, makes it possible to evaluate how grounded, or "true", such models are to the video they describe.
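As an illustration of what evaluating grounding can look like, the sketch below compares a predicted box for a noun phrase with an annotated box using intersection-over-union (IoU). This is a minimal sketch, not the dataset's official evaluation code: the box format (x_min, y_min, x_max, y_max), the function names, and the 0.5 threshold are all assumptions made for the example.

```python
from typing import Tuple

# Assumed box format for this sketch: (x_min, y_min, x_max, y_max) in pixels.
Box = Tuple[float, float, float, float]


def box_iou(a: Box, b: Box) -> float:
    """Return the intersection-over-union of two axis-aligned boxes."""
    # Corners of the overlapping rectangle (may be empty).
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)

    area_a = max(0.0, a[2] - a[0]) * max(0.0, a[3] - a[1])
    area_b = max(0.0, b[2] - b[0]) * max(0.0, b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def is_grounded(pred: Box, gt: Box, threshold: float = 0.5) -> bool:
    """Treat a prediction as correctly grounded if IoU meets the threshold.

    The default of 0.5 is an assumed, commonly used convention.
    """
    return box_iou(pred, gt) >= threshold
```

For example, is_grounded((0, 0, 10, 10), (0, 0, 10, 9)) counts the prediction as grounded, since the two boxes overlap almost entirely.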

Source: https://github.com/facebookresearch/ActivityNet-Entities

Benchmarks

No benchmarks yet.

Dataset Loaders

facebookresearch/ActivityNet-Entities

154

Tasks

Image Generation
Video Description

Similar Datasets

Video Localized Narratives

ActivityNet Captions

Flickr30K Entities

YouCook

License

Unknown

Modalities

Videos
