MUSES (MUlti-Shot EventS)

Introduced by Liu et al. in Multi-shot Temporal Event Localization: a Benchmark

MUSES is a large-scale dataset for temporal event (action) localization. It focuses on the temporal localization of multi-shot events, which are captured with multiple shots. Such events often appear in edited videos, such as TV shows and movies.

What’s included in MUSES:

3,697 videos of TV and movie dramas
716 hours of duration
25 event categories
652k shots
31,477 annotated event instances

Homepage

Benchmarks

Add a new result Link an existing benchmark

Trend	Task	Dataset Variant	Best Model	Paper	Code
	Temporal Action Localization	MUSES	TemporalMaxer

Papers

Paper	Code	Results	Date	Stars

Dataset Loaders

Add Remove

No data loaders found. You can submit your data loader here.

Tasks

Temporal Action Localization

Similar Datasets

Anatomy of Video Editing (AVE)

THUMOS14

Usage

License

Unknown

MUSES (MUlti-Shot EventS)

Benchmarks

Add a new result Link an existing benchmark

Papers

Dataset Loaders

Add Remove

Tasks

Similar Datasets

Anatomy of Video Editing (AVE)

THUMOS14

Usage

License

Modalities

Languages

MUSES (MUlti-Shot EventS)

Benchmarks Edit Add a new result Link an existing benchmark

Papers

Dataset Loaders Edit Add Remove

Tasks Edit

Similar Datasets

Anatomy of Video Editing (AVE)

THUMOS14

Usage

License Edit

Modalities Edit

Languages Edit

Benchmarks

Add a new result Link an existing benchmark

Dataset Loaders

Add Remove

Tasks

License

Modalities

Languages