TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Temporal Action Localization	MUSES	MUSES	mAP@0.3	25.9	# 2
Temporal Action Localization	MUSES	MUSES	mAP@0.4	22.6	# 2
Temporal Action Localization	MUSES	MUSES	mAP@0.5	18.9	# 2
Temporal Action Localization	MUSES	MUSES	mAP@0.6	15.0	# 2
Temporal Action Localization	MUSES	MUSES	mAP@0.7	10.6	# 2
Temporal Action Localization	MUSES	MUSES	mAP	18.6	# 2
Temporal Action Localization	THUMOS’14	MUSES	mAP IOU@0.5	56.9	# 16
Temporal Action Localization	THUMOS’14	MUSES	mAP IOU@0.3	68.9	# 16
Temporal Action Localization	THUMOS’14	MUSES	mAP IOU@0.4	64.0	# 17
Temporal Action Localization	THUMOS’14	MUSES	mAP IOU@0.6	46.3	# 15
Temporal Action Localization	THUMOS’14	MUSES	mAP IOU@0.7	31.0	# 18
Temporal Action Localization	THUMOS’14	MUSES	Avg mAP (0.3:0.7)	53.4	# 19

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/multi-shot-temporal-event-localization-a/temporal-action-localization-on-muses)](https://paperswithcode.com/sota/temporal-action-localization-on-muses?p=multi-shot-temporal-event-localization-a)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/multi-shot-temporal-event-localization-a/temporal-action-localization-on-thumos14)](https://paperswithcode.com/sota/temporal-action-localization-on-thumos14?p=multi-shot-temporal-event-localization-a)`

Multi-shot Temporal Event Localization: a Benchmark

CVPR 2021 · Xiaolong Liu, Yao Hu, Song Bai, Fei Ding, Xiang Bai, Philip H. S. Torr ·

Current developments in temporal event or action localization usually target actions captured by a single camera. However, extensive events or actions in the wild may be captured as a sequence of shots by multiple cameras at different positions. In this paper, we propose a new and challenging task called multi-shot temporal event localization, and accordingly, collect a large scale dataset called MUlti-Shot EventS (MUSES). MUSES has 31,477 event instances for a total of 716 video hours. The core nature of MUSES is the frequent shot cuts, for an average of 19 shots per instance and 176 shots per video, which induces large intrainstance variations. Our comprehensive evaluations show that the state-of-the-art method in temporal action localization only achieves an mAP of 13.1% at IoU=0.5. As a minor contribution, we present a simple baseline approach for handling the intra-instance variations, which reports an mAP of 18.9% on MUSES and 56.9% on THUMOS14 at IoU=0.5. To facilitate research in this direction, we release the dataset and the project code at https://songbai.site/muses/ .

PDF Abstract CVPR 2021 PDF CVPR 2021 Abstract

Code

Add Remove Mark official

xlliu7/muses official

Tasks

Add Remove

Action Localization

Temporal Action Localization

Datasets

Introduced in the Paper:

MUSES

Used in the Paper:

ActivityNet

THUMOS14

HACS

Results from the Paper

Edit

Ranked #2 on Temporal Action Localization on MUSES

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Temporal Action Localization	MUSES	MUSES	mAP@0.3	25.9	# 2	Compare
			mAP@0.4	22.6	# 2	Compare
			mAP@0.5	18.9	# 2	Compare
			mAP@0.6	15.0	# 2	Compare
			mAP@0.7	10.6	# 2	Compare
			mAP	18.6	# 2	Compare
Temporal Action Localization	THUMOS’14	MUSES	mAP IOU@0.5	56.9	# 16	Compare
			mAP IOU@0.3	68.9	# 16	Compare
			mAP IOU@0.4	64.0	# 17	Compare
			mAP IOU@0.6	46.3	# 15	Compare
			mAP IOU@0.7	31.0	# 18	Compare
			Avg mAP (0.3:0.7)	53.4	# 19	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Multi-shot Temporal Event Localization: a Benchmark

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove