An Empirical Study of End-to-End Temporal Action Detection

CVPR 2022  ·  Xiaolong Liu, Song Bai, Xiang Bai

Temporal action detection (TAD) is an important yet challenging task in video understanding. It aims to simultaneously predict the semantic label and the temporal interval of every action instance in an untrimmed video. Rather than learning end-to-end, most existing methods adopt a head-only learning paradigm, where the video encoder is pre-trained for action classification and only the detection head on top of the encoder is optimized for TAD. The effect of end-to-end learning has not been systematically evaluated, and the efficiency-accuracy trade-off in end-to-end TAD lacks an in-depth study. In this paper, we present an empirical study of end-to-end temporal action detection. We validate the advantage of end-to-end learning over head-only learning and observe up to 11\% performance improvement. We also study the effects of multiple design choices that affect TAD performance and speed, including the detection head, the video encoder, and the resolution of input videos. Based on these findings, we build a mid-resolution baseline detector, which achieves state-of-the-art performance among end-to-end methods while running more than 4$\times$ faster. We hope that this paper can serve as a guide for end-to-end learning and inspire future research in this field. Code and models are available at \url{https://github.com/xlliu7/E2E-TAD}.
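The head-only vs. end-to-end distinction above comes down to which parameters the optimizer sees: head-only learning freezes the pre-trained video encoder and updates only the detection head, while end-to-end learning updates both jointly. A minimal sketch of that difference, with illustrative names and toy parameter counts (not the paper's actual code or model sizes):

```python
# Hedged sketch: head-only vs. end-to-end parameter selection for TAD training.
# `Module`, the parameter counts, and the freezing mechanism are illustrative
# stand-ins for a deep-learning framework's modules and requires_grad flags.

class Module:
    def __init__(self, n_params):
        self.params = [0.0] * n_params
        self.trainable = True

def trainable_params(modules):
    """Collect the parameters the optimizer is allowed to update."""
    out = []
    for m in modules:
        if m.trainable:
            out.extend(m.params)
    return out

encoder = Module(n_params=100)  # video encoder, pre-trained for classification
head = Module(n_params=10)      # detection head on top of the encoder

# Head-only learning: freeze the encoder, optimize only the head.
encoder.trainable = False
head_only = trainable_params([encoder, head])

# End-to-end learning: optimize encoder and head jointly.
encoder.trainable = True
end_to_end = trainable_params([encoder, head])

print(len(head_only), len(end_to_end))  # 10 vs. 110 toy parameters
```

End-to-end learning optimizes far more parameters (and must back-propagate through the video frames), which is exactly why the paper's efficiency-accuracy trade-off question arises.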

| Task | Dataset | Model | Metric | Value | Global Rank |
| --- | --- | --- | --- | --- | --- |
| Temporal Action Localization | ActivityNet-1.3 | E2E-TAD (SlowFast R50+TadTR) | mAP IoU@0.5 | 50.47 | #17 |
| Temporal Action Localization | ActivityNet-1.3 | E2E-TAD (SlowFast R50+TadTR) | mAP | 35.10 | #12 |
| Temporal Action Localization | ActivityNet-1.3 | E2E-TAD (SlowFast R50+TadTR) | mAP IoU@0.75 | 35.99 | #7 |
| Temporal Action Localization | ActivityNet-1.3 | E2E-TAD (SlowFast R50+TadTR) | mAP IoU@0.95 | 10.83 | #1 |
| Temporal Action Localization | THUMOS'14 | E2E-TAD (SlowFast R50+TadTR) | mAP IoU@0.3 | 69.4 | #5 |
| Temporal Action Localization | THUMOS'14 | E2E-TAD (SlowFast R50+TadTR) | mAP IoU@0.4 | 64.3 | #7 |
| Temporal Action Localization | THUMOS'14 | E2E-TAD (SlowFast R50+TadTR) | mAP IoU@0.5 | 56.0 | #9 |
| Temporal Action Localization | THUMOS'14 | E2E-TAD (SlowFast R50+TadTR) | mAP IoU@0.6 | 46.4 | #6 |
| Temporal Action Localization | THUMOS'14 | E2E-TAD (SlowFast R50+TadTR) | mAP IoU@0.7 | 34.9 | #4 |
| Temporal Action Localization | THUMOS'14 | E2E-TAD (SlowFast R50+TadTR) | Avg mAP (0.3:0.7) | 54.2 | #6 |
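The metrics above are mAP computed at temporal IoU thresholds, with the THUMOS'14 average taken over the five thresholds 0.3 to 0.7. A short sketch of the temporal IoU between two 1-D segments, plus a check that averaging the paper's reported per-threshold mAPs reproduces the Avg mAP row (the `temporal_iou` helper is illustrative, not the official evaluation code):

```python
def temporal_iou(seg_a, seg_b):
    """IoU between two temporal segments given as (start, end) times."""
    (s1, e1), (s2, e2) = seg_a, seg_b
    inter = max(0.0, min(e1, e2) - max(s1, s2))
    union = (e1 - s1) + (e2 - s2) - inter
    return inter / union if union > 0 else 0.0

# Two overlapping 10 s segments sharing 5 s: IoU = 5 / 15.
print(temporal_iou((0.0, 10.0), (5.0, 15.0)))  # 0.333...

# Avg mAP (0.3:0.7) averages mAP over five IoU thresholds; the values
# below are the per-threshold THUMOS'14 results reported in the table.
maps = {0.3: 69.4, 0.4: 64.3, 0.5: 56.0, 0.6: 46.4, 0.7: 34.9}
avg_map = sum(maps.values()) / len(maps)
print(round(avg_map, 1))  # 54.2, matching the reported Avg mAP row
```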
