TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Action Detection	THUMOS' 14	TadML-two stream	mAP	59.7	# 2
Action Detection	THUMOS' 14	TadML-rgb	mAP	53.46	# 4
Temporal Action Localization	THUMOS’14	TadML(two-stream)	mAP IOU@0.5	62.53	# 10
Temporal Action Localization	THUMOS’14	TadML(two-stream)	mAP IOU@0.3	73.29	# 11
Temporal Action Localization	THUMOS’14	TadML(two-stream)	mAP IOU@0.4	69.73	# 10
Temporal Action Localization	THUMOS’14	TadML(two-stream)	mAP IOU@0.6	53.36	# 9
Temporal Action Localization	THUMOS’14	TadML(two-stream)	mAP IOU@0.7	39.60	# 9
Temporal Action Localization	THUMOS’14	TadML(two-stream)	Avg mAP (0.3:0.7)	59.70	# 12
Temporal Action Localization	THUMOS’14	TadML(rgb-only)	mAP IOU@0.5	56.61	# 17
Temporal Action Localization	THUMOS’14	TadML(rgb-only)	mAP IOU@0.3	68.78	# 17
Temporal Action Localization	THUMOS’14	TadML(rgb-only)	mAP IOU@0.4	64.66	# 15
Temporal Action Localization	THUMOS’14	TadML(rgb-only)	mAP IOU@0.6	45.40	# 17
Temporal Action Localization	THUMOS’14	TadML(rgb-only)	mAP IOU@0.7	31.88	# 16
Temporal Action Localization	THUMOS’14	TadML(rgb-only)	Avg mAP (0.3:0.7)	53.46	# 18

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/tadml-a-fast-temporal-action-detection-with/action-detection-on-thumos-14)](https://paperswithcode.com/sota/action-detection-on-thumos-14?p=tadml-a-fast-temporal-action-detection-with)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/tadml-a-fast-temporal-action-detection-with/temporal-action-localization-on-thumos14)](https://paperswithcode.com/sota/temporal-action-localization-on-thumos14?p=tadml-a-fast-temporal-action-detection-with)`

TadML: A fast temporal action detection with Mechanics-MLP

7 Jun 2022 · Bowen Deng, Dongchang Liu ·

Temporal Action Detection(TAD) is a crucial but challenging task in video understanding.It is aimed at detecting both the type and start-end frame for each action instance in a long, untrimmed video.Most current models adopt both RGB and Optical-Flow streams for the TAD task. Thus, original RGB frames must be converted manually into Optical-Flow frames with additional computation and time cost, which is an obstacle to achieve real-time processing. At present, many models adopt two-stage strategies, which would slow the inference speed down and complicatedly tuning on proposals generating.By comparison, we propose a one-stage anchor-free temporal localization method with RGB stream only, in which a novel Newtonian Mechanics-MLP architecture is established. It has comparable accuracy with all existing state-of-the-art models, while surpasses the inference speed of these methods by a large margin. The typical inference speed in this paper is astounding 4.44 video per second on THUMOS14. In applications, because there is no need to convert optical flow, the inference speed will be faster.It also proves that MLP has great potential in downstream tasks such as TAD. The source code is available at https://github.com/BonedDeng/TadML

PDF Abstract

Code

Add Remove Mark official

boneddeng/tadml official

Tasks

Add Remove

Action Detection

Optical Flow Estimation

Temporal Action Localization

Temporal Localization

Datasets

UCF101

ActivityNet

THUMOS14

Results from the Paper

Edit

Ranked #2 on Action Detection on THUMOS' 14

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Action Detection	THUMOS' 14	TadML-two stream	mAP	59.7	# 2	Compare
Action Detection	THUMOS' 14	TadML-rgb	mAP	53.46	# 4	Compare
Temporal Action Localization	THUMOS’14	TadML(two-stream)	mAP IOU@0.5	62.53	# 10	Compare
			mAP IOU@0.3	73.29	# 11	Compare
			mAP IOU@0.4	69.73	# 10	Compare
			mAP IOU@0.6	53.36	# 9	Compare
			mAP IOU@0.7	39.60	# 9	Compare
			Avg mAP (0.3:0.7)	59.70	# 12	Compare
Temporal Action Localization	THUMOS’14	TadML(rgb-only)	mAP IOU@0.5	56.61	# 17	Compare
			mAP IOU@0.3	68.78	# 17	Compare
			mAP IOU@0.4	64.66	# 15	Compare
			mAP IOU@0.6	45.40	# 17	Compare
			mAP IOU@0.7	31.88	# 16	Compare
			Avg mAP (0.3:0.7)	53.46	# 18	Compare

Methods

Add Remove

Average Pooling • Dense Connections • Dropout • GELU • Global Average Pooling • Layer Normalization • MLP-Mixer • Residual Connection • SPEED

Edit Social Preview

TadML: A fast temporal action detection with Mechanics-MLP

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove