Graph Convolutional Module for Temporal Action Localization in Videos

1 Dec 2021 · Runhao Zeng, Wenbing Huang, Mingkui Tan, Yu Rong, Peilin Zhao, Junzhou Huang, Chuang Gan

Temporal action localization has long been researched in computer vision. Existing state-of-the-art action localization methods divide each video into multiple action units (i.e., proposals in two-stage methods and segments in one-stage methods) and then perform action recognition/regression on each of them individually, without explicitly exploiting their relations during learning. In this paper, we claim that the relations between action units play an important role in action localization, and that a more powerful action detector should not only capture the local content of each action unit but also allow a wider field of view on the context related to it. To this end, we propose a general graph convolutional module (GCM) that can be easily plugged into existing action localization methods, including both two-stage and one-stage paradigms. Specifically, we first construct a graph in which each action unit is represented as a node and the relation between two action units as an edge. We use two types of relations: one capturing the temporal connections between different action units, and the other characterizing their semantic relationship. Particularly for the temporal connections in two-stage methods, we further explore two kinds of edges, one connecting overlapping action units and the other connecting surrounding but disjoint units. On the constructed graph, we then apply graph convolutional networks (GCNs) to model the relations among action units and learn more informative representations that enhance action localization. Experimental results show that our GCM consistently improves the performance of existing action localization methods, including two-stage methods (e.g., CBR and R-C3D) and one-stage methods (e.g., D-SSAD), verifying the generality and effectiveness of our GCM.
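The abstract describes the module only at a high level. Below is a minimal sketch of the idea, not the authors' implementation: it assumes PyTorch, represents each action unit by a (start, end) interval plus a feature vector, connects units with a temporal edge (overlapping units, via tIoU) and a semantic edge (cosine-similar features), and applies two graph-convolution layers over the resulting adjacency. The thresholds, dimensions, and the omission of the "surrounding but disjoint" edge variant are illustrative assumptions.

# Hedged sketch (not the paper's code): build an action-unit graph and apply a
# small GCN over the unit features, following the ideas in the abstract.
import torch
import torch.nn as nn
import torch.nn.functional as F

def temporal_iou(units):
    """Pairwise tIoU between action units given as (start, end) rows, shape (N, 2)."""
    start, end = units[:, 0], units[:, 1]
    inter = (torch.min(end[:, None], end[None, :])
             - torch.max(start[:, None], start[None, :])).clamp(min=0)
    union = (end - start)[:, None] + (end - start)[None, :] - inter
    return inter / union.clamp(min=1e-6)

def build_adjacency(units, feats, iou_thresh=0.0, sim_thresh=0.5):
    """Combine a temporal edge (overlapping units) and a semantic edge
    (cosine-similar features) into one row-normalized adjacency matrix.
    Thresholds are illustrative, not values from the paper."""
    a_temporal = (temporal_iou(units) > iou_thresh).float()
    sim = F.normalize(feats, dim=1) @ F.normalize(feats, dim=1).t()
    a_semantic = (sim > sim_thresh).float()
    adj = ((a_temporal + a_semantic) > 0).float()   # self-loops already on the diagonal
    return adj / adj.sum(dim=1, keepdim=True)       # row-normalize

class GCM(nn.Module):
    """Minimal graph convolutional module: two GCN layers plus a residual connection."""
    def __init__(self, in_dim, hidden_dim):
        super().__init__()
        self.fc1 = nn.Linear(in_dim, hidden_dim)
        self.fc2 = nn.Linear(hidden_dim, in_dim)

    def forward(self, feats, adj):
        h = F.relu(self.fc1(adj @ feats))           # aggregate neighbors, then transform
        return feats + self.fc2(adj @ h)            # enhanced action-unit features

# Usage example: 8 hypothetical action units with (start, end) times and 256-d features.
units = torch.rand(8, 2).sort(dim=1).values * 100
feats = torch.randn(8, 256)
gcm = GCM(in_dim=256, hidden_dim=512)
enhanced = gcm(feats, build_adjacency(units, feats))  # shape (8, 256)

In an actual detector, the enhanced features would replace the per-unit features fed to the classification and boundary-regression heads, which is where the plug-in nature of the module comes from.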


Results from the Paper


Task                           Dataset          Model   Metric         Value   Global Rank
Temporal Action Localization   ActivityNet-1.3  GCM     mAP IOU@0.5    51.03   #18
Temporal Action Localization   ActivityNet-1.3  GCM     mAP            34.24   #26
Temporal Action Localization   ActivityNet-1.3  GCM     mAP IOU@0.75   35.17   #17
Temporal Action Localization   ActivityNet-1.3  GCM     mAP IOU@0.95   7.44    #20
Temporal Action Localization   THUMOS’14        GCM     mAP IOU@0.5    51.9    #23
Temporal Action Localization   THUMOS’14        GCM     mAP IOU@0.1    72.5    #2
Temporal Action Localization   THUMOS’14        GCM     mAP IOU@0.2    70.9    #2
Temporal Action Localization   THUMOS’14        GCM     mAP IOU@0.3    66.5    #22
Temporal Action Localization   THUMOS’14        GCM     mAP IOU@0.4    60.8    #21
