Temporal Reasoning Graph for Activity Recognition

27 Aug 2019  ·  Jingran Zhang, Fumin Shen, Xing Xu, Heng Tao Shen

Despite the great success achieved in activity analysis, many challenges remain. Most existing work on activity recognition focuses on designing efficient architectures or video sampling strategies. However, because activities involve fine-grained actions and long-term structure in video, activity recognition also requires reasoning about temporal relations between video sequences. In this paper, we propose an efficient temporal reasoning graph (TRG) that simultaneously captures appearance features and temporal relations between video sequences at multiple time scales. Specifically, we construct learnable temporal relation graphs to explore temporal relations over multi-scale ranges. Additionally, to facilitate multi-scale temporal relation extraction, we design a multi-head temporal adjacency matrix to represent multiple kinds of temporal relations. Finally, a multi-head temporal relation aggregator is proposed to extract the semantic meaning of the features convolved through the graphs. Extensive experiments on widely used large-scale datasets, such as Something-Something and Charades, show that our model achieves state-of-the-art performance. Further analysis shows that temporal relation reasoning with our TRG extracts discriminative features for activity recognition.
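The core computation described above — propagating per-frame features through several learnable temporal adjacency matrices (one per head) and then aggregating the heads — can be sketched as follows. This is a minimal NumPy illustration under assumed shapes and names (`multi_head_temporal_graph`, mean aggregation, row-softmax normalization are illustrative choices, not the paper's exact formulation):

```python
import numpy as np

def softmax(z, axis=-1):
    """Numerically stable softmax along the given axis."""
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def multi_head_temporal_graph(x, adj_logits, w):
    """Sketch of a multi-head temporal relation graph layer.

    x          : (T, D)    per-frame appearance features
    adj_logits : (H, T, T) learnable multi-head temporal adjacency (pre-softmax)
    w          : (H, D, D) per-head feature transform
    returns    : (T, D)    head-aggregated, relation-enhanced features
    """
    # Row-normalize each head's adjacency so every frame's incoming
    # temporal relations form a distribution over the other frames.
    adj = softmax(adj_logits, axis=-1)
    # Per head: propagate features along the temporal graph, then transform.
    heads = np.einsum('hts,sd,hde->hte', adj, x, w)
    # Aggregate heads; a mean is used here as one simple choice.
    return heads.mean(axis=0)

rng = np.random.default_rng(0)
T, D, H = 8, 16, 4  # frames, feature dim, relation heads (illustrative sizes)
x = rng.standard_normal((T, D))
out = multi_head_temporal_graph(
    x,
    rng.standard_normal((H, T, T)),
    rng.standard_normal((H, D, D)),
)
print(out.shape)  # (8, 16)
```

Each head learns its own adjacency, so different heads can capture different kinds (and scales) of temporal relations before the aggregator fuses them.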

Task | Dataset | Model | Metric | Value | Global Rank
Action Recognition | Something-Something V1 | TRG (ResNet-50) | Top 1 Accuracy | 49.5 | #55
Action Recognition | Something-Something V1 | TRG (ResNet-50) | Top 5 Accuracy | 86.1 | #6
Action Recognition | Something-Something V1 | TRG (Inception-V3) | Top 1 Accuracy | 49.7 | #53
Action Recognition | Something-Something V2 | TRG (Inception-V3) | Top-1 Accuracy | 61.3 | #108
Action Recognition | Something-Something V2 | TRG (Inception-V3) | Top-5 Accuracy | 91.4 | #37
Action Recognition | Something-Something V2 | TRG (ResNet-50) | Top-1 Accuracy | 62.2 | #104
Action Recognition | Something-Something V2 | TRG (ResNet-50) | Top-5 Accuracy | 90.3 | #59