Graph-Based High-Order Relation Modeling for Long-Term Action Recognition

CVPR 2021  ·  Jiaming Zhou, Kun-Yu Lin, Haoxin Li, Wei-Shi Zheng

Long-term actions involve many important visual concepts, e.g., objects, motions, and sub-actions, and there are various relations among these concepts, which we call basic relations. These basic relations jointly affect each other during the temporal evolution of long-term actions, forming the high-order relations that are essential for long-term action recognition. In this paper, we propose a Graph-based High-order Relation Modeling (GHRM) module to exploit the high-order relations in long-term actions for long-term action recognition. In GHRM, each basic relation is modeled by a graph, where each node represents a segment of a long video. Moreover, when modeling each basic relation, GHRM incorporates information from all the other basic relations, so that the high-order relations in long-term actions can be well exploited. To better exploit the high-order relations along the time dimension, we design a GHRM-layer consisting of a Temporal-GHRM branch and a Semantic-GHRM branch, which model local temporal high-order relations and global semantic high-order relations, respectively. Experimental results on three long-term action recognition datasets, namely Breakfast, Charades, and MultiTHUMOS, demonstrate the effectiveness of our model.
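To make the two-branch GHRM-layer described above concrete, below is a minimal PyTorch sketch of one plausible reading of the abstract. It is not the authors' implementation: the class names (`TemporalGHRM`, `SemanticGHRM`, `GHRMLayer`), the banded adjacency for the temporal branch, the similarity-based adjacency for the semantic branch, and the concatenate-and-fuse coupling across basic relations are all illustrative assumptions.

```python
# Minimal sketch of the GHRM-layer idea (assumptions noted above; not the
# authors' code). Each basic relation is a graph whose nodes are video
# segments; the temporal branch uses a fixed local (banded) adjacency, while
# the semantic branch infers a global adjacency from feature similarity.

import torch
import torch.nn as nn
import torch.nn.functional as F


class TemporalGHRM(nn.Module):
    """Local temporal branch: each segment aggregates a small window of
    neighbouring segments via a fixed banded adjacency (assumption)."""

    def __init__(self, dim, window=2):
        super().__init__()
        self.window = window
        self.proj = nn.Linear(dim, dim)

    def forward(self, x):                      # x: (batch, segments, dim)
        t = x.size(1)
        idx = torch.arange(t, device=x.device)
        adj = ((idx[None, :] - idx[:, None]).abs() <= self.window).float()
        adj = adj / adj.sum(-1, keepdim=True)  # row-normalise the graph
        return F.relu(self.proj(adj @ x))      # aggregate neighbours, then project


class SemanticGHRM(nn.Module):
    """Global semantic branch: adjacency comes from pairwise feature
    similarity, linking related segments regardless of temporal distance
    (assumption)."""

    def __init__(self, dim):
        super().__init__()
        self.query = nn.Linear(dim, dim)
        self.key = nn.Linear(dim, dim)
        self.proj = nn.Linear(dim, dim)

    def forward(self, x):                      # x: (batch, segments, dim)
        q, k = self.query(x), self.key(x)
        adj = torch.softmax(q @ k.transpose(1, 2) / q.size(-1) ** 0.5, dim=-1)
        return F.relu(self.proj(adj @ x))


class GHRMLayer(nn.Module):
    """One GHRM-layer: a graph per basic relation in each branch, with each
    relation's update conditioned on a fused summary of all relations (our
    reading of 'high-order'; the paper's exact coupling may differ)."""

    def __init__(self, dim, num_relations=3):
        super().__init__()
        self.temporal = nn.ModuleList(TemporalGHRM(dim) for _ in range(num_relations))
        self.semantic = nn.ModuleList(SemanticGHRM(dim) for _ in range(num_relations))
        self.fuse = nn.Linear(dim * num_relations, dim)

    def _branch(self, branch, feats):
        # feats: list of per-relation features, each (batch, segments, dim).
        # Inject information from all other relations before each graph update.
        shared = self.fuse(torch.cat(feats, dim=-1))
        return [g(f + shared) for g, f in zip(branch, feats)]

    def forward(self, feats):
        feats = self._branch(self.temporal, feats)   # local temporal relations
        feats = self._branch(self.semantic, feats)   # global semantic relations
        return feats


if __name__ == "__main__":
    # Toy usage: 3 basic relations (e.g. objects / motions / sub-actions),
    # videos of 32 segments with 256-d features per segment.
    layer = GHRMLayer(dim=256, num_relations=3)
    feats = [torch.randn(2, 32, 256) for _ in range(3)]
    out = layer(feats)
    print(out[0].shape)  # torch.Size([2, 32, 256])
```

The point this sketch tries to mirror is that every relation's graph update sees a fused summary of all the other relations, which is the sense in which the modeling is "high-order" rather than a set of independent per-relation graphs.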


Results from the Paper


| Task | Dataset | Model | Metric | Value | Global Rank |
|------|---------|-------|--------|-------|-------------|
| Video Classification | Breakfast | GHRM | Accuracy (%) | 75.5 | #5 |
| Long-video Activity Recognition | Breakfast | GHRM (I3D-K400-Pretrain-feature) | mAP | 65.86 | #5 |
