TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK	EXTRA DATA	REMOVE
Spatio-Temporal Action Localization	AVA-Kinetics	RM (multi-scale, ensemble)	val mAP	40.52	# 4
Spatio-Temporal Action Localization	AVA-Kinetics	RM (multi-scale, ir-CSN-152)	val mAP	37.95	# 6

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/relation-modeling-in-spatio-temporal-action/spatio-temporal-action-localization-on-ava)](https://paperswithcode.com/sota/spatio-temporal-action-localization-on-ava?p=relation-modeling-in-spatio-temporal-action)`

Relation Modeling in Spatio-Temporal Action Localization

15 Jun 2021 · Yutong Feng, Jianwen Jiang, Ziyuan Huang, Zhiwu Qing, Xiang Wang, Shiwei Zhang, Mingqian Tang, Yue Gao ·

This paper presents our solution to the AVA-Kinetics Crossover Challenge of ActivityNet workshop at CVPR 2021. Our solution utilizes multiple types of relation modeling methods for spatio-temporal action detection and adopts a training strategy to integrate multiple relation modeling in end-to-end training over the two large-scale video datasets. Learning with memory bank and finetuning for long-tailed distribution are also investigated to further improve the performance. In this paper, we detail the implementations of our solution and provide experiments results and corresponding discussions. We finally achieve 40.67 mAP on the test set of AVA-Kinetics.

PDF Abstract

Code

Add Remove Mark official

No code implementations yet. Submit your code now

Tasks

Add Remove

Action Detection

Action Localization

Relation

Spatio-Temporal Action Localization

Temporal Action Localization

Datasets

Kinetics

AVA

Results from the Paper

Edit

Ranked #4 on Spatio-Temporal Action Localization on AVA-Kinetics (using extra training data)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Uses Extra Training Data	Result	Benchmark
Spatio-Temporal Action Localization	AVA-Kinetics	RM (multi-scale, ensemble)	val mAP	40.52	# 4			Compare
Spatio-Temporal Action Localization	AVA-Kinetics	RM (multi-scale, ir-CSN-152)	val mAP	37.95	# 6			Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Relation Modeling in Spatio-Temporal Action Localization

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove