TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Weakly Supervised Action Localization	ActivityNet-1.3	MAAN	mAP@0.5	33.7	# 11
Weakly Supervised Action Localization	THUMOS 2014	MAAN	mAP@0.5	20.3	# 23
Weakly Supervised Action Localization	THUMOS 2014	MAAN	mAP@0.1:0.7	31.6	# 19

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/marginalized-average-attentional-network-for-1/weakly-supervised-action-localization-on-1)](https://paperswithcode.com/sota/weakly-supervised-action-localization-on-1?p=marginalized-average-attentional-network-for-1)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/marginalized-average-attentional-network-for-1/weakly-supervised-action-localization-on)](https://paperswithcode.com/sota/weakly-supervised-action-localization-on?p=marginalized-average-attentional-network-for-1)`

Marginalized Average Attentional Network for Weakly-Supervised Learning

ICLR 2019 · Yuan Yuan, Yueming Lyu, Xi Shen, Ivor W. Tsang, Dit-yan Yeung ·

In weakly-supervised temporal action localization, previous works have failed to locate dense and integral regions for each entire action due to the overestimation of the most salient regions. To alleviate this issue, we propose a marginalized average attentional network (MAAN) to suppress the dominant response of the most salient regions in a principled manner. The MAAN employs a novel marginalized average aggregation (MAA) module and learns a set of latent discriminative probabilities in an end-to-end fashion. MAA samples multiple subsets from the video snippet features according to a set of latent discriminative probabilities and takes the expectation over all the averaged subset features. Theoretically, we prove that the MAA module with learned latent discriminative probabilities successfully reduces the difference in responses between the most salient regions and the others. Therefore, MAAN is able to generate better class activation sequences and identify dense and integral action regions in the videos. Moreover, we propose a fast algorithm to reduce the complexity of constructing MAA from O($2^T$) to O($T^2$). Extensive experiments on two large-scale video datasets show that our MAAN achieves superior performance on weakly-supervised temporal action localization

PDF Abstract ICLR 2019 PDF ICLR 2019 Abstract

Code

Add Remove Mark official

No code implementations yet. Submit your code now

Tasks

Add Remove

Action Localization

Temporal Action Localization

Weakly Supervised Action Localization

Weakly-supervised Learning

Weakly-supervised Temporal Action Localization

Weakly Supervised Temporal Action Localization

Datasets

ActivityNet

THUMOS14

Results from the Paper

Edit

Ranked #11 on Weakly Supervised Action Localization on ActivityNet-1.3 (mAP@0.5 metric)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Weakly Supervised Action Localization	ActivityNet-1.3	MAAN	mAP@0.5	33.7	# 11	Compare
Weakly Supervised Action Localization	THUMOS 2014	MAAN	mAP@0.5	20.3	# 23	Compare
Weakly Supervised Action Localization	THUMOS 2014	MAAN	mAP@0.1:0.7	31.6	# 19	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Marginalized Average Attentional Network for Weakly-Supervised Learning

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove