TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Weakly Supervised Action Localization	THUMOS14	EM-ML	avg-mAP (0.1-0.5)	44.9	# 12
Weakly Supervised Action Localization	THUMOS14	EM-ML	avg-mAP (0.3-0.7)	30.4	# 11
Weakly Supervised Action Localization	THUMOS14	EM-ML	avg-mAP (0.1:0.7)	37.7	# 12
Weakly Supervised Action Localization	THUMOS’14	EM-MIL	mAP@0.5	30.5	# 9

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/weakly-supervised-action-localization-with-2/weakly-supervised-action-localization-on-4)](https://paperswithcode.com/sota/weakly-supervised-action-localization-on-4?p=weakly-supervised-action-localization-with-2)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/weakly-supervised-action-localization-with-2/weakly-supervised-action-localization-on-5)](https://paperswithcode.com/sota/weakly-supervised-action-localization-on-5?p=weakly-supervised-action-localization-with-2)`

Weakly-Supervised Action Localization with Expectation-Maximization Multi-Instance Learning

ECCV 2020 · Zhekun Luo, Devin Guillory, Baifeng Shi, Wei Ke, Fang Wan, Trevor Darrell, Huijuan Xu ·

Weakly-supervised action localization requires training a model to localize the action segments in the video given only video level action label. It can be solved under the Multiple Instance Learning (MIL) framework, where a bag (video) contains multiple instances (action segments). Since only the bag's label is known, the main challenge is assigning which key instances within the bag to trigger the bag's label. Most previous models use attention-based approaches applying attentions to generate the bag's representation from instances, and then train it via the bag's classification. These models, however, implicitly violate the MIL assumption that instances in negative bags should be uniformly negative. In this work, we explicitly model the key instances assignment as a hidden variable and adopt an Expectation-Maximization (EM) framework. We derive two pseudo-label generation schemes to model the E and M process and iteratively optimize the likelihood lower bound. We show that our EM-MIL approach more accurately models both the learning objective and the MIL assumptions. It achieves state-of-the-art performance on two standard benchmarks, THUMOS14 and ActivityNet1.2.

PDF Abstract ECCV 2020 PDF ECCV 2020 Abstract

Code

Add Remove Mark official

airmachine/EM-MIL-WeaklyActionDetec… official

Tasks

Add Remove

Action Localization

Multiple Instance Learning

Pseudo Label

Weakly Supervised Action Localization

Datasets

THUMOS14

Results from the Paper

Edit

Ranked #9 on Weakly Supervised Action Localization on THUMOS’14

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Weakly Supervised Action Localization	THUMOS14	EM-ML	avg-mAP (0.1-0.5)	44.9	# 12	Compare
			avg-mAP (0.3-0.7)	30.4	# 11	Compare
			avg-mAP (0.1:0.7)	37.7	# 12	Compare
Weakly Supervised Action Localization	THUMOS’14	EM-MIL	mAP@0.5	30.5	# 9	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Weakly-Supervised Action Localization with Expectation-Maximization Multi-Instance Learning

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove