TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Action Classification	Charades	DEEP-HAL with ODF+SDF (I3D)	MAP	50.16	# 11
Action Classification	Charades	DEEP-HAL with ODF+SDF (AssembleNet++)	MAP	62.29	# 4
Egocentric Activity Recognition	EPIC-KITCHENS-55	DEEP-HAL with ODF+SDF (AssembleNet++)	Actions Top-1 (S2)	27.3	# 1
Egocentric Activity Recognition	EPIC-KITCHENS-55	DEEP-HAL with ODF+SDF (AssembleNet++)	Actions Top-1 (S1)	35.8	# 1
Action Recognition	HMDB-51	DEEP-HAL with ODF+SDF(I3D)	Average accuracy of 3 splits	87.56	# 2
Scene Recognition	YUP++	DEEP-HAL with ODF+SDF (I3D)	Accuracy (%)	94.4	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/hallucinating-statistical-moment-and-subspace/egocentric-activity-recognition-on-epic-1)](https://paperswithcode.com/sota/egocentric-activity-recognition-on-epic-1?p=hallucinating-statistical-moment-and-subspace)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/hallucinating-statistical-moment-and-subspace/scene-recognition-on-yup)](https://paperswithcode.com/sota/scene-recognition-on-yup?p=hallucinating-statistical-moment-and-subspace)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/hallucinating-statistical-moment-and-subspace/action-recognition-in-videos-on-hmdb-51)](https://paperswithcode.com/sota/action-recognition-in-videos-on-hmdb-51?p=hallucinating-statistical-moment-and-subspace)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/hallucinating-statistical-moment-and-subspace/action-classification-on-charades)](https://paperswithcode.com/sota/action-classification-on-charades?p=hallucinating-statistical-moment-and-subspace)`

Self-supervising Action Recognition by Statistical Moment and Subspace Descriptors

14 Jan 2020 · Lei Wang, Piotr Koniusz ·

In this paper, we build on a concept of self-supervision by taking RGB frames as input to learn to predict both action concepts and auxiliary descriptors e.g., object descriptors. So-called hallucination streams are trained to predict auxiliary cues, simultaneously fed into classification layers, and then hallucinated at the testing stage to aid network. We design and hallucinate two descriptors, one leveraging four popular object detectors applied to training videos, and the other leveraging image- and video-level saliency detectors. The first descriptor encodes the detector- and ImageNet-wise class prediction scores, confidence scores, and spatial locations of bounding boxes and frame indexes to capture the spatio-temporal distribution of features per video. Another descriptor encodes spatio-angular gradient distributions of saliency maps and intensity patterns. Inspired by the characteristic function of the probability distribution, we capture four statistical moments on the above intermediate descriptors. As numbers of coefficients in the mean, covariance, coskewness and cokurtotsis grow linearly, quadratically, cubically and quartically w.r.t. the dimension of feature vectors, we describe the covariance matrix by its leading n' eigenvectors (so-called subspace) and we capture skewness/kurtosis rather than costly coskewness/cokurtosis. We obtain state of the art on five popular datasets such as Charades and EPIC-Kitchens.

PDF Abstract

Code

Add Remove Mark official

No code implementations yet. Submit your code now

Tasks

Add Remove

Action Classification

Action Recognition

Egocentric Activity Recognition

Hallucination

Optical Flow Estimation

Scene Recognition

Datasets

MS COCO

HMDB51

Charades

AVA

EPIC-KITCHENS-55 YUP++

Results from the Paper

Edit

Ranked #1 on Scene Recognition on YUP++ (using extra training data)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Action Classification	Charades	DEEP-HAL with ODF+SDF (I3D)	MAP	50.16	# 11	Compare
Action Classification	Charades	DEEP-HAL with ODF+SDF (AssembleNet++)	MAP	62.29	# 4	Compare
Egocentric Activity Recognition	EPIC-KITCHENS-55	DEEP-HAL with ODF+SDF (AssembleNet++)	Actions Top-1 (S2)	27.3	# 1	Compare
Egocentric Activity Recognition	EPIC-KITCHENS-55	DEEP-HAL with ODF+SDF (AssembleNet++)	Actions Top-1 (S1)	35.8	# 1	Compare
Action Recognition	HMDB-51	DEEP-HAL with ODF+SDF(I3D)	Average accuracy of 3 splits	87.56	# 2	Compare
Scene Recognition	YUP++	DEEP-HAL with ODF+SDF (I3D)	Accuracy (%)	94.4	# 1	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Self-supervising Action Recognition by Statistical Moment and Subspace Descriptors

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove