TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Skeleton Based Action Recognition	Florence 3D	SCK⊕+DCK⊕	Accuracy	97.45	# 3
Skeleton Based Action Recognition	Florence 3D	SCK+DCK	Accuracy	95.23	# 5
Action Recognition	HMDB-51	SCK⊕(I3D)+IDT	Average accuracy of 3 splits	86.11	# 4
Skeleton Based Action Recognition	NTU RGB+D	SCK⊕	Accuracy (CV)	94.75	# 53
Skeleton Based Action Recognition	NTU RGB+D	SCK⊕	Accuracy (CS)	91.56	# 26
Skeleton Based Action Recognition	UT-Kinect	SCK⊕+DCK⊕	Accuracy	99.2	# 2
Skeleton Based Action Recognition	UT-Kinect	SCK+DCK	Accuracy	98.2	# 5

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/tensor-representations-for-action-recognition/skeleton-based-action-recognition-on-ut)](https://paperswithcode.com/sota/skeleton-based-action-recognition-on-ut?p=tensor-representations-for-action-recognition)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/tensor-representations-for-action-recognition/skeleton-based-action-recognition-on-florence)](https://paperswithcode.com/sota/skeleton-based-action-recognition-on-florence?p=tensor-representations-for-action-recognition)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/tensor-representations-for-action-recognition/action-recognition-in-videos-on-hmdb-51)](https://paperswithcode.com/sota/action-recognition-in-videos-on-hmdb-51?p=tensor-representations-for-action-recognition)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/tensor-representations-for-action-recognition/skeleton-based-action-recognition-on-ntu-rgbd)](https://paperswithcode.com/sota/skeleton-based-action-recognition-on-ntu-rgbd?p=tensor-representations-for-action-recognition)`

Tensor Representations for Action Recognition

28 Dec 2020 · Piotr Koniusz, Lei Wang, Anoop Cherian ·

Human actions in video sequences are characterized by the complex interplay between spatial features and their temporal dynamics. In this paper, we propose novel tensor representations for compactly capturing such higher-order relationships between visual features for the task of action recognition. We propose two tensor-based feature representations, viz. (i) sequence compatibility kernel (SCK) and (ii) dynamics compatibility kernel (DCK). SCK builds on the spatio-temporal correlations between features, whereas DCK explicitly models the action dynamics of a sequence. We also explore generalization of SCK, coined SCK(+), that operates on subsequences to capture the local-global interplay of correlations, which can incorporate multi-modal inputs e.g., skeleton 3D body-joints and per-frame classifier scores obtained from deep learning models trained on videos. We introduce linearization of these kernels that lead to compact and fast descriptors. We provide experiments on (i) 3D skeleton action sequences, (ii) fine-grained video sequences, and (iii) standard non-fine-grained videos. As our final representations are tensors that capture higher-order relationships of features, they relate to co-occurrences for robust fine-grained recognition. We use higher-order tensors and so-called Eigenvalue Power Normalization (EPN) which have been long speculated to perform spectral detection of higher-order occurrences, thus detecting fine-grained relationships of features rather than merely count features in action sequences. We prove that a tensor of order r, built from Z* dimensional features, coupled with EPN indeed detects if at least one higher-order occurrence is `projected' into one of its binom(Z*,r) subspaces of dim. r represented by the tensor, thus forming a Tensor Power Normalization metric endowed with binom(Z*,r) such `detectors'.

PDF Abstract

Code

Add Remove Mark official

ElricXin/Skeleton-MixFormer

Tasks

Add Remove

Action Recognition

Action Recognition In Videos

Skeleton Based Action Recognition

Datasets

Kinetics

HMDB51

NTU RGB+D

UT-Kinect

Florence3D

Results from the Paper

Edit

Ranked #2 on Skeleton Based Action Recognition on UT-Kinect

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Skeleton Based Action Recognition	Florence 3D	SCK⊕+DCK⊕	Accuracy	97.45	# 3	Compare
Skeleton Based Action Recognition	Florence 3D	SCK+DCK	Accuracy	95.23	# 5	Compare
Action Recognition	HMDB-51	SCK⊕(I3D)+IDT	Average accuracy of 3 splits	86.11	# 4	Compare
Skeleton Based Action Recognition	NTU RGB+D	SCK⊕	Accuracy (CV)	94.75	# 53	Compare
Skeleton Based Action Recognition	NTU RGB+D	SCK⊕	Accuracy (CS)	91.56	# 26	Compare
Skeleton Based Action Recognition	UT-Kinect	SCK⊕+DCK⊕	Accuracy	99.2	# 2	Compare
Skeleton Based Action Recognition	UT-Kinect	SCK+DCK	Accuracy	98.2	# 5	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Tensor Representations for Action Recognition

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove