Skeleton Based Action Recognition

158 papers with code • 34 benchmarks • 28 datasets

Skeleton-based Action Recognition is a computer vision task that classifies human actions from sequences of 3D skeletal joint positions captured by sensors such as Microsoft Kinect, Intel RealSense, and wearable devices. Algorithms for this task are used in applications such as human-computer interaction, sports analysis, and surveillance.
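
To make the input format concrete, here is a minimal sketch of how a skeleton sequence is often laid out in memory: an array of shape (frames, joints, 3). The 5-joint layout and the edge list below are hypothetical, chosen only for illustration, and do not correspond to any particular sensor or dataset.

```python
import numpy as np

# Hypothetical 5-joint skeleton (an assumption, not a real sensor layout):
# 0 = torso, 1 = head, 2 = left hand, 3 = right hand, 4 = feet
EDGES = [(0, 1), (0, 2), (0, 3), (0, 4)]  # (parent, child) pairs


def adjacency(num_joints, edges):
    """Symmetric adjacency matrix of the skeleton graph."""
    a = np.zeros((num_joints, num_joints))
    for i, j in edges:
        a[i, j] = a[j, i] = 1.0
    return a


def bone_vectors(seq, edges):
    """Per-frame child-minus-parent offsets; seq has shape (T, V, 3)."""
    parents = [p for p, _ in edges]
    children = [c for _, c in edges]
    return seq[:, children, :] - seq[:, parents, :]


rng = np.random.default_rng(0)
seq = rng.normal(size=(30, 5, 3))  # 30 frames, 5 joints, (x, y, z) per joint
A = adjacency(5, EDGES)            # graph structure used by graph-based models
bones = bone_vectors(seq, EDGES)   # shape (30, 4, 3)
```

Joint coordinates and bone vectors like these are typical inputs to a classifier; the adjacency matrix is what graph-based models consume alongside them.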

(Image credit: View Adaptive Neural Networks for High Performance Skeleton-based Human Action Recognition)


Most implemented papers

Graph Attention Networks

PetarV-/GAT ICLR 2018

We present graph attention networks (GATs), novel neural network architectures that operate on graph-structured data, leveraging masked self-attentional layers to address the shortcomings of prior methods based on graph convolutions or their approximations.
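
As a rough illustration of what "masked self-attention" means here, the sketch below computes single-head attention coefficients over each node's neighbors (plus itself) and uses them to aggregate features. It assumes a dense adjacency matrix and omits the multi-head mechanism and the output nonlinearity; the function and parameter names are illustrative, not taken from the PetarV-/GAT repository.

```python
import numpy as np


def gat_layer(h, adj, w, a, slope=0.2):
    """Single-head graph attention sketch.

    h: (N, F) node features, adj: (N, N) adjacency,
    w: (F, F') weight matrix, a: (2 * F',) attention vector.
    """
    z = h @ w                                  # projected features, (N, F')
    f = z.shape[1]
    # a^T [z_i || z_j] splits into a[:f] . z_i + a[f:] . z_j
    e = (z @ a[:f])[:, None] + (z @ a[f:])[None, :]
    e = np.where(e > 0, e, slope * e)          # LeakyReLU
    # Mask: each node attends only to its neighbors and itself
    mask = (adj + np.eye(adj.shape[0])) > 0
    e = np.where(mask, e, -np.inf)
    e = e - e.max(axis=1, keepdims=True)       # numerically stable softmax
    alpha = np.exp(e)
    alpha = alpha / alpha.sum(axis=1, keepdims=True)
    return alpha @ z, alpha


h = np.eye(3)
adj = np.array([[0.0, 1.0, 0.0], [1.0, 0.0, 0.0], [0.0, 0.0, 0.0]])
out, alpha = gat_layer(h, adj, np.ones((3, 2)), np.ones(4))
```

Each row of `alpha` sums to 1 and is zero for non-neighbors, which is the masking the abstract refers to.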

Semi-Supervised Classification with Graph Convolutional Networks

dmlc/dgl 9 Sep 2016

We present a scalable approach for semi-supervised learning on graph-structured data that is based on an efficient variant of convolutional neural networks which operate directly on graphs.
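
A minimal sketch of one such graph convolution layer, assuming the commonly cited propagation rule H' = ReLU(D^{-1/2} (A + I) D^{-1/2} H W) with a dense adjacency matrix. The function names are illustrative and this is not the dmlc/dgl API.

```python
import numpy as np


def normalize_adjacency(a):
    """Add self-loops, then apply symmetric degree normalization."""
    a_hat = a + np.eye(a.shape[0])
    d_inv_sqrt = 1.0 / np.sqrt(a_hat.sum(axis=1))
    return a_hat * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]


def gcn_layer(a_norm, h, w):
    """One layer: aggregate neighbor features, project, apply ReLU."""
    return np.maximum(a_norm @ h @ w, 0.0)


a = np.array([[0.0, 1.0], [1.0, 0.0]])  # two nodes joined by one edge
a_norm = normalize_adjacency(a)
h_next = gcn_layer(a_norm, np.eye(2), np.ones((2, 3)))  # shape (2, 3)
```

Stacking two such layers and ending with a softmax gives a node classifier in the spirit of the paper; training code is omitted here.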

Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset

open-mmlab/mmaction2 CVPR 2017

The paucity of videos in current action classification datasets (UCF-101 and HMDB-51) has made it difficult to identify good video architectures, as most methods obtain similar performance on existing small-scale benchmarks.

Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recognition

yysijie/st-gcn 23 Jan 2018

Dynamics of human body skeletons convey significant information for human action recognition.

Independently Recurrent Neural Network (IndRNN): Building A Longer and Deeper RNN

Sunnydreamrain/IndRNN_Theano_Lasagne CVPR 2018

Experimental results have shown that the proposed IndRNN is able to process very long sequences (over 5000 time steps), can be used to construct very deep networks (21 layers used in the experiment) and still be trained robustly.
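
For context, the sketch below shows the recurrence that gives IndRNN its name, assuming the form h_t = ReLU(W x_t + u * h_{t-1} + b), where the recurrent weight u is a vector applied elementwise so that each hidden unit depends only on its own previous state. Names and shapes are illustrative and not taken from the linked repository.

```python
import numpy as np


def indrnn_forward(x, w, u, b):
    """Run a single IndRNN layer over a sequence.

    x: (T, input_dim), w: (input_dim, hidden), u and b: (hidden,).
    """
    h = np.zeros(u.shape[0])
    outputs = []
    for x_t in x:
        # Elementwise recurrence: no hidden-to-hidden matrix multiply
        h = np.maximum(x_t @ w + u * h + b, 0.0)
        outputs.append(h)
    return np.stack(outputs)


x = np.ones((3, 1))
states = indrnn_forward(x, np.array([[1.0]]), np.array([0.5]), np.array([0.0]))
# states[:, 0] is 1.0, 1.5, 1.75
```

Because each unit's recurrent weight is a scalar, its magnitude can be bounded directly, which is one way gradients can be kept under control over long sequences.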

FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks

NVIDIA/flownet2-pytorch CVPR 2017

Particularly on small displacements and real-world data, FlowNet cannot compete with variational methods.

UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild

wushidonguc/two-stream-action-recognition-keras 3 Dec 2012

To the best of our knowledge, UCF101 is currently the most challenging dataset of actions due to its large number of classes, large number of clips and also unconstrained nature of such clips.

Simplifying Graph Convolutional Networks

Tiiiger/SGC 19 Feb 2019

Graph Convolutional Networks (GCNs) and their variants have experienced significant attention and have become the de facto methods for learning graph representations.

Co-occurrence Feature Learning from Skeleton Data for Action Recognition and Detection with Hierarchical Aggregation

hikvision-research/skelact 17 Apr 2018

Skeleton-based human action recognition has recently drawn increasing attention with the availability of large-scale skeleton datasets.