TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Gesture Recognition	ChaLearn 2013	3S Net TTM	Accuracy	92.08	# 1
Gesture Recognition	ChaLearn 2016	3S Net TTM	Accuracy	39.95	# 1
Gesture Recognition	MSRC-12	3S Net TTM	Accuracy	99.01	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/skeleton-based-gesture-recognition-using/gesture-recognition-on-chalearn-2013)](https://paperswithcode.com/sota/gesture-recognition-on-chalearn-2013?p=skeleton-based-gesture-recognition-using)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/skeleton-based-gesture-recognition-using/gesture-recognition-on-chalearn-2016)](https://paperswithcode.com/sota/gesture-recognition-on-chalearn-2016?p=skeleton-based-gesture-recognition-using)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/skeleton-based-gesture-recognition-using/gesture-recognition-on-msrc-12)](https://paperswithcode.com/sota/gesture-recognition-on-msrc-12?p=skeleton-based-gesture-recognition-using)`

Skeleton-based Gesture Recognition Using Several Fully Connected Layers with Path Signature Features and Temporal Transformer Module

17 Nov 2018 · Chenyang Li, Xin Zhang, Lufan Liao, Lianwen Jin, Weixin Yang ·

The skeleton based gesture recognition is gaining more popularity due to its wide possible applications. The key issues are how to extract discriminative features and how to design the classification model. In this paper, we first leverage a robust feature descriptor, path signature (PS), and propose three PS features to explicitly represent the spatial and temporal motion characteristics, i.e., spatial PS (S_PS), temporal PS (T_PS) and temporal spatial PS (T_S_PS). Considering the significance of fine hand movements in the gesture, we propose an "attention on hand" (AOH) principle to define joint pairs for the S_PS and select single joint for the T_PS. In addition, the dyadic method is employed to extract the T_PS and T_S_PS features that encode global and local temporal dynamics in the motion. Secondly, without the recurrent strategy, the classification model still faces challenges on temporal variation among different sequences. We propose a new temporal transformer module (TTM) that can match the sequence key frames by learning the temporal shifting parameter for each input. This is a learning-based module that can be included into standard neural network architecture. Finally, we design a multi-stream fully connected layer based network to treat spatial and temporal features separately and fused them together for the final result. We have tested our method on three benchmark gesture datasets, i.e., ChaLearn 2016, ChaLearn 2013 and MSRC-12. Experimental results demonstrate that we achieve the state-of-the-art performance on skeleton-based gesture recognition with high computational efficiency.

PDF Abstract

Code

Add Remove Mark official

LiChenyang-Github/Temporal-Transfor…

Tasks

Add Remove

Computational Efficiency

General Classification

Gesture Recognition

Datasets

MSRC-12

Results from the Paper

Edit

Ranked #1 on Gesture Recognition on ChaLearn 2013

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Gesture Recognition	ChaLearn 2013	3S Net TTM	Accuracy	92.08	# 1	Compare
Gesture Recognition	ChaLearn 2016	3S Net TTM	Accuracy	39.95	# 1	Compare
Gesture Recognition	MSRC-12	3S Net TTM	Accuracy	99.01	# 1	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Skeleton-based Gesture Recognition Using Several Fully Connected Layers with Path Signature Features and Temporal Transformer Module

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove