TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK	EXTRA DATA	REMOVE
Action Recognition	Jester (Gesture Recognition)	X3D MobileNet-V3 LGD-GC	Val	95.56	# 3
Action Recognition	UCF101	X3D MobileNet-V3 LGD-GC	3-fold Accuracy	94.85	# 47

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/ligar-lightweight-general-purpose-action/action-recognition-on-jester-gesture)](https://paperswithcode.com/sota/action-recognition-on-jester-gesture?p=ligar-lightweight-general-purpose-action)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/ligar-lightweight-general-purpose-action/action-recognition-in-videos-on-ucf101)](https://paperswithcode.com/sota/action-recognition-in-videos-on-ucf101?p=ligar-lightweight-general-purpose-action)`

LIGAR: Lightweight General-purpose Action Recognition

30 Aug 2021 · Evgeny Izutov ·

Growing amount of different practical tasks in a video understanding problem has addressed the great challenge aiming to design an universal solution, which should be available for broad masses and suitable for the demanding edge-oriented inference. In this paper we are focused on designing a network architecture and a training pipeline to tackle the mentioned challenges. Our architecture takes the best from the previous ones and brings the ability to be successful not only in appearance-based action recognition tasks but in motion-based problems too. Furthermore, the induced label noise problem is formulated and Adaptive Clip Selection (ACS) framework is proposed to deal with it. Together it makes the LIGAR framework the general-purpose action recognition solution. We also have reported the extensive analysis on the general and gesture datasets to show the excellent trade-off between the performance and the accuracy in comparison to the state-of-the-art solutions. Training code is available at: https://github.com/openvinotoolkit/training_extensions. For the efficient edge-oriented inference all trained models can be exported into the OpenVINO format.

PDF Abstract

Code

Add Remove Mark official

openvinotoolkit/training_extensions official

1,120

Tasks

Add Remove

Action Recognition

Gesture Recognition

Hand-Gesture Recognition

Video Understanding

Datasets

UCF101

ActivityNet

YouTube-8M

Kinetics-700

Jester (Gesture Recognition)

Results from the Paper

Add Remove

Ranked #3 on Action Recognition on Jester (Gesture Recognition) (using extra training data)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Uses Extra Training Data	Benchmark
Action Recognition	Jester (Gesture Recognition)	X3D MobileNet-V3 LGD-GC	Val	95.56	# 3		Compare
Action Recognition	UCF101	X3D MobileNet-V3 LGD-GC	3-fold Accuracy	94.85	# 47		Compare

Methods

Add Remove

1x1 Convolution • 3D Convolution • Average Pooling • Batch Normalization • CLIP • Convolution • Dense Connections • Depthwise Convolution • Depthwise Separable Convolution • Dropout • Global Average Pooling • Hard Swish • Inverted Residual Block • MobileNetV3 • Pointwise Convolution • ReLU • ReLU6 • Sigmoid Activation • Squeeze-and-Excitation Block

Edit Social Preview

LIGAR: Lightweight General-purpose Action Recognition

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit Add Remove

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Add Remove

Methods

Add Remove