Browse > Computer Vision > Video > Action Classification

Action Classification

38 papers with code · Computer Vision
Subtask of Video

State-of-the-art leaderboards

Greatest papers with code

Temporal Segment Networks for Action Recognition in Videos

8 May 2017yjxiong/temporal-segment-networks

Furthermore, based on the temporal segment networks, we won the video classification track at the ActivityNet challenge 2016 among 24 teams, which demonstrates the effectiveness of TSN and the proposed good practices.

ACTION CLASSIFICATION ACTION RECOGNITION IN VIDEOS

Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset

CVPR 2017 deepmind/kinetics-i3d

The paucity of videos in current action classification datasets (UCF-101 and HMDB-51) has made it difficult to identify good video architectures, as most methods obtain similar performance on existing small-scale benchmarks.

#3 best model for Action Classification on HMDB51 (using extra training data)

ACTION RECOGNITION IN VIDEOS SKELETON BASED ACTION RECOGNITION

The Kinetics Human Action Video Dataset

19 May 2017deepmind/kinetics-i3d

We describe the DeepMind Kinetics human action video dataset.

ACTION CLASSIFICATION HUMAN-OBJECT INTERACTION DETECTION

FastGRNN: A Fast, Accurate, Stable and Tiny Kilobyte Sized Gated Recurrent Neural Network

NeurIPS 2018 Microsoft/EdgeML

FastRNN addresses these limitations by adding a residual connection that does not constrain the range of the singular values explicitly and has only two extra scalar parameters.

ACTION CLASSIFICATION LANGUAGE MODELLING SPEECH RECOGNITION TIME SERIES TIME SERIES CLASSIFICATION

Resource-efficient Machine Learning in 2 KB RAM for the Internet of Things

ICML 2017 Microsoft/EdgeML

This paper develops a novel tree-based algorithm, called Bonsai, for efficient prediction on IoT devices – such as those based on the Arduino Uno board having an 8 bit ATmega328P microcontroller operating at 16 MHz with no native floating point support, 2 KB RAM and 32 KB read-only flash.

ACTION CLASSIFICATION

Video Classification with Channel-Separated Convolutional Networks

4 Apr 2019facebookresearch/R2Plus1D

It is natural to ask: 1) if group convolution can help to alleviate the high computational cost of video classification networks; 2) what factors matter the most in 3D group convolutional networks; and 3) what are good computation/accuracy trade-offs with 3D group convolutional networks.

ACTION CLASSIFICATION ACTION RECOGNITION IN VIDEOS IMAGE CLASSIFICATION

Temporal Relational Reasoning in Videos

ECCV 2018 metalbubble/TRN-pytorch

Temporal relational reasoning, the ability to link meaningful transformations of objects or entities over time, is a fundamental property of intelligent species.

ACTION CLASSIFICATION ACTION RECOGNITION IN VIDEOS COMMON SENSE REASONING HUMAN-OBJECT INTERACTION DETECTION RELATIONAL REASONING

TS-LSTM and Temporal-Inception: Exploiting Spatiotemporal Dynamics for Activity Recognition

30 Mar 2017jeffreyhuang1/two-stream-action-recognition

We demonstrate that using both RNNs (using LSTMs) and Temporal-ConvNets on spatiotemporal feature matrices are able to exploit spatiotemporal dynamics to improve the overall performance.

ACTION CLASSIFICATION ACTION RECOGNITION IN VIDEOS VIDEO UNDERSTANDING

ECO: Efficient Convolutional Network for Online Video Understanding

ECCV 2018 mzolfaghari/ECO-efficient-video-understanding

In this paper, we introduce a network architecture that takes long-term content into account and enables fast per-video processing at the same time.

#11 best model for Action Recognition In Videos on Something-Something V1 (using extra training data)

ACTION CLASSIFICATION ACTION RECOGNITION IN VIDEOS VIDEO CAPTIONING VIDEO RETRIEVAL VIDEO UNDERSTANDING

Temporal Action Localization in Untrimmed Videos via Multi-stage CNNs

CVPR 2016 zhengshou/scnn

To address this challenging issue, we exploit the effectiveness of deep networks in temporal action localization via three segment-based 3D ConvNets: (1) a proposal network identifies candidate segments in a long video that may contain actions; (2) a classification network learns one-vs-all action classification model to serve as initialization for the localization network; and (3) a localization network fine-tunes on the learned classification network to localize each action instance.

ACTION CLASSIFICATION ACTION LOCALIZATION TEMPORAL LOCALIZATION