Browse > Computer Vision > Activity Recognition > Action Recognition In Videos

Action Recognition In Videos

State-of-the-art leaderboards

Trend Dataset Best Method Paper title Paper Code Compare

Greatest papers with code

YouTube-8M: A Large-Scale Video Classification Benchmark

27 Sep 2016google/youtube-8m

Despite the size of the dataset, some of our models train to convergence in less than a day on a single machine using TensorFlow.

 SOTA for Action Recognition In Videos on ActivityNet (using extra training data)

ACTION RECOGNITION IN VIDEOS

Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet?

CVPR 2018 kenshohara/3D-ResNets-PyTorch

The purpose of this study is to determine whether current video datasets have sufficient data for training very deep convolutional neural networks (CNNs) with spatio-temporal three-dimensional (3D) kernels.

ACTION RECOGNITION IN VIDEOS

Temporal Segment Networks for Action Recognition in Videos

8 May 2017yjxiong/temporal-segment-networks

Furthermore, based on the temporal segment networks, we won the video classification track at the ActivityNet challenge 2016 among 24 teams, which demonstrates the effectiveness of TSN and the proposed good practices.

ACTION CLASSIFICATION ACTION RECOGNITION IN VIDEOS

Temporal Segment Networks: Towards Good Practices for Deep Action Recognition

2 Aug 2016yjxiong/temporal-segment-networks

The other contribution is our study on a series of good practices in learning ConvNets on video data with the help of temporal segment network.

ACTION RECOGNITION IN VIDEOS MULTIMODAL ACTIVITY RECOGNITION

Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset

CVPR 2017 deepmind/kinetics-i3d

The paucity of videos in current action classification datasets (UCF-101 and HMDB-51) has made it difficult to identify good video architectures, as most methods obtain similar performance on existing small-scale benchmarks.

#3 best model for Action Classification on HMDB51 (using extra training data)

ACTION RECOGNITION IN VIDEOS SKELETON BASED ACTION RECOGNITION

Video Classification with Channel-Separated Convolutional Networks

4 Apr 2019facebookresearch/VMZ

It is natural to ask: 1) if group convolution can help to alleviate the high computational cost of video classification networks; 2) what factors matter the most in 3D group convolutional networks; and 3) what are good computation/accuracy trade-offs with 3D group convolutional networks.

ACTION CLASSIFICATION ACTION RECOGNITION IN VIDEOS IMAGE CLASSIFICATION

A Closer Look at Spatiotemporal Convolutions for Action Recognition

CVPR 2018 facebookresearch/R2Plus1D

In this paper we discuss several forms of spatiotemporal convolutions for video analysis and study their effects on action recognition.

ACTION RECOGNITION IN VIDEOS

Temporal Relational Reasoning in Videos

ECCV 2018 metalbubble/TRN-pytorch

Temporal relational reasoning, the ability to link meaningful transformations of objects or entities over time, is a fundamental property of intelligent species.

ACTION CLASSIFICATION ACTION RECOGNITION IN VIDEOS COMMON SENSE REASONING HUMAN-OBJECT INTERACTION DETECTION RELATIONAL REASONING

Towards Good Practices for Very Deep Two-Stream ConvNets

8 Jul 2015yjxiong/caffe

However, for action recognition in videos, the improvement of deep convolutional networks is not so evident.

ACTION RECOGNITION IN VIDEOS DATA AUGMENTATION