Temporal Action Localization

149 papers with code · Computer Vision

Temporal Action Localization aims to detect activities in a video stream and output the beginning and end timestamps of each one. It is closely related to Temporal Action Proposal Generation.
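As a minimal sketch of what "beginning and end timestamps" means in practice, a detection can be represented as a labeled time interval, and a prediction can be compared with an annotation using temporal intersection over union (tIoU), a commonly used matching criterion for this task. The `Segment` class, the `temporal_iou` function, and the example values below are hypothetical illustrations and are not taken from any of the listed papers or repositories.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Segment:
    """A detected or annotated action instance (hypothetical structure)."""
    label: str
    start: float  # beginning timestamp, in seconds
    end: float    # end timestamp, in seconds


def temporal_iou(a: Segment, b: Segment) -> float:
    """Intersection over union of two time intervals, in [0, 1]."""
    # Overlap is zero when the intervals are disjoint.
    intersection = max(0.0, min(a.end, b.end) - max(a.start, b.start))
    union = (a.end - a.start) + (b.end - b.start) - intersection
    return intersection / union if union > 0 else 0.0


# Illustrative values only: a prediction shifted 2 s from the annotation.
prediction = Segment("long jump", start=12.0, end=20.0)
ground_truth = Segment("long jump", start=14.0, end=22.0)
print(temporal_iou(prediction, ground_truth))  # 0.6
```

A prediction is typically counted as correct when its label matches and its tIoU with an annotation exceeds a chosen threshold; the threshold itself is a benchmark-specific choice.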

Greatest papers with code

AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions

CVPR 2018 tensorflow/models

The AVA dataset densely annotates 80 atomic visual actions in 430 15-minute video clips, where actions are localized in space and time, resulting in 1.58M action labels with multiple labels per person occurring frequently.

TEMPORAL ACTION LOCALIZATION · VIDEO UNDERSTANDING

Large-scale weakly-supervised pre-training for video action recognition

CVPR 2019 microsoft/computervision-recipes

Second, frame-based models perform quite well on action recognition; is pre-training for good image features sufficient or is pre-training for spatio-temporal features valuable for optimal transfer learning?

ACTION RECOGNITION IN VIDEOS · ACTIVITY RECOGNITION IN VIDEOS · TRANSFER LEARNING

A Closer Look at Spatiotemporal Convolutions for Action Recognition

CVPR 2018 microsoft/computervision-recipes

In this paper we discuss several forms of spatiotemporal convolutions for video analysis and study their effects on action recognition.

ACTION RECOGNITION IN VIDEOS

Temporal Segment Networks for Action Recognition in Videos

8 May 2017 yjxiong/temporal-segment-networks

Furthermore, based on the temporal segment networks, we won the video classification track at the ActivityNet challenge 2016 among 24 teams, which demonstrates the effectiveness of TSN and the proposed good practices.

#5 best model for Action Classification on Moments in Time (Top 5 Accuracy metric)

ACTION CLASSIFICATION · ACTION RECOGNITION IN VIDEOS

Temporal Segment Networks: Towards Good Practices for Deep Action Recognition

2 Aug 2016 yjxiong/temporal-segment-networks

The other contribution is our study on a series of good practices in learning ConvNets on video data with the help of temporal segment network.

ACTION RECOGNITION IN VIDEOS · MULTIMODAL ACTIVITY RECOGNITION

Sparse 3D convolutional neural networks

12 May 2015 facebookresearch/SparseConvNet

We have implemented a convolutional neural network designed for processing sparse three-dimensional input data.

3D OBJECT RECOGNITION · TEMPORAL ACTION LOCALIZATION

TS-LSTM and Temporal-Inception: Exploiting Spatiotemporal Dynamics for Activity Recognition

30 Mar 2017 jeffreyhuang1/two-stream-action-recognition

We demonstrate that both RNNs (using LSTMs) and Temporal-ConvNets, applied to spatiotemporal feature matrices, are able to exploit spatiotemporal dynamics to improve the overall performance.

ACTION CLASSIFICATION · ACTION RECOGNITION IN VIDEOS · VIDEO UNDERSTANDING

Grad-CAM++: Improved Visual Explanations for Deep Convolutional Networks

30 Oct 2017 sicara/tf-explain

Over the last decade, Convolutional Neural Network (CNN) models have been highly successful in solving complex vision problems.

3D HUMAN ACTION RECOGNITION · OBJECT LOCALIZATION