Activity Recognition

180 papers with code • 3 benchmarks • 27 datasets

Human Activity Recognition is the problem of identifying events performed by humans given a video input. It is formulated as a binary (or multiclass) classification problem of outputting activity class labels. Activity Recognition is an important problem with many societal applications including smart surveillance, video search/retrieval, intelligent robots, and other monitoring systems.

Source: Learning Latent Sub-events in Activity Videos Using Temporal Attention Filters

Libraries

Use these libraries to find Activity Recognition models and implementations

Most implemented papers

Multivariate LSTM-FCNs for Time Series Classification

houshd/MLSTM-FCN 14 Jan 2018

Over the past decade, multivariate time series classification has received great attention.

Real-world Anomaly Detection in Surveillance Videos

WaqasSultani/AnomalyDetectionCVPR2018 CVPR 2018

To avoid annotating the anomalous segments or clips in training videos, which is very time consuming, we propose to learn anomaly through the deep multiple instance ranking framework by leveraging weakly labeled training videos, i. e. the training labels (anomalous or normal) are at video-level instead of clip-level.

Representation Flow for Action Recognition

piergiaj/representation-flow-cvpr19 CVPR 2019

Our representation flow layer is a fully-differentiable layer designed to capture the `flow' of any representation channel within a convolutional neural network for action recognition.

CholecTriplet2021: A benchmark challenge for surgical action triplet recognition

CAMMA-public/cholectriplet2021 10 Apr 2022

In this paper, we present the challenge setup and assessment of the state-of-the-art deep learning methods proposed by the participants during the challenge.

TS-LSTM and Temporal-Inception: Exploiting Spatiotemporal Dynamics for Activity Recognition

chihyaoma/Activity-Recognition-with-CNN-and-RNN 30 Mar 2017

We demonstrate that using both RNNs (using LSTMs) and Temporal-ConvNets on spatiotemporal feature matrices are able to exploit spatiotemporal dynamics to improve the overall performance.

Deep Residual Bidir-LSTM for Human Activity Recognition Using Wearable Sensors

guillaume-chevalier/HAR-stacked-residual-bidir-LSTMs 22 Aug 2017

Human activity recognition (HAR) has become a popular topic in research because of its wide application.

Im2Flow: Motion Hallucination from Static Images for Action Recognition

rhgao/Im2Flow CVPR 2018

Second, we show the power of hallucinated flow for recognition, successfully transferring the learned motion into a standard two-stream network for activity recognition.

Temporal Relational Reasoning in Videos

metalbubble/TRN-pytorch ECCV 2018

Temporal relational reasoning, the ability to link meaningful transformations of objects or entities over time, is a fundamental property of intelligent species.

Fine-grained Activity Recognition in Baseball Videos

piergiaj/mlb-youtube 9 Apr 2018

In this paper, we introduce a challenging new dataset, MLB-YouTube, designed for fine-grained activity detection.

Large-scale weakly-supervised pre-training for video action recognition

microsoft/computervision-recipes CVPR 2019

Second, frame-based models perform quite well on action recognition; is pre-training for good image features sufficient or is pre-training for spatio-temporal features valuable for optimal transfer learning?