Action Recognition In Videos

64 papers with code • 17 benchmarks • 17 datasets

Action Recognition in Videos is a task in computer vision and pattern recognition where the goal is to identify and categorize human actions performed in a video sequence. The task involves analyzing the spatiotemporal dynamics of the actions and mapping them to a predefined set of action classes, such as running, jumping, or swimming.

Libraries

Use these libraries to find Action Recognition In Videos models and implementations
4 papers
3,866
3 papers
548
2 papers
2,968
See all 5 libraries.

Latest papers with no code

Simba: Mamba augmented U-ShiftGCN for Skeletal Action Recognition in Videos

no code yet • 11 Apr 2024

These spatial features then undergo intermediate temporal modeling facilitated by the Mamba block before progressing to the encoder section, which comprises vanilla upsampling Shift S-GCN blocks.

Deep Learning Approaches for Human Action Recognition in Video Data

no code yet • 11 Mar 2024

The results of this study underscore the potential of composite models in achieving robust human action recognition and suggest avenues for future research in optimizing these models for real-world deployment.

DVANet: Disentangling View and Action Features for Multi-View Action Recognition

no code yet • 10 Dec 2023

In this work, we present a novel approach to multi-view action recognition where we guide learned action representations to be separated from view-relevant information in a video.

Action Class Relation Detection and Classification Across Multiple Video Datasets

no code yet • 15 Aug 2023

The Meta Video Dataset (MetaVD) provides annotated relations between action classes in major datasets for human action recognition in videos.

Knowledge Prompting for Few-shot Action Recognition

no code yet • 22 Nov 2022

Few-shot action recognition in videos is challenging for its lack of supervision and difficulty in generalizing to unseen actions.

Could Giant Pretrained Image Models Extract Universal Representations?

no code yet • 3 Nov 2022

In this paper, we present a study of frozen pretrained models when applied to diverse and representative computer vision tasks, including object detection, semantic segmentation and video action recognition.

Class-Incremental Learning for Action Recognition in Videos

no code yet • ICCV 2021

We tackle catastrophic forgetting problem in the context of class-incremental learning for video recognition, which has not been explored actively despite the popularity of continual learning.

Co-training Transformer with Videos and Images Improves Action Recognition

no code yet • 14 Dec 2021

We term this approach as Co-training Videos and Images for Action Recognition (CoVeR).

Technical Report: Disentangled Action Parsing Networks for Accurate Part-level Action Parsing

no code yet • 5 Nov 2021

Despite of dramatic progresses in the area of video classification research, a severe problem faced by the community is that the detailed understanding of human actions is ignored.

Class incremental learning for video action classification

no code yet • IEEE International Conference on Image Processing (ICIP) 2021

Class Incremental Learning (CIL) is a hot topic in machine learning for CNN models to learn new classes incrementally.