Action Localization

135 papers with code • 0 benchmarks • 3 datasets

Action Localization is finding the spatial and temporal co ordinates for an action in a video. An action localization model will identify which frame an action start and ends in video and return the x,y coordinates of an action. Further the co ordinates will change when the object performing action undergoes a displacement.

Libraries

Use these libraries to find Action Localization models and implementations

Latest papers with no code

Localizing Moments of Actions in Untrimmed Videos of Infants with Autism Spectrum Disorder

no code yet • 8 Apr 2024

This study is the first to conduct end-to-end temporal action localization in untrimmed videos of infants with ASD, offering promising avenues for early intervention and support.

LoSA: Long-Short-range Adapter for Scaling End-to-End Temporal Action Localization

no code yet • 1 Apr 2024

Temporal Action Localization (TAL) involves localizing and classifying action snippets in an untrimmed video.

PLOT-TAL -- Prompt Learning with Optimal Transport for Few-Shot Temporal Action Localization

no code yet • 27 Mar 2024

This paper introduces a novel approach to temporal action localization (TAL) in few-shot learning.

Boosting Semi-Supervised Temporal Action Localization by Learning from Non-Target Classes

no code yet • 17 Mar 2024

To this end, we first devise innovative strategies to adaptively select high-quality positive and negative classes from the label space, by modeling both the confidence and rank of a class in relation to those of the target class.

BID: Boundary-Interior Decoding for Unsupervised Temporal Action Localization Pre-Trainin

no code yet • 12 Mar 2024

Skeleton-based motion representations are robust for action localization and understanding for their invariance to perspective, lighting, and occlusion, compared with images.

Density-Guided Label Smoothing for Temporal Localization of Driving Actions

no code yet • 11 Mar 2024

Temporal localization of driving actions plays a crucial role in advanced driver-assistance systems and naturalistic driving studies.

Cutup and Detect: Human Fall Detection on Cutup Untrimmed Videos Using a Large Foundational Video Understanding Model

no code yet • 29 Jan 2024

The results are promising for real-time application, and the falls are detected on video level with a state-of-the-art 0. 96 F1 score on the HQFSD dataset under the given experimental settings.

Multiscale Vision Transformers meet Bipartite Matching for efficient single-stage Action Localization

no code yet • 29 Dec 2023

These methods build on adding a DETR head with learnable queries that, after cross- and self-attention can be sent to corresponding MLPs for detecting a person's bounding box and action.

SADA: Semantic adversarial unsupervised domain adaptation for Temporal Action Localization

no code yet • 20 Dec 2023

Temporal Action Localization (TAL) is a complex task that poses relevant challenges, particularly when attempting to generalize on new -- unseen -- domains in real-world applications.

ADM-Loc: Actionness Distribution Modeling for Point-supervised Temporal Action Localization

no code yet • 27 Nov 2023

This paper addresses the challenge of point-supervised temporal action detection, in which only one frame per action instance is annotated in the training set.