Action Localization
135 papers with code • 0 benchmarks • 3 datasets
Action Localization is finding the spatial and temporal co ordinates for an action in a video. An action localization model will identify which frame an action start and ends in video and return the x,y coordinates of an action. Further the co ordinates will change when the object performing action undergoes a displacement.
Benchmarks
These leaderboards are used to track progress in Action Localization
Libraries
Use these libraries to find Action Localization models and implementationsLatest papers with no code
Localizing Moments of Actions in Untrimmed Videos of Infants with Autism Spectrum Disorder
This study is the first to conduct end-to-end temporal action localization in untrimmed videos of infants with ASD, offering promising avenues for early intervention and support.
LoSA: Long-Short-range Adapter for Scaling End-to-End Temporal Action Localization
Temporal Action Localization (TAL) involves localizing and classifying action snippets in an untrimmed video.
PLOT-TAL -- Prompt Learning with Optimal Transport for Few-Shot Temporal Action Localization
This paper introduces a novel approach to temporal action localization (TAL) in few-shot learning.
Boosting Semi-Supervised Temporal Action Localization by Learning from Non-Target Classes
To this end, we first devise innovative strategies to adaptively select high-quality positive and negative classes from the label space, by modeling both the confidence and rank of a class in relation to those of the target class.
BID: Boundary-Interior Decoding for Unsupervised Temporal Action Localization Pre-Trainin
Skeleton-based motion representations are robust for action localization and understanding for their invariance to perspective, lighting, and occlusion, compared with images.
Density-Guided Label Smoothing for Temporal Localization of Driving Actions
Temporal localization of driving actions plays a crucial role in advanced driver-assistance systems and naturalistic driving studies.
Cutup and Detect: Human Fall Detection on Cutup Untrimmed Videos Using a Large Foundational Video Understanding Model
The results are promising for real-time application, and the falls are detected on video level with a state-of-the-art 0. 96 F1 score on the HQFSD dataset under the given experimental settings.
Multiscale Vision Transformers meet Bipartite Matching for efficient single-stage Action Localization
These methods build on adding a DETR head with learnable queries that, after cross- and self-attention can be sent to corresponding MLPs for detecting a person's bounding box and action.
SADA: Semantic adversarial unsupervised domain adaptation for Temporal Action Localization
Temporal Action Localization (TAL) is a complex task that poses relevant challenges, particularly when attempting to generalize on new -- unseen -- domains in real-world applications.
ADM-Loc: Actionness Distribution Modeling for Point-supervised Temporal Action Localization
This paper addresses the challenge of point-supervised temporal action detection, in which only one frame per action instance is annotated in the training set.