Action Localization

136 papers with code • 0 benchmarks • 3 datasets

Action Localization is the task of finding the spatial and temporal coordinates of an action in a video. An action localization model identifies the frames in which an action starts and ends, and returns the x, y coordinates of the action. These coordinates change from frame to frame as the object performing the action moves.
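The output described above can be pictured as a per-frame sequence of bounding boxes bounded by a start and an end frame. The following is a minimal sketch of that idea, not any particular library's API: the `ActionTube` class, the `Box` alias, and the example values are all hypothetical names and numbers chosen for illustration. The `temporal_iou` helper shows one common way to compare a predicted interval with a reference interval, assuming inclusive frame indices.

```python
from dataclasses import dataclass
from typing import Dict, Tuple

# Assumed box convention: (x_min, y_min, x_max, y_max) in pixels.
Box = Tuple[float, float, float, float]


@dataclass
class ActionTube:
    """Hypothetical container for one localized action instance."""
    label: str
    boxes: Dict[int, Box]  # frame index -> box around the actor in that frame

    @property
    def start_frame(self) -> int:
        # First frame in which the action is present.
        return min(self.boxes)

    @property
    def end_frame(self) -> int:
        # Last frame in which the action is present.
        return max(self.boxes)


def temporal_iou(a: ActionTube, b: ActionTube) -> float:
    """Intersection over union of two inclusive frame intervals."""
    inter = min(a.end_frame, b.end_frame) - max(a.start_frame, b.start_frame) + 1
    inter = max(inter, 0)  # disjoint intervals have no overlap
    len_a = a.end_frame - a.start_frame + 1
    len_b = b.end_frame - b.start_frame + 1
    return inter / (len_a + len_b - inter)


# Made-up prediction: the box shifts 2 pixels right per frame as the actor moves.
pred = ActionTube(
    "jump",
    {f: (10.0 + 2 * (f - 30), 20.0, 60.0 + 2 * (f - 30), 120.0) for f in range(30, 50)},
)
# Made-up reference annotation covering frames 35 to 54.
truth = ActionTube("jump", {f: (12.0, 20.0, 62.0, 120.0) for f in range(35, 55)})

print(pred.start_frame, pred.end_frame)  # temporal extent of the prediction
print(temporal_iou(pred, truth))         # overlap between the two intervals
```

Storing one box per frame keeps the temporal extent and the spatial track in a single structure, so the start and end frames fall out of the keys rather than being stored separately.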

Boosting Positive Segments for Weakly-Supervised Audio-Visual Video Parsing

kranthikumarr/poibin ICCV 2023

To address this, we focus on improving the proportion of positive segments detected in a video.

1
01 Jan 2023

Anchor-free temporal action localization via Progressive Boundary-aware Boosting

2024-MindSpore-1/Code8 journal 2023

The TCM aggregates the temporal context information and provides features for the IBM and the FPBM.

0
01 Jan 2023

Dilation-Erosion for Single-Frame Supervised Temporal Action Localization

lingjun123/single-frame-tal 13 Dec 2022

To balance the annotation labor and the granularity of supervision, single-frame annotation has been introduced in temporal action localization.

2
13 Dec 2022

Re^2TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action Localization

coolbay/re2tal 25 Nov 2022

Temporal action localization (TAL) requires long-form reasoning to predict actions of various durations and complex content.

12
25 Nov 2022

ReLER@ZJU Submission to the Ego4D Moment Queries Challenge 2022

JonnyS1226/Ego4d_mq_3rd_solution 17 Nov 2022

Moreover, in order to better capture the long-term temporal dependencies in the long videos, we propose a segment-level recurrence mechanism.

1
17 Nov 2022

Where a Strong Backbone Meets Strong Features -- ActionFormer for Ego4D Moment Queries Challenge

happyharrycn/actionformer_release 16 Nov 2022

This report describes our submission to the Ego4D Moment Queries Challenge 2022.

384
16 Nov 2022

A Simple Transformer-Based Model for Ego4D Natural Language Queries Challenge

SichengMo/Ego4D_NLQ_Actionformer 16 Nov 2022

This report describes Badgers@UW-Madison, our submission to the Ego4D Natural Language Queries (NLQ) Challenge.

7
16 Nov 2022

Soft-Landing Strategy for Alleviating the Task Discrepancy Problem in Temporal Action Localization Tasks

musicalOffering/sola CVPR 2023

Temporal Action Localization (TAL) methods typically operate on top of feature sequences from a frozen snippet encoder that is pretrained with the Trimmed Action Classification (TAC) tasks, resulting in a task discrepancy problem.

6
11 Nov 2022

SimOn: A Simple Framework for Online Temporal Action Localization

tuantng/simon 8 Nov 2022

In addition, the evaluation for Online Detection of Action Start (ODAS) demonstrates the effectiveness and robustness of our method in the online setting.

17
08 Nov 2022

EgoTaskQA: Understanding Human Tasks in Egocentric Videos

Buzz-Beater/EgoTaskQA 8 Oct 2022

The challenges of such capability lie in the difficulty of generating a detailed understanding of situated actions, their effects on object states (i.e., state changes), and their causal dependencies.

26
08 Oct 2022