Action Localization
136 papers with code • 0 benchmarks • 3 datasets
Action Localization is the task of finding the spatial and temporal coordinates of an action in a video. An action localization model identifies the frames in which an action starts and ends, and returns the (x, y) coordinates of the action in each frame. These coordinates change as the object performing the action moves.
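The output described above can be sketched as a simple data structure: a temporal extent (start and end frame) plus per-frame spatial coordinates. This is an illustrative, hypothetical representation, not the schema of any particular library or benchmark; the class name, fields, and values are assumptions for the sake of the example.

```python
from dataclasses import dataclass

@dataclass
class ActionLocalization:
    """Hypothetical container for one localized action in a video."""
    label: str
    start_frame: int   # first frame in which the action occurs
    end_frame: int     # last frame in which the action occurs
    boxes: dict        # frame index -> (x, y, w, h) bounding box

    def duration(self) -> int:
        # number of frames the action spans, inclusive of both endpoints
        return self.end_frame - self.start_frame + 1

# A "running" action detected from frame 10 to 12; the box's
# x coordinate increases as the runner moves to the right,
# illustrating how the spatial coordinates change over time.
det = ActionLocalization(
    label="running",
    start_frame=10,
    end_frame=12,
    boxes={10: (40, 80, 32, 64), 11: (48, 80, 32, 64), 12: (56, 80, 32, 64)},
)

print(det.duration())    # 3
print(det.boxes[12][0])  # 56
```

Real systems typically predict many such tuples per video, each with a confidence score, and evaluate them against ground truth using spatio-temporal intersection-over-union.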
Benchmarks
These leaderboards are used to track progress in Action Localization
Libraries
Use these libraries to find Action Localization models and implementations
Latest papers
Boosting Positive Segments for Weakly-Supervised Audio-Visual Video Parsing
To address this, we focus on improving the proportion of positive segments detected in a video.
Anchor-free temporal action localization via Progressive Boundary-aware Boosting
The TCM aggregates the temporal context information and provides features for the IBM and the FPBM.
Dilation-Erosion for Single-Frame Supervised Temporal Action Localization
To balance the annotation labor and the granularity of supervision, single-frame annotation has been introduced in temporal action localization.
Re^2TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action Localization
Temporal action localization (TAL) requires long-form reasoning to predict actions of various durations and complex content.
ReLER@ZJU Submission to the Ego4D Moment Queries Challenge 2022
Moreover, in order to better capture the long-term temporal dependencies in the long videos, we propose a segment-level recurrence mechanism.
Where a Strong Backbone Meets Strong Features -- ActionFormer for Ego4D Moment Queries Challenge
This report describes our submission to the Ego4D Moment Queries Challenge 2022.
A Simple Transformer-Based Model for Ego4D Natural Language Queries Challenge
This report describes Badgers@UW-Madison, our submission to the Ego4D Natural Language Queries (NLQ) Challenge.
Soft-Landing Strategy for Alleviating the Task Discrepancy Problem in Temporal Action Localization Tasks
Temporal Action Localization (TAL) methods typically operate on top of feature sequences from a frozen snippet encoder that is pretrained with the Trimmed Action Classification (TAC) tasks, resulting in a task discrepancy problem.
SimOn: A Simple Framework for Online Temporal Action Localization
In addition, the evaluation for Online Detection of Action Start (ODAS) demonstrates the effectiveness and robustness of our method in the online setting.
EgoTaskQA: Understanding Human Tasks in Egocentric Videos
The challenges of such capability lie in the difficulty of generating a detailed understanding of situated actions, their effects on object states (i.e., state changes), and their causal dependencies.