Action Localization

135 papers with code • 0 benchmarks • 3 datasets

Action Localization is finding the spatial and temporal co ordinates for an action in a video. An action localization model will identify which frame an action start and ends in video and return the x,y coordinates of an action. Further the co ordinates will change when the object performing action undergoes a displacement.

Benchmarks

Add a Result

These leaderboards are used to track progress in Action Localization

You can find evaluation results in the subtasks. You can also submitting evaluation metrics for this task.

Libraries

Use these libraries to find Action Localization models and implementations

Pilhyeon/Learning-Action-Completene…

3 papers

open-mmlab/mmaction2

2 papers

3,888

Datasets

Subtasks

Latest papers

Most implemented Social Latest No code

Test-Time Zero-Shot Temporal Action Localization

benedettaliberatori/t3al • • 8 Apr 2024

To this aim, we introduce a novel method that performs Test-Time adaptation for Temporal Action Localization (T3AL).

08 Apr 2024

Paper
Code

UniAV: Unified Audio-Visual Perception for Multi-Task Video Localization

ttgeng233/UniAV • • 4 Apr 2024

Video localization tasks aim to temporally locate specific instances in videos, including temporal action localization (TAL), sound event detection (SED) and audio-visual event localization (AVEL).

04 Apr 2024

Paper
Code

ASTRA: An Action Spotting TRAnsformer for Soccer Videos

arturxe2/astra • • 2 Apr 2024

In this paper, we introduce ASTRA, a Transformer-based model designed for the task of Action Spotting in soccer matches.

02 Apr 2024

Paper
Code

Revisiting Foreground and Background Separation in Weakly-supervised Temporal Action Localization: A Clustering-based Approach

qinying-liu/case • • ICCV 2023

It comprises two core components: a snippet clustering component that groups the snippets into multiple latent clusters and a cluster classification component that further classifies the cluster as foreground or background.

100

21 Dec 2023

Paper
Code

Unsupervised Temporal Action Localization via Self-paced Incremental Learning

tanghaoyu258/feel • • 12 Dec 2023

Thereafter, we design two (constant- and variable- speed) incremental instance learning strategies for easy-to-hard model training, thus ensuring the reliability of these video pseudolabels and further improving overall localization performance.

12 Dec 2023

Paper
Code

GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI Navigation

zzxslp/mm-navigator • 13 Nov 2023

We first benchmark MM-Navigator on our collected iOS screen dataset.

106

13 Nov 2023

Paper
Code

Temporal Action Localization with Enhanced Instant Discriminability

dingfengshi/tridet • • 11 Sep 2023

Temporal action detection (TAD) aims to detect all action boundaries and their corresponding categories in an untrimmed video.

148

11 Sep 2023

Paper
Code

HR-Pro: Point-supervised Temporal Action Localization via Hierarchical Reliability Propagation

pipixin321/hr-pro • • 24 Aug 2023

For snippet-level learning, we introduce an online-updated memory to store reliable snippet prototypes for each class.

24 Aug 2023

Paper
Code

DDG-Net: Discriminability-Driven Graph Network for Weakly-supervised Temporal Action Localization

xiaojuntang22/iccv2023-ddgnet • • ICCV 2023

Considering this phenomenon, we propose Discriminability-Driven Graph Network (DDG-Net), which explicitly models ambiguous snippets and discriminative snippets with well-designed connections, preventing the transmission of ambiguous information and enhancing the discriminability of snippet-level representations.

31 Jul 2023

Paper
Code

NMS Threshold matters for Ego4D Moment Queries -- 2nd place solution to the Ego4D Moment Queries Challenge 2023

happyharrycn/actionformer_release • • 5 Jul 2023

This report describes our submission to the Ego4D Moment Queries Challenge 2023.

384

05 Jul 2023

Paper
Code

Action Localization

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers

Content

Benchmarks

Add a Result