
Action Localization

26 papers with code · Computer Vision

Leaderboards

You can find evaluation results in the subtasks. You can also submit evaluation metrics for this task.

Latest papers with code

Weakly-Supervised Action Localization by Generative Attention Modeling

27 Mar 2020 bfshi/DGAM-Weakly-Supervised-Action-Localization

By maximizing the conditional probability with respect to the attention, the action and non-action frames are well separated.

WEAKLY SUPERVISED ACTION LOCALIZATION · WEAKLY-SUPERVISED TEMPORAL ACTION LOCALIZATION

63

Weakly Supervised Temporal Action Localization Using Deep Metric Learning

21 Jan 2020 asrafulashiq/wsad

We propose a classification module to generate action labels for each segment in the video, and a deep metric learning module to learn the similarity between different action instances.

METRIC LEARNING · TEMPORAL LOCALIZATION · VIDEO UNDERSTANDING · WEAKLY-SUPERVISED TEMPORAL ACTION LOCALIZATION

10

STAGE: Spatio-Temporal Attention on Graph Entities for Video Action Detection

9 Dec 2019 aimagelab/STAGE_action_detection

Spatio-temporal action localization is a challenging yet fascinating task that aims to detect and classify human actions in video clips.

ACTION DETECTION · SPATIO-TEMPORAL ACTION LOCALIZATION · TEMPORAL ACTION LOCALIZATION · VIDEO UNDERSTANDING

26

Learning Sparse 2D Temporal Adjacent Networks for Temporal Action Localization

8 Dec 2019 microsoft/2D-TAN

In this report, we introduce the winning method for the HACS Temporal Action Localization Challenge 2019.

TEMPORAL ACTION LOCALIZATION

89

Background Suppression Network for Weakly-supervised Temporal Action Localization

22 Nov 2019 Pilhyeon/BaSNet-pytorch

This formulation does not fully model the problem in that background frames are forced to be misclassified as action classes to predict video-level labels accurately.

WEAKLY SUPERVISED ACTION LOCALIZATION · WEAKLY-SUPERVISED TEMPORAL ACTION LOCALIZATION

75

You Only Watch Once: A Unified CNN Architecture for Real-Time Spatiotemporal Action Localization

15 Nov 2019 wei-tim/YOWO

YOWO is a single-stage architecture with two branches to extract temporal and spatial information concurrently and predict bounding boxes and action probabilities directly from video clips in one evaluation.

TEMPORAL ACTION LOCALIZATION

471

Graph Convolutional Networks for Temporal Action Localization

ICCV 2019 Alvin-Zeng/PGCN

Then we apply the GCNs over the graph to model the relations among different proposals and learn powerful representations for the action classification and localization.

ACTION CLASSIFICATION · TEMPORAL ACTION LOCALIZATION

183
07 Sep 2019

3C-Net: Category Count and Center Loss for Weakly-Supervised Action Localization

ICCV 2019 naraysa/3c-net

Our joint formulation has three terms: a classification term to ensure the separability of learned action features, an adapted multi-label center loss term to enhance the action feature discriminability and a counting loss term to delineate adjacent action sequences, leading to improved localization.

WEAKLY SUPERVISED ACTION LOCALIZATION · WEAKLY-SUPERVISED TEMPORAL ACTION LOCALIZATION

35
22 Aug 2019

HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips

ICCV 2019 antoine77340/S3D_HowTo100M

In this work, we propose instead to learn such embeddings from video data with readily available natural language annotations in the form of automatically transcribed narrations.

ACTION LOCALIZATION · VIDEO RETRIEVAL

29
07 Jun 2019