Weakly Supervised Action Localization
32 papers with code • 8 benchmarks • 5 datasets
In this task, the training data consists of videos labeled only with the list of activities they contain, without any temporal boundary annotations. At test time, given a video, the algorithm must recognize the activities in the video and also predict their start and end times.
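A minimal sketch of the standard weakly supervised pipeline shared by many of the papers below (a simplified illustration, not any single paper's method): score every snippet per class, pool the snippet scores into a video-level prediction for training against the video-level labels, then threshold the per-snippet scores at test time to recover start/end boundaries.

```python
import numpy as np

def topk_pool(snippet_scores, k):
    """Video-level class scores via top-k mean pooling over time: (T, C) -> (C,)."""
    top = np.sort(snippet_scores, axis=0)[-k:]  # k highest scores per class
    return top.mean(axis=0)

def localize(snippet_scores, cls, threshold, fps=1.0):
    """Threshold one class's temporal scores and merge consecutive active
    snippets into (start_sec, end_sec) segments."""
    active = snippet_scores[:, cls] > threshold
    segments, start = [], None
    for t, a in enumerate(active):
        if a and start is None:
            start = t
        elif not a and start is not None:
            segments.append((start / fps, t / fps))
            start = None
    if start is not None:  # segment runs to the end of the video
        segments.append((start / fps, len(active) / fps))
    return segments

# Toy example: 8 snippets, 2 classes; class 0 is active in snippets 2-4.
scores = np.zeros((8, 2))
scores[2:5, 0] = 0.9
video_level = topk_pool(scores, k=3)           # trained against video labels
segments = localize(scores, cls=0, threshold=0.5)
print(segments)  # [(2.0, 5.0)]
```

The pooling step is what makes training possible without boundaries: only the pooled video-level score is compared to the video-level label, while localization falls out of the per-snippet scores.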
Benchmarks
These leaderboards are used to track progress in Weakly Supervised Action Localization
Libraries
Use these libraries to find Weakly Supervised Action Localization models and implementations.
Most implemented papers
W-TALC: Weakly-supervised Temporal Activity Localization and Classification
Most activity localization methods in the literature suffer from the burden of frame-wise annotation requirement.
RefineLoc: Iterative Refinement for Weakly-Supervised Action Localization
RefineLoc shows competitive results with the state-of-the-art in weakly-supervised temporal localization.
Completeness Modeling and Context Separation for Weakly Supervised Temporal Action Localization
In this work, we first identify two underexplored problems posed by the weak supervision for temporal action localization, namely action completeness modeling and action-context separation.
3C-Net: Category Count and Center Loss for Weakly-Supervised Action Localization
Our joint formulation has three terms: a classification term to ensure the separability of learned action features, an adapted multi-label center loss term to enhance the discriminability of the action features, and a counting loss term to delineate adjacent action sequences, leading to improved localization.
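The three terms can be sketched in simplified form (illustrative stand-ins with hypothetical shapes and weights, not the paper's exact formulation):

```python
import numpy as np

def classification_loss(video_scores, labels):
    """Multi-label binary cross-entropy over per-class video scores (C,)."""
    p = 1.0 / (1.0 + np.exp(-video_scores))
    return -np.mean(labels * np.log(p + 1e-8) + (1 - labels) * np.log(1 - p + 1e-8))

def center_loss(features, labels, centers):
    """Mean squared distance of features (N, D) to their class centers (C, D)."""
    return np.mean(np.sum((features - centers[labels]) ** 2, axis=1))

def counting_loss(predicted_count, true_count):
    """Squared error between predicted and annotated action-instance counts."""
    return float((predicted_count - true_count) ** 2)

# Hypothetical weights combining the three terms into one objective.
rng = np.random.default_rng(0)
total = (classification_loss(np.array([2.0, -1.5]), np.array([1.0, 0.0]))
         + 0.5 * center_loss(rng.standard_normal((4, 8)),
                             np.array([0, 0, 1, 1]),
                             np.zeros((2, 8)))
         + 0.1 * counting_loss(2.8, 3))
```

The counting term is the distinctive piece: matching how many instances the model predicts to the annotated count gives a training signal about segment boundaries that the video-level label alone cannot provide.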
SF-Net: Single-Frame Supervision for Temporal Action Localization
To obtain the single-frame supervision, the annotators are asked to identify only a single frame within the temporal window of an action.
Weakly-Supervised Action Localization by Generative Attention Modeling
By maximizing the conditional probability with respect to the attention, the action and non-action frames are well separated.
Weakly-Supervised Action Localization with Expectation-Maximization Multi-Instance Learning
Weakly-supervised action localization requires training a model to localize the action segments in a video given only video-level action labels.
Adversarial Background-Aware Loss for Weakly-supervised Temporal Activity Localization
Two triplets of the feature space are considered in our approach: one triplet is used to learn discriminative features for each activity class, and the other is used to distinguish the features where no activity occurs (i.e., background features) from activity-related features for each video.
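A standard triplet margin loss illustrates the mechanism behind both triplets (the paper's specific triplet construction differs; this is only the generic building block): pull an anchor toward a positive of the same class and push it away from a negative, such as a background feature, by at least a margin.

```python
import numpy as np

def triplet_loss(anchor, positive, negative, margin=1.0):
    """Generic triplet margin loss: hinge on (d_pos - d_neg + margin)."""
    d_pos = np.linalg.norm(anchor - positive)
    d_neg = np.linalg.norm(anchor - negative)
    return max(0.0, d_pos - d_neg + margin)

a = np.array([1.0, 0.0])   # anchor: an activity feature
p = np.array([0.9, 0.1])   # positive: same activity class
n = np.array([-1.0, 0.0])  # negative: background feature
print(triplet_loss(a, p, n))  # 0.0 (negative already pushed past the margin)
```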
D2-Net: Weakly-Supervised Action Localization via Discriminative Embeddings and Denoised Activations
The proposed formulation comprises a discriminative and a denoising loss term for enhancing temporal action localization.
Temporal Action Segmentation from Timestamp Supervision
To demonstrate the effectiveness of timestamp supervision, we propose an approach to train a segmentation model using only timestamp annotations.