Temporal Localization

55 papers with code • 0 benchmarks • 3 datasets

Most implemented papers

Temporal Action Localization in Untrimmed Videos via Multi-stage CNNs

zhengshou/scnn CVPR 2016

To address this challenging issue, we exploit the effectiveness of deep networks in temporal action localization via three segment-based 3D ConvNets: (1) a proposal network identifies candidate segments in a long video that may contain actions; (2) a classification network learns a one-vs-all action classification model to serve as initialization for the localization network; and (3) a localization network fine-tunes on the learned classification network to localize each action instance.
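The inference-time flow of such a multi-stage design can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the window lengths, the overlap, both thresholds, the use of non-maximum suppression, and the names `sliding_windows`, `temporal_iou`, `localize`, `proposal_score`, and `localization_score` are all assumptions made for the example. The two scoring callables stand in for the trained proposal and localization networks; the classification network does not appear because, per the abstract, it only serves to initialize the localization network.

```python
from typing import Callable, List, Tuple

Segment = Tuple[int, int]  # (start_frame, end_frame), end exclusive


def sliding_windows(num_frames: int,
                    lengths: Tuple[int, ...] = (16, 32, 64),
                    overlap: float = 0.75) -> List[Segment]:
    """Generate candidate segments at several temporal scales (assumed scheme)."""
    segments = []
    for length in lengths:
        stride = max(1, int(length * (1 - overlap)))
        for start in range(0, max(1, num_frames - length + 1), stride):
            segments.append((start, min(start + length, num_frames)))
    return segments


def temporal_iou(a: Segment, b: Segment) -> float:
    """Intersection over union of two frame intervals."""
    inter = max(0, min(a[1], b[1]) - max(a[0], b[0]))
    union = (a[1] - a[0]) + (b[1] - b[0]) - inter
    return inter / union if union > 0 else 0.0


def localize(num_frames: int,
             proposal_score: Callable[[Segment], float],
             localization_score: Callable[[Segment], float],
             proposal_threshold: float = 0.5,
             nms_threshold: float = 0.3) -> List[Tuple[float, Segment]]:
    """Filter candidates with the proposal stage, rank with the localization stage."""
    # Stage 1: keep only segments the proposal stage considers likely actions.
    candidates = [s for s in sliding_windows(num_frames)
                  if proposal_score(s) >= proposal_threshold]
    # Stage 3: score the survivors, highest first.
    scored = sorted(((localization_score(s), s) for s in candidates), reverse=True)
    # Greedy non-maximum suppression over time (an assumed post-processing step).
    kept: List[Tuple[float, Segment]] = []
    for score, seg in scored:
        if all(temporal_iou(seg, k) <= nms_threshold for _, k in kept):
            kept.append((score, seg))
    return kept
```

With toy scorers in place of real networks, for example `localize(64, lambda s: 1.0, lambda s: temporal_iou(s, (16, 48)))`, the highest-ranked segment is the window that best matches the target interval.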

Online Human Action Detection using Joint Classification-Regression Recurrent Neural Networks

seanmcgovern21/Machine-Learning-CS539 19 Apr 2016

In this paper, we study the problem of online action detection from streaming skeleton data.

Am I Done? Predicting Action Progress in Videos

frans-db/progress-prediction 4 May 2017

In this paper we deal with the problem of predicting action progress in videos.

RefineLoc: Iterative Refinement for Weakly-Supervised Action Localization

HumamAlwassel/RefineLoc 30 Mar 2019

RefineLoc shows competitive results with the state-of-the-art in weakly-supervised temporal localization.

SoftLoc: Robust Temporal Localization under Label Misalignment

SoftLocICLR/submission 25 Sep 2019

This work addresses the long-standing problem of robust event localization in the presence of temporally misaligned labels in the training data.

Learning to Localize Temporal Events in Large-scale Video Data

mxbi/youtube8m-2019 25 Oct 2019

We address temporal localization of events in large-scale video data, in the context of the YouTube-8M Segments dataset.

Multi-attention Networks for Temporal Localization of Video-level Labels

mv-lab/youtube8m-19 15 Nov 2019

The model performance is further improved by constructing multiple sets of attention networks.

Why Can't I Dance in the Mall? Learning to Mitigate Scene Bias in Action Recognition

vt-vl-lab/SDN NeurIPS 2019

We validate the effectiveness of our method by transferring our pre-trained model to three different tasks, including action classification, temporal localization, and spatio-temporal action detection.

Weakly Supervised Temporal Action Localization Using Deep Metric Learning

asrafulashiq/wsad 21 Jan 2020

We propose a classification module to generate action labels for each segment in the video, and a deep metric learning module to learn the similarity between different action instances.

Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA

hyounghk/VideoQADenseCapFrameGate-ACL2020 ACL 2020

Moreover, our model also comprises dual-level attention (word/object and frame level), multi-head self/cross-integration for different sources (video and dense captions), and gates which pass more relevant information to the classifier.