Temporal Localization
55 papers with code • 0 benchmarks • 3 datasets
Most implemented papers
Temporal Action Localization in Untrimmed Videos via Multi-stage CNNs
To address this challenging issue, we exploit the effectiveness of deep networks in temporal action localization via three segment-based 3D ConvNets: (1) a proposal network identifies candidate segments in a long video that may contain actions; (2) a classification network learns a one-vs-all action classification model to serve as initialization for the localization network; and (3) a localization network fine-tunes on the learned classification network to localize each action instance.
Online Human Action Detection using Joint Classification-Regression Recurrent Neural Networks
In this paper, we study the problem of online action detection from streaming skeleton data.
Am I Done? Predicting Action Progress in Videos
In this paper we deal with the problem of predicting action progress in videos.
RefineLoc: Iterative Refinement for Weakly-Supervised Action Localization
RefineLoc achieves results competitive with the state of the art in weakly-supervised temporal localization.
SoftLoc: Robust Temporal Localization under Label Misalignment
This work addresses the long-standing problem of robust event localization in the presence of temporally misaligned labels in the training data.
Learning to Localize Temporal Events in Large-scale Video Data
We address temporal localization of events in large-scale video data, in the context of the YouTube-8M Segments dataset.
Multi-attention Networks for Temporal Localization of Video-level Labels
The model performance is further improved by constructing multiple sets of attention networks.
Why Can't I Dance in the Mall? Learning to Mitigate Scene Bias in Action Recognition
We validate the effectiveness of our method by transferring our pre-trained model to three different tasks, including action classification, temporal localization, and spatio-temporal action detection.
Weakly Supervised Temporal Action Localization Using Deep Metric Learning
We propose a classification module to generate action labels for each segment in the video, and a deep metric learning module to learn the similarity between different action instances.
Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA
Moreover, our model comprises dual-level attention (word/object and frame level), multi-head self/cross-integration for different sources (video and dense captions), and gates which pass more relevant information to the classifier.
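The multi-stage design described in the first entry above (a proposal stage that filters candidate segments, followed by a localization stage that labels the survivors) can be illustrated with a minimal sketch. This is not the authors' implementation: the sliding-window candidate generation, the function names (`sliding_segments`, `localize`), the threshold, and the toy scoring functions are all assumptions made for illustration, and real systems would replace the scorers with trained 3D ConvNets. The classification network is omitted because, per the description, it serves as initialization for the localization network.

```python
from typing import Callable, Dict, List, Tuple

Segment = Tuple[int, int]  # (start_frame, end_frame), end exclusive


def sliding_segments(num_frames: int, length: int, stride: int) -> List[Segment]:
    """Enumerate fixed-length candidate segments (an assumed candidate scheme)."""
    return [(s, s + length) for s in range(0, num_frames - length + 1, stride)]


def localize(
    num_frames: int,
    proposal_score: Callable[[Segment], float],
    class_scores: Callable[[Segment], Dict[str, float]],
    length: int = 16,
    stride: int = 8,
    proposal_threshold: float = 0.5,
) -> List[Tuple[Segment, str, float]]:
    """Keep segments the proposal scorer accepts, then label each survivor."""
    detections = []
    for seg in sliding_segments(num_frames, length, stride):
        # Proposal stage: discard segments unlikely to contain any action.
        if proposal_score(seg) < proposal_threshold:
            continue
        # Localization stage: pick the highest-scoring action label.
        scores = class_scores(seg)
        label = max(scores, key=scores.get)
        detections.append((seg, label, scores[label]))
    return detections


# Hypothetical stand-ins for trained networks.
def toy_proposal(seg: Segment) -> float:
    """Fraction of the segment overlapping a pretend action at frames 16..48."""
    start, end = seg
    overlap = max(0, min(end, 48) - max(start, 16))
    return overlap / (end - start)


def toy_classes(seg: Segment) -> Dict[str, float]:
    """Constant class scores, purely for demonstration."""
    return {"jump": 0.8, "run": 0.2}


dets = localize(64, toy_proposal, toy_classes)
for seg, label, score in dets:
    print(seg, label, score)
```

In practice, overlapping detections such as these would usually be merged or pruned in a post-processing step, which this sketch leaves out.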