Temporal Action Localization

421 papers with code • 14 benchmarks • 42 datasets

Temporal Action Localization aims to detect activities in the video stream and output beginning and end timestamps. It is closely related to Temporal Action Proposal Generation.

Libraries

Use these libraries to find Temporal Action Localization models and implementations
9 papers
3,866
4 papers
548
3 papers
2,972
See all 12 libraries.

Generative Model-based Feature Knowledge Distillation for Action Recognition

aaai-24/generative-based-kd 14 Dec 2023

Addressing this gap, our paper introduces an innovative knowledge distillation framework, with the generative model for training a lightweight student model.

7
14 Dec 2023

Online Action Recognition for Human Risk Prediction with Anticipated Haptic Alert via Wearables

ami-iit/paper_guo_2023_humanoids_lifting_risk_prediction 14 Dec 2023

This paper proposes a framework that combines online human state estimation, action recognition and motion prediction to enable early assessment and prevention of worker biomechanical risk during lifting tasks.

1
14 Dec 2023

EZ-CLIP: Efficient Zeroshot Video Action Recognition

shahzadnit/ez-clip 13 Dec 2023

Recent advancements in large-scale pre-training of visual-language models on paired image-text data have demonstrated impressive generalization capabilities for zero-shot tasks.

14
13 Dec 2023

Unsupervised Temporal Action Localization via Self-paced Incremental Learning

tanghaoyu258/feel 12 Dec 2023

Thereafter, we design two (constant- and variable- speed) incremental instance learning strategies for easy-to-hard model training, thus ensuring the reliability of these video pseudolabels and further improving overall localization performance.

3
12 Dec 2023

Towards a geometric understanding of Spatio Temporal Graph Convolution Networks

daspraty/stg-gradcam 12 Dec 2023

In this paper, we first propose to use a local Dataset Graph (DS-Graph) obtained from the feature representation of input data at each layer to develop an understanding of the layer-wise embedding geometry of the STGCN.

2
12 Dec 2023

End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames

sming256/OpenTAD 28 Nov 2023

Recently, temporal action detection (TAD) has seen significant performance improvement with end-to-end training.

63
28 Nov 2023

Bridging the Gap: A Unified Video Comprehension Framework for Moment Retrieval and Highlight Detection

easonxiao-888/uvcom 28 Nov 2023

Video Moment Retrieval (MR) and Highlight Detection (HD) have attracted significant attention due to the growing demand for video analysis.

33
28 Nov 2023

Challenges in Video-Based Infant Action Recognition: A Critical Examination of the State of the Art

ostadabbas/video-based-infant-action-recognition 21 Nov 2023

Automated human action recognition, a burgeoning field within computer vision, boasts diverse applications spanning surveillance, security, human-computer interaction, tele-health, and sports analysis.

4
21 Nov 2023

Learning Human Action Recognition Representations Without Real Humans

howardzh01/ppma NeurIPS 2023

To this end, we present, for the first time, a benchmark that leverages real-world videos with humans removed and synthetic data containing virtual humans to pre-train a model.

2
10 Nov 2023

FPGA-QHAR: Throughput-Optimized for Quantized Human Action Recognition on The Edge

azzam-alhussain/fpga-qhar 4 Nov 2023

Accelerating Human Action Recognition (HAR) efficiently for real-time surveillance and robotic systems on edge chips remains a challenging research field, given its high computational and memory requirements.

1
04 Nov 2023