Temporal Action Localization

421 papers with code • 14 benchmarks • 42 datasets

Temporal Action Localization aims to detect activities in the video stream and output beginning and end timestamps. It is closely related to Temporal Action Proposal Generation.

Benchmarks

Add a Result

These leaderboards are used to track progress in Temporal Action Localization

Dataset	Best Model	Compare
THUMOS’14	AdaTAD (VideoMAEv2-giant)	See all
ActivityNet-1.3	ActionMamba (InternVideo2-6B)	See all
HACS	ActionMamba(InternVideo2-6B)	See all
CrossTask	VideoCLIP	See all
MultiTHUMOS	TriDet (VideoMAEv2)	See all
FineAction	ActionMamba(InternVideo2-6B)	See all
EPIC-KITCHENS-100	AdaTAD (verb, VideoMAE-L)	See all
MUSES	TemporalMaxer	See all
MEXaction2	S-CNN	See all
ActivityNet-1.2	DeepMetricLearner	See all
THUMOS'14	AdaTAD (VideoMAEv2-giant)	See all
Ego4D MQ val	ActionFormer (SlowFast+Omnivore+EgoVLP)	See all
Ego4D MQ test	ActionFormer (SlowFast+Omnivore+EgoVLP)	See all
THUMOS14	BasicTAD (R50-SlowOnly)	See all

Show all 14 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Temporal Action Localization models and implementations

open-mmlab/mmaction2

9 papers

3,866

yjxiong/caffe

4 papers

548

towhee-io/towhee

3 papers

2,972

bryanyzhu/two-stream-pytorch

3 papers

553

See all 12 libraries.

Datasets

Subtasks

Temporal Action Proposal Generation

Activity Recognition In Videos

Action Recognition In Still Images

Latest papers

Most implemented Social Latest No code

Generative Model-based Feature Knowledge Distillation for Action Recognition

aaai-24/generative-based-kd • • 14 Dec 2023

Addressing this gap, our paper introduces an innovative knowledge distillation framework, with the generative model for training a lightweight student model.

14 Dec 2023

Paper
Code

Online Action Recognition for Human Risk Prediction with Anticipated Haptic Alert via Wearables

ami-iit/paper_guo_2023_humanoids_lifting_risk_prediction • • 14 Dec 2023

This paper proposes a framework that combines online human state estimation, action recognition and motion prediction to enable early assessment and prevention of worker biomechanical risk during lifting tasks.

14 Dec 2023

Paper
Code

EZ-CLIP: Efficient Zeroshot Video Action Recognition

shahzadnit/ez-clip • • 13 Dec 2023

Recent advancements in large-scale pre-training of visual-language models on paired image-text data have demonstrated impressive generalization capabilities for zero-shot tasks.

13 Dec 2023

Paper
Code

Unsupervised Temporal Action Localization via Self-paced Incremental Learning

tanghaoyu258/feel • • 12 Dec 2023

Thereafter, we design two (constant- and variable- speed) incremental instance learning strategies for easy-to-hard model training, thus ensuring the reliability of these video pseudolabels and further improving overall localization performance.

12 Dec 2023

Paper
Code

Towards a geometric understanding of Spatio Temporal Graph Convolution Networks

daspraty/stg-gradcam • • 12 Dec 2023

In this paper, we first propose to use a local Dataset Graph (DS-Graph) obtained from the feature representation of input data at each layer to develop an understanding of the layer-wise embedding geometry of the STGCN.

12 Dec 2023

Paper
Code

End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames

sming256/OpenTAD • • 28 Nov 2023

Recently, temporal action detection (TAD) has seen significant performance improvement with end-to-end training.

28 Nov 2023

Paper
Code

Bridging the Gap: A Unified Video Comprehension Framework for Moment Retrieval and Highlight Detection

easonxiao-888/uvcom • • 28 Nov 2023

Video Moment Retrieval (MR) and Highlight Detection (HD) have attracted significant attention due to the growing demand for video analysis.

28 Nov 2023

Paper
Code

Challenges in Video-Based Infant Action Recognition: A Critical Examination of the State of the Art

ostadabbas/video-based-infant-action-recognition • 21 Nov 2023

Automated human action recognition, a burgeoning field within computer vision, boasts diverse applications spanning surveillance, security, human-computer interaction, tele-health, and sports analysis.

21 Nov 2023

Paper
Code

Learning Human Action Recognition Representations Without Real Humans

howardzh01/ppma • • NeurIPS 2023

To this end, we present, for the first time, a benchmark that leverages real-world videos with humans removed and synthetic data containing virtual humans to pre-train a model.

10 Nov 2023

Paper
Code

FPGA-QHAR: Throughput-Optimized for Quantized Human Action Recognition on The Edge

azzam-alhussain/fpga-qhar • • 4 Nov 2023

Accelerating Human Action Recognition (HAR) efficiently for real-time surveillance and robotic systems on edge chips remains a challenging research field, given its high computational and memory requirements.

04 Nov 2023

Paper
Code

Temporal Action Localization

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers

Content

Benchmarks

Add a Result