The Wisdom of Crowds: Temporal Progressive Attention for Early Action Prediction

CVPR 2023 · Alexandros Stergiou, Dima Damen

Early action prediction deals with inferring the ongoing action from partially-observed videos, typically at the outset of the video. We propose a bottleneck-based attention model that captures the evolution of the action, through progressive sampling over fine-to-coarse scales. Our proposed Temporal Progressive (TemPr) model is composed of multiple attention towers, one for each scale. The predicted action label is based on the collective agreement considering confidences of these towers. Extensive experiments over four video datasets showcase state-of-the-art performance on the task of Early Action Prediction across a range of encoder architectures. We demonstrate the effectiveness and consistency of TemPr through detailed ablations.
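The two ideas in the abstract, progressive sampling over scales and a prediction fused from per-tower confidences, can be illustrated with a small sketch. This is not the authors' implementation. The function names (`progressive_scales`, `aggregate`), the choice that scale i spans the first i/n of the observed frames, and the use of each tower's maximum softmax probability as its confidence weight are all assumptions made for illustration; the aggregation used in the paper may differ.

```python
import math

def progressive_scales(num_observed, num_scales, frames_per_scale):
    """Return one list of frame indices per scale, from fine to coarse.

    Assumption: scale i covers the first i/num_scales of the observed frames
    and every scale samples the same number of frames uniformly.
    """
    scales = []
    for i in range(1, num_scales + 1):
        span = max(1, round(num_observed * i / num_scales))
        step = span / frames_per_scale
        scales.append([min(span - 1, int(k * step)) for k in range(frames_per_scale)])
    return scales

def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def aggregate(tower_logits):
    """Fuse per-tower class scores into one prediction.

    Assumption: each tower is weighted by its own confidence (its maximum
    softmax probability). This is a stand-in for the paper's aggregation.
    """
    probs = [softmax(logits) for logits in tower_logits]
    weights = [max(p) for p in probs]
    total = sum(weights)
    num_classes = len(probs[0])
    fused = [
        sum(w * p[c] for w, p in zip(weights, probs)) / total
        for c in range(num_classes)
    ]
    label = max(range(num_classes), key=fused.__getitem__)
    return label, fused

if __name__ == "__main__":
    # Four scales over 16 observed frames, 4 frames sampled per scale.
    for indices in progressive_scales(16, 4, 4):
        print(indices)
    # Hypothetical logits from three towers over three classes.
    print(aggregate([[2.0, 0.0, 0.0], [0.0, 1.0, 0.0], [3.0, 0.0, 0.0]]))
```

In this sketch a tower that is unsure of its prediction contributes less to the fused distribution, which is one simple way to read "collective agreement considering confidences".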


Results from the Paper


Task: Early Action Prediction
Model: TemPr4
Metric: Top-1 at the given observation ratio (obs. ratio)

Dataset                      Obs. ratio   Top-1   Global Rank
NTU RGB+D                    0.1          29.3    # 1
NTU RGB+D                    0.2          38.7    # 1
NTU RGB+D                    0.3          50.2    # 1
NTU RGB+D                    0.5          70.1    # 1
NTU RGB+D                    0.7          78.8    # 1
NTU RGB+D                    0.9          84.2    # 1
Something-Something sub21    0.1          28.4    # 1
Something-Something sub21    0.2          34.8    # 1
Something-Something sub21    0.3          37.9    # 1
Something-Something sub21    0.5          41.3    # 1
Something-Something sub21    0.7          45.8    # 1
Something-Something sub21    0.9          48.6    # 1
Something-Something V2       0.1          20.5    # 1
Something-Something V2       0.3          28.6    # 1
Something-Something V2       0.5          41.2    # 1
Something-Something V2       0.7          47.1    # 1
UCF101                       0.1          88.6    # 1
UCF101                       0.2          93.5    # 1
UCF101                       0.3          94.9    # 1
UCF101                       0.4          94.9    # 1
UCF101                       0.5          95.4    # 1
UCF101                       0.6          95.2    # 1
UCF101                       0.7          95.3    # 1
UCF101                       0.8          96.6    # 1
UCF101                       0.9          96.2    # 1
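
Each result above is a Top-1 score at a fixed observation ratio. The sketch below shows one plausible way to set up such an evaluation. It assumes that the observation ratio is the fraction of frames kept from the start of the video and that Top-1 is reported as a percentage. The helper names `observed_prefix` and `top1_accuracy` are hypothetical and are not taken from the paper or its code.

```python
import math

def observed_prefix(frames, obs_ratio):
    """Keep the first obs_ratio fraction of a video's frames (at least one).

    Assumption: partial observation means truncating from the end, so the
    model only sees the outset of the video.
    """
    if not 0.0 < obs_ratio <= 1.0:
        raise ValueError("obs_ratio must be in (0, 1]")
    # The small epsilon guards against float error such as 0.29 * 100 = 28.999...
    keep = max(1, math.floor(len(frames) * obs_ratio + 1e-9))
    return frames[:keep]

def top1_accuracy(predictions, labels):
    """Percentage of samples whose predicted class equals the ground truth."""
    if len(predictions) != len(labels) or not labels:
        raise ValueError("predictions and labels must be non-empty and equal in length")
    correct = sum(1 for p, y in zip(predictions, labels) if p == y)
    return 100.0 * correct / len(labels)

if __name__ == "__main__":
    video = list(range(10))  # stand-in for 10 decoded frames
    print(observed_prefix(video, 0.3))
    print(top1_accuracy([1, 2, 3, 0], [1, 2, 0, 0]))
```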
