TriDet: Temporal Action Detection with Relative Boundary Modeling

In this paper, we present a one-stage framework TriDet for temporal action detection. Existing methods often suffer from imprecise boundary predictions due to the ambiguous action boundaries in videos. To alleviate this problem, we propose a novel Trident-head to model the action boundary via an estimated relative probability distribution around the boundary. In the feature pyramid of TriDet, we propose an efficient Scalable-Granularity Perception (SGP) layer to mitigate the rank loss problem of self-attention that takes place in the video features and aggregate information across different temporal granularities. Benefiting from the Trident-head and the SGP-based feature pyramid, TriDet achieves state-of-the-art performance on three challenging benchmarks: THUMOS14, HACS and EPIC-KITCHEN 100, with lower computational costs, compared to previous methods. For example, TriDet hits an average mAP of $69.3\%$ on THUMOS14, outperforming the previous best by $2.5\%$, but with only $74.6\%$ of its latency. The code is released to https://github.com/sssste/TriDet.

PDF Abstract CVPR 2023 PDF CVPR 2023 Abstract
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Temporal Action Localization ActivityNet-1.3 TriDet (TSP features) mAP IOU@0.5 54.7 # 8
mAP 36.8 # 12
mAP IOU@0.75 38.0 # 4
mAP IOU@0.95 8.4 # 12
Temporal Action Localization EPIC-KITCHENS-100 TriDet (verb) Avg mAP (0.1-0.5) 25.4 # 2
mAP IOU@0.1 28.6 # 2
mAP IOU@0.2 27.4 # 2
mAP IOU@0.3 26.1 # 2
mAP IOU@0.4 24.2 # 2
mAP IOU@0.5 20.8 # 2
Temporal Action Localization HACS TriDet (SlowFast) Average-mAP 38.6 # 6
mAP@0.5 56.7 # 3
mAP@0.75 39.3 # 3
mAP@0.95 11.7 # 3
Temporal Action Localization HACS TriDet (I3D RGB) Average-mAP 36.8 # 7
mAP@0.5 54.5 # 4
mAP@0.75 36.8 # 4
mAP@0.95 11.5 # 4
Temporal Action Localization THUMOS’14 TriDet (I3D features) mAP IOU@0.5 72.9 # 5
mAP IOU@0.3 83.6 # 5
mAP IOU@0.4 80.1 # 3
mAP IOU@0.6 62.4 # 5
mAP IOU@0.7 47.4 # 5
Avg mAP (0.3:0.7) 69.3 # 8

Methods


No methods listed for this paper. Add relevant methods here