SlowFast Networks for Video Recognition

We present SlowFast networks for video recognition. Our model involves (i) a Slow pathway, operating at low frame rate, to capture spatial semantics, and (ii) a Fast pathway, operating at high frame rate, to capture motion at fine temporal resolution...
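The two-pathway idea in the abstract can be illustrated by how each pathway might sample frames from the same raw clip. The following is a minimal sketch, not the authors' implementation. It assumes that a model label such as "4x16" means 4 Slow-pathway frames taken with a temporal stride of 16, and that the Fast pathway runs at `alpha = 8` times the Slow frame rate; neither detail is stated on this page. The function name `slowfast_frame_indices` is hypothetical.

```python
def slowfast_frame_indices(num_slow_frames, slow_stride, alpha=8):
    """Return (slow_indices, fast_indices) into one raw clip.

    Assumptions (not stated on this page):
      - the Slow pathway takes `num_slow_frames` frames, one every
        `slow_stride` raw frames;
      - the Fast pathway samples `alpha` times more densely over the
        same clip, so its stride is slow_stride / alpha.
    """
    if alpha <= 0 or slow_stride % alpha != 0:
        raise ValueError("slow_stride must be a positive multiple of alpha")

    fast_stride = slow_stride // alpha
    clip_length = num_slow_frames * slow_stride  # raw frames covered by the clip

    # Low frame rate: few, widely spaced frames.
    slow_indices = list(range(0, clip_length, slow_stride))
    # High frame rate: many, closely spaced frames over the same span.
    fast_indices = list(range(0, clip_length, fast_stride))
    return slow_indices, fast_indices


if __name__ == "__main__":
    slow, fast = slowfast_frame_indices(num_slow_frames=4, slow_stride=16)
    print("Slow pathway frames:", slow)
    print("Fast pathway frame count:", len(fast))
```

Under these assumptions both pathways cover the same time span, and the Fast pathway simply sees `alpha` times as many frames, which is what lets it resolve finer motion.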

ICCV 2019
| Task | Dataset | Model | Metric | Value | Global Rank |
|---|---|---|---|---|---|
| Action Recognition In Videos | AVA v2.1 | SlowFast | mAP (Val) | 28.2 | #1 |
| Action Recognition | AVA v2.1 | SlowFast (Kinetics-400 pretraining) | mAP (Val) | 26.3 | #5 |
| Action Recognition | AVA v2.1 | SlowFast (Kinetics-600 pretraining) | mAP (Val) | 26.8 | #4 |
| Action Recognition | AVA v2.1 | SlowFast (Kinetics-600 pretraining, NL) | mAP (Val) | 27.3 | #3 |
| Action Recognition | AVA v2.1 | SlowFast++ (Kinetics-600 pretraining, NL) | mAP (Val) | 28.3 | #1 |
| Action Recognition | AVA v2.2 | SlowFast, 4x16, R50 (Kinetics-400 pretraining) | mAP | 21.9 | #10 |
| Action Recognition | AVA v2.2 | SlowFast, 8x8 R101+NL (Kinetics-600 pretraining) | mAP | 27.1 | #5 |
| Action Classification | Charades | SlowFast (Kinetics-400 pretraining, NL) | mAP | 42.5 | #16 |
| Action Classification | Charades | SlowFast (Kinetics-600 pretraining, NL) | mAP | 45.2 | #11 |
| Action Classification | Charades | SlowFast (Kinetics-600 pretraining) | mAP | 42.1 | #18 |
| Action Classification | Kinetics-400 | SlowFast 4x16 (ResNet-50) | Vid acc@1 | 75.6 | #58 |
| Action Classification | Kinetics-400 | SlowFast 4x16 (ResNet-50) | Vid acc@5 | 92.1 | #40 |
| Action Classification | Kinetics-400 | SlowFast 8x8 (ResNet-50) | Vid acc@1 | 77 | #49 |
| Action Classification | Kinetics-400 | SlowFast 8x8 (ResNet-50) | Vid acc@5 | 92.6 | #37 |
| Action Classification | Kinetics-400 | SlowFast 8x8 (ResNet-101) | Vid acc@1 | 77.9 | #37 |
| Action Classification | Kinetics-400 | SlowFast 8x8 (ResNet-101) | Vid acc@5 | 93.2 | #32 |
| Action Classification | Kinetics-400 | SlowFast 16x8 (ResNet-101) | Vid acc@1 | 78.9 | #28 |
| Action Classification | Kinetics-400 | SlowFast 16x8 (ResNet-101) | Vid acc@5 | 93.5 | #26 |
| Action Classification | Kinetics-400 | SlowFast (ResNet-101 + NL) | Vid acc@1 | 79.8 | #18 |
| Action Classification | Kinetics-400 | SlowFast 16x8 (ResNet-101 + NL) | Vid acc@5 | 93.9 | #20 |
| Action Classification | Kinetics-600 | SlowFast 16x8 (ResNet-101 + NL) | Top-1 Accuracy | 81.8 | #12 |
| Action Classification | Kinetics-600 | SlowFast 16x8 (ResNet-101 + NL) | Top-5 Accuracy | 95.1 | #12 |
| Action Classification | Kinetics-600 | SlowFast 8x8 (ResNet-101) | Top-1 Accuracy | 80.4 | #17 |
| Action Classification | Kinetics-600 | SlowFast 8x8 (ResNet-101) | Top-5 Accuracy | 94.8 | #15 |
| Action Classification | Kinetics-600 | SlowFast 8x8 (ResNet-50) | Top-1 Accuracy | 79.9 | #18 |
| Action Classification | Kinetics-600 | SlowFast 8x8 (ResNet-50) | Top-5 Accuracy | 94.5 | #16 |
| Action Classification | Kinetics-600 | SlowFast 16x8 (ResNet-101) | Top-1 Accuracy | 81.1 | #15 |
| Action Classification | Kinetics-600 | SlowFast 16x8 (ResNet-101) | Top-5 Accuracy | 95.1 | #12 |
| Action Classification | Kinetics-600 | SlowFast 4x16 (ResNet-50) | Top-1 Accuracy | 78.8 | #20 |
| Action Classification | Kinetics-600 | SlowFast 4x16 (ResNet-50) | Top-5 Accuracy | 94 | #17 |
| Action Recognition | Something-Something V2 | SlowFast | Top-1 Accuracy | 61.7 | #29 |

Results from Other Papers


| Task | Dataset | Model | Metric | Value | Global Rank |
|---|---|---|---|---|---|
| Action Recognition | AVA v2.2 | SlowFast, 8x8, R101 (Kinetics-400 pretraining) | mAP | 23.8 | #9 |
| Action Recognition | AVA v2.2 | SlowFast, 16x8 R101+NL (Kinetics-600 pretraining) | mAP | 27.5 | #2 |
| Action Recognition | Diving-48 | SlowFast | Accuracy | 77.6 | #3 |
