| TASK | DATASET | MODEL | METRIC NAME | METRIC VALUE | GLOBAL RANK |
|---|---|---|---|---|---|
| Self-Supervised Action Recognition | HMDB51 | ViCC (R2+1D; R+F) | Top-1 Accuracy | 61.5 | # 24 |
| Self-Supervised Action Recognition | HMDB51 | ViCC (R2+1D; R+F) | Pre-Training Dataset | UCF101 | # 1 |
| Self-Supervised Action Recognition | HMDB51 | ViCC (R2+1D; R+F) | Frozen | false | # 1 |
| Self-supervised Video Retrieval | HMDB51 | ViCC (R2+1D; R+F) | Top-1 | 28.3 | # 5 |
| Self-supervised Video Retrieval | HMDB51 | ViCC (R2+1D; R+F) | Pretrain | UCF101 | # 1 |
| Self-supervised Video Retrieval | HMDB51 | ViCC (R2+1D; RGB) | Top-1 | 25.3 | # 8 |
| Self-supervised Video Retrieval | HMDB51 | ViCC (R2+1D; RGB) | Pretrain | UCF101 | # 1 |
| Self-supervised Video Retrieval | HMDB51 | ViCC (S3D; RGB) | Top-1 | 25.5 | # 7 |
| Self-supervised Video Retrieval | HMDB51 | ViCC (S3D; RGB) | Pretrain | UCF101 | # 1 |
| Self-Supervised Action Recognition | HMDB51 | ViCC (S3D; RGB) | Top-1 Accuracy | 38.5 | # 36 |
| Self-Supervised Action Recognition | HMDB51 | ViCC (S3D; RGB) | Pre-Training Dataset | UCF101 | # 1 |
| Self-Supervised Action Recognition | HMDB51 | ViCC (S3D; RGB) | Frozen | true | # 1 |
| Self-Supervised Action Recognition | HMDB51 | ViCC (S3D; R+F) | Top-1 Accuracy | 62.2 | # 23 |
| Self-Supervised Action Recognition | HMDB51 | ViCC (S3D; R+F) | Pre-Training Dataset | UCF101 | # 1 |
| Self-Supervised Action Recognition | HMDB51 | ViCC (S3D; R+F) | Frozen | false | # 1 |
| Self-Supervised Action Recognition | HMDB51 | ViCC (R2+1D; RGB) | Top-1 Accuracy | 52.4 | # 33 |
| Self-Supervised Action Recognition | HMDB51 | ViCC (R2+1D; RGB) | Pre-Training Dataset | UCF101 | # 1 |
| Self-Supervised Action Recognition | HMDB51 | ViCC (R2+1D; RGB) | Frozen | false | # 1 |
| Self-supervised Video Retrieval | HMDB51 | ViCC (S3D; R+F) | Top-1 | 29.7 | # 3 |
| Self-supervised Video Retrieval | HMDB51 | ViCC (S3D; R+F) | Pretrain | UCF101 | # 1 |
| Self-Supervised Action Recognition | HMDB51 (finetuned) | ViCC (R2+1D; RGB) | Top-1 Accuracy | 52.4 | # 13 |
| Self-Supervised Action Recognition | HMDB51 (finetuned) | ViCC (R2+1D; RGB) | Pretraining Dataset | UCF101 | # 1 |
| Self-Supervised Action Recognition | HMDB51 (finetuned) | ViCC (S3D; RGB) | Top-1 Accuracy | 47.9 | # 14 |
| Self-Supervised Action Recognition | HMDB51 (finetuned) | ViCC (S3D; RGB) | Pretraining Dataset | UCF101 | # 1 |
| Self-Supervised Action Recognition | HMDB51 (finetuned) | ViCC (S3D; R+F) | Top-1 Accuracy | 62.2 | # 9 |
| Self-Supervised Action Recognition | HMDB51 (finetuned) | ViCC (S3D; R+F) | Pretraining Dataset | UCF101 | # 1 |
| Self-supervised Video Retrieval | UCF101 | ViCC (S3D; RGB) | Top-1 | 62.1 | # 6 |
| Self-supervised Video Retrieval | UCF101 | ViCC (S3D; RGB) | Pretrain | UCF101 | # 1 |
| Self-supervised Video Retrieval | UCF101 | ViCC (R2+1D; RGB) | Top-1 | 58.6 | # 8 |
| Self-supervised Video Retrieval | UCF101 | ViCC (R2+1D; RGB) | Pretrain | UCF101 | # 1 |
| Self-supervised Video Retrieval | UCF101 | ViCC (S3D; R+F) | Top-1 | 65.1 | # 4 |
| Self-supervised Video Retrieval | UCF101 | ViCC (S3D; R+F) | Pretrain | UCF101 | # 1 |
| Self-Supervised Action Recognition | UCF101 | ViCC (S3D; R+F) | 3-fold Accuracy | 90.5 | # 22 |
| Self-Supervised Action Recognition | UCF101 | ViCC (S3D; R+F) | Pre-Training Dataset | UCF101 | # 1 |
| Self-Supervised Action Recognition | UCF101 | ViCC (S3D; R+F) | Frozen | false | # 1 |
| Self-Supervised Action Recognition | UCF101 | ViCC (S3D; RGB) | 3-fold Accuracy | 72.2 | # 36 |
| Self-Supervised Action Recognition | UCF101 | ViCC (S3D; RGB) | Pre-Training Dataset | UCF101 | # 1 |
| Self-Supervised Action Recognition | UCF101 | ViCC (S3D; RGB) | Frozen | true | # 1 |
| Self-Supervised Action Recognition | UCF101 | ViCC (S3D; RGB) | 3-fold Accuracy | 88.8 | # 23 |
| Self-Supervised Action Recognition | UCF101 | ViCC (S3D; RGB) | Pre-Training Dataset | UCF101 | # 1 |
| Self-Supervised Action Recognition | UCF101 | ViCC (S3D; RGB) | Frozen | false | # 1 |
| Self-supervised Video Retrieval | UCF101 | ViCC (R2+1D; R+F) | Top-1 | 59.9 | # 7 |
| Self-supervised Video Retrieval | UCF101 | ViCC (R2+1D; R+F) | Pretrain | UCF101 | # 1 |
| Self-Supervised Action Recognition | UCF101 | ViCC (R2+1D; R+F) | 3-fold Accuracy | 88.8 | # 23 |
| Self-Supervised Action Recognition | UCF101 | ViCC (R2+1D; R+F) | Pre-Training Dataset | UCF101 | # 1 |
| Self-Supervised Action Recognition | UCF101 | ViCC (R2+1D; R+F) | Frozen | false | # 1 |
| Self-Supervised Action Recognition | UCF101 | ViCC (R2+1D; RGB) | 3-fold Accuracy | 82.8 | # 30 |
| Self-Supervised Action Recognition | UCF101 | ViCC (R2+1D; RGB) | Pre-Training Dataset | UCF101 | # 1 |
| Self-Supervised Action Recognition | UCF101 | ViCC (R2+1D; RGB) | Frozen | false | # 1 |
| Self-Supervised Action Recognition | UCF101 (finetuned) | ViCC (S3D; RGB) | 3-fold Accuracy | 84.3 | # 13 |
| Self-Supervised Action Recognition | UCF101 (finetuned) | ViCC (S3D; RGB) | Pretrain | UCF101 | # 1 |
| Self-Supervised Action Recognition | UCF101 (finetuned) | ViCC (S3D; R+F) | 3-fold Accuracy | 90.5 | # 9 |
| Self-Supervised Action Recognition | UCF101 (finetuned) | ViCC (S3D; R+F) | Pretrain | UCF101 | # 1 |
| Self-Supervised Action Recognition | UCF101 (finetuned) | ViCC (R2+1D; RGB) | 3-fold Accuracy | 82.8 | # 14 |
| Self-Supervised Action Recognition | UCF101 (finetuned) | ViCC (R2+1D; RGB) | Pretrain | UCF101 | # 1 |
| Self-Supervised Action Recognition | UCF101 (finetuned) | ViCC (R2+1D; R+F) | 3-fold Accuracy | 88.8 | # 11 |
| Self-Supervised Action Recognition | UCF101 (finetuned) | ViCC (R2+1D; R+F) | Pretrain | UCF101 | # 1 |