Self-Supervised Video Representation Learning with Space-Time Cubic Puzzles

24 Nov 2018Dahun KimDonghyeon ChoIn So Kweon

Self-supervised tasks such as colorization, inpainting and zigsaw puzzle have been utilized for visual representation learning for still images, when the number of labeled images is limited or absent at all. Recently, this worthwhile stream of study extends to video domain where the cost of human labeling is even more expensive... (read more)

PDF Abstract

Results from the Paper


#5 best model for Self-Supervised Action Recognition on UCF101 (using extra training data)

     Get a GitHub badge
TASK DATASET MODEL METRIC NAME METRIC VALUE GLOBAL RANK USES EXTRA
TRAINING DATA
RESULT LEADERBOARD
Self-Supervised Action Recognition UCF101 3D Cubic Puzzles (3D ResNet-18) 3-fold Accuracy 65.8 # 5