Self-Supervised Spatio-Temporal Representation Learning Using Variable Playback Speed Prediction

5 Mar 2020 · Hyeon Cho, Taehoon Kim, Hyung Jin Chang, Wonjun Hwang

We propose a self-supervised learning method that predicts the variable playback speeds of a video. Without semantic labels, we learn a spatio-temporal representation of the video by leveraging the variations in visual appearance across different playback speeds, under the assumption of temporal coherence...
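
The abstract does not say how the playback speeds are produced or how many there are. As a minimal sketch of the general idea, assume a playback speed is modeled as a frame-sampling stride, so a clip at speed s takes every s-th frame and the index of s becomes the label a network would be trained to predict. The function name `sample_speed_clip`, the clip length of 16, and the speed set (1, 2, 4, 8) are illustrative assumptions, not values taken from the paper.

```python
import random


def sample_speed_clip(num_frames, clip_len=16, speeds=(1, 2, 4, 8), rng=None):
    """Return (frame_indices, speed_label) for one self-labeled training clip.

    Assumption: "playback speed" is modeled as a sampling stride over frames.
    The label is the position of the chosen stride in `speeds`.
    """
    rng = rng or random.Random()

    # Keep only the speeds whose clip fits entirely inside the video.
    valid = [i for i, s in enumerate(speeds)
             if (clip_len - 1) * s + 1 <= num_frames]
    if not valid:
        raise ValueError("video too short for any playback speed")

    label = rng.choice(valid)
    stride = speeds[label]

    # Number of source frames covered by the clip at this stride.
    span = (clip_len - 1) * stride + 1
    start = rng.randrange(num_frames - span + 1)

    indices = [start + k * stride for k in range(clip_len)]
    return indices, label


if __name__ == "__main__":
    indices, label = sample_speed_clip(300, rng=random.Random(0))
    print("speed label:", label)
    print("frame indices:", indices)
```

The label comes for free from the sampling step, which is what makes the task self-supervised: no semantic annotation is needed, only the raw frames.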
