no code implementations • 5 May 2020 • Ramy Mounir, Roman Gula, Jörn Theuerkauf, Sudeep Sarkar
We present a self-supervised perceptual prediction framework capable of temporal event segmentation by building stable representations of objects over time and demonstrate it on long videos, spanning several days.