Self-supervised Video Retrieval

7 papers with code • 2 benchmarks • 2 datasets

This task has no description! Would you like to contribute one?

Most implemented papers

Self-supervised Video Representation Learning Using Inter-intra Contrastive Framework

BestJuly/Inter-intra-video-contrastive-learning 6 Aug 2020

With the proposed Inter-Intra Contrastive (IIC) framework, we can train spatio-temporal convolutional networks to learn video representations.

007: Democratically Finding The Cause of Packet Drops

behnazak/Vigil-007SourceCode 20 Feb 2018

Network failures continue to plague datacenter operators as their symptoms may not have direct correlation with where or why they occur.

Video Cloze Procedure for Self-Supervised Spatio-Temporal Learning

BestJuly/VCP 2 Jan 2020

As a proxy task, it converts rich self-supervised representations into video clip operations (options), which enhances the flexibility and reduces the complexity of representation learning.

Video Playback Rate Perception for Self-Supervised Spatio-Temporal Representation Learning

yuanyao366/PRP CVPR 2020

The generative perception model acts as a feature decoder to focus on comprehending high temporal resolution and short-term representation by introducing a motion-attention mechanism.

Pretext-Contrastive Learning: Toward Good Practices in Self-supervised Video Representation Leaning

BestJuly/Pretext-Contrastive-Learning 29 Oct 2020

It is convenient to treat PCL as a standard training strategy and apply it to many other works in self-supervised video feature learning.

TCLR: Temporal Contrastive Learning for Video Representation


However, prior work on contrastive learning for video data has not explored the effect of explicitly encouraging the features to be distinct across the temporal dimension.

Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting

martinetoering/ViCC 18 Jun 2021

Instance-level contrastive learning techniques, which rely on data augmentation and a contrastive loss function, have found great success in the domain of visual representation learning.