TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Self-supervised Video Retrieval	HMDB51	SLIC (R3D-18)	Top-1	28.9	# 4
Self-supervised Video Retrieval	HMDB51	SLIC (R3D-18)	Pretrain	UCF101	# 1
Self-Supervised Action Recognition	HMDB51	SLIC (R3D-18)	Top-1 Accuracy	54.5	# 29
Self-Supervised Action Recognition	HMDB51	SLIC (R3D-18)	Pre-Training Dataset	UCF101	# 1
Self-Supervised Action Recognition	HMDB51	SLIC (R3D-18)	Frozen	false	# 1
Self-Supervised Action Recognition	UCF101	SLIC (R3D-18)	Pre-Training Dataset	UCF101	# 1
Self-Supervised Action Recognition	UCF101	SLIC (R3D-18)	Frozen	false	# 1
Self-Supervised Action Recognition	UCF101	SLIC (R3D-18)	split-1 Top-1 Accuracy	83.2	# 1
Self-supervised Video Retrieval	UCF101	SLIC (R3D-18)	Top-1	71.6	# 3
Self-supervised Video Retrieval	UCF101	SLIC (R3D-18)	Pretrain	UCF101	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/slic-self-supervised-learning-with-iterative-1/self-supervised-action-recognition-on-ucf101)](https://paperswithcode.com/sota/self-supervised-action-recognition-on-ucf101?p=slic-self-supervised-learning-with-iterative-1)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/slic-self-supervised-learning-with-iterative-1/self-supervised-video-retrieval-on-ucf101)](https://paperswithcode.com/sota/self-supervised-video-retrieval-on-ucf101?p=slic-self-supervised-learning-with-iterative-1)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/slic-self-supervised-learning-with-iterative-1/self-supervised-video-retrieval-on-hmdb51)](https://paperswithcode.com/sota/self-supervised-video-retrieval-on-hmdb51?p=slic-self-supervised-learning-with-iterative-1)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/slic-self-supervised-learning-with-iterative-1/self-supervised-action-recognition-on-hmdb51)](https://paperswithcode.com/sota/self-supervised-action-recognition-on-hmdb51?p=slic-self-supervised-learning-with-iterative-1)`

SLIC: Self-Supervised Learning with Iterative Clustering for Human Action Videos

CVPR 2022 · Salar Hosseini Khorasgani, Yuxuan Chen, Florian Shkurti ·

Self-supervised methods have significantly closed the gap with end-to-end supervised learning for image classification. In the case of human action videos, however, where both appearance and motion are significant factors of variation, this gap remains significant. One of the key reasons for this is that sampling pairs of similar video clips, a required step for many self-supervised contrastive learning methods, is currently done conservatively to avoid false positives. A typical assumption is that similar clips only occur temporally close within a single video, leading to insufficient examples of motion similarity. To mitigate this, we propose SLIC, a clustering-based self-supervised contrastive learning method for human action videos. Our key contribution is that we improve upon the traditional intra-video positive sampling by using iterative clustering to group similar video instances. This enables our method to leverage pseudo-labels from the cluster assignments to sample harder positives and negatives. SLIC outperforms state-of-the-art video retrieval baselines by +15.4% on top-1 recall on UCF101 and by +5.7% when directly transferred to HMDB51. With end-to-end finetuning for action classification, SLIC achieves 83.2% top-1 accuracy (+0.8%) on UCF101 and 54.5% on HMDB51 (+1.6%). SLIC is also competitive with the state-of-the-art in action classification after self-supervised pretraining on Kinetics400.

PDF Abstract CVPR 2022 PDF CVPR 2022 Abstract

Code

Add Remove Mark official

rvl-lab-utoronto/video_similarity_s… official

Tasks

Add Remove

Action Classification

Clustering

Contrastive Learning

Retrieval

Self-Supervised Action Recognition

Self-Supervised Learning

Self-supervised Video Retrieval

Video Retrieval

Datasets

UCF101

Kinetics

HMDB51

Kinetics 400

Results from the Paper

Add Remove

Ranked #1 on Self-Supervised Action Recognition on UCF101 (Pre-Training Dataset metric)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Self-supervised Video Retrieval	HMDB51	SLIC (R3D-18)	Top-1	28.9	# 4	Compare
Self-supervised Video Retrieval	HMDB51	SLIC (R3D-18)	Pretrain	UCF101	# 1	Compare
Self-Supervised Action Recognition	HMDB51	SLIC (R3D-18)	Top-1 Accuracy	54.5	# 29	Compare
			Pre-Training Dataset	UCF101	# 1	Compare
			Frozen	false	# 1	Compare
Self-Supervised Action Recognition	UCF101	SLIC (R3D-18)	Pre-Training Dataset	UCF101	# 1	Compare
			Frozen	false	# 1	Compare
			split-1 Top-1 Accuracy	83.2	# 1	Compare
Self-supervised Video Retrieval	UCF101	SLIC (R3D-18)	Top-1	71.6	# 3	Compare
Self-supervised Video Retrieval	UCF101	SLIC (R3D-18)	Pretrain	UCF101	# 1	Compare

Methods

Add Remove

Contrastive Learning

Edit Social Preview

SLIC: Self-Supervised Learning with Iterative Clustering for Human Action Videos

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit Add Remove

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Add Remove

Methods

Add Remove