MV-MR: multi-views and multi-representations for self-supervised learning and knowledge distillation

21 Mar 2023 · Vitaliy Kinakh, Mariia Drozdova, Slava Voloshynovskiy

We present a new method for self-supervised learning and knowledge distillation based on multi-views and multi-representations (MV-MR). MV-MR maximizes the dependence between learnable embeddings of the augmented and non-augmented views, jointly with the dependence between the learnable embeddings of the augmented view and multiple non-learnable representations of the non-augmented view. We show that the proposed method can be used for efficient self-supervised classification and model-agnostic knowledge distillation. Unlike other self-supervised techniques, our approach uses no contrastive learning, clustering, or stop gradients. MV-MR is a generic framework that allows constraints to be imposed on the learnable embeddings by using image multi-representations as regularizers; along this line, knowledge distillation is treated as a particular case of such regularization. MV-MR achieves state-of-the-art performance on the STL10 and ImageNet-1K datasets among non-contrastive and clustering-free methods. We show that a lower-complexity ResNet50 model, pretrained with the proposed knowledge distillation from a CLIP ViT model, achieves state-of-the-art performance on STL10 linear evaluation. The code is available at: https://github.com/vkinakh/mv-mr
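To make the objective concrete, below is a minimal PyTorch sketch of the loss implied by the description above. It is an illustration under stated assumptions, not the authors' implementation: the function and argument names (`encoder`, `mv_mr_loss`, `fixed_representations`) are hypothetical, distance correlation is assumed as the dependence measure, and `fixed_representations` stands in for any stack of non-learnable descriptors of the non-augmented view.

```python
import torch

def distance_correlation(x: torch.Tensor, y: torch.Tensor, eps: float = 1e-9) -> torch.Tensor:
    """Empirical (squared) distance correlation between batches
    x: (n, d_x) and y: (n, d_y). Differentiable, so it can be maximized directly."""
    def doubly_centered_distances(z: torch.Tensor) -> torch.Tensor:
        d = torch.cdist(z, z)  # (n, n) pairwise Euclidean distances
        return d - d.mean(dim=0, keepdim=True) - d.mean(dim=1, keepdim=True) + d.mean()

    a = doubly_centered_distances(x)
    b = doubly_centered_distances(y)
    dcov2_xy = (a * b).mean()
    dcov2_xx = (a * a).mean()
    dcov2_yy = (b * b).mean()
    return dcov2_xy / (torch.sqrt(dcov2_xx * dcov2_yy) + eps)

def mv_mr_loss(encoder: torch.nn.Module,
               x: torch.Tensor,
               x_aug: torch.Tensor,
               fixed_representations: list[torch.Tensor]) -> torch.Tensor:
    """Hypothetical sketch of the MV-MR objective: maximize dependence between
    (i) learnable embeddings of the augmented and non-augmented views, and
    (ii) the augmented-view embedding and each non-learnable representation
    of the non-augmented view."""
    z = encoder(x)          # learnable embedding, non-augmented view
    z_aug = encoder(x_aug)  # learnable embedding, augmented view
    # No stop-gradient or detach anywhere, consistent with the abstract.
    loss = -distance_correlation(z_aug, z)
    for r in fixed_representations:  # precomputed, non-learnable features
        loss = loss - distance_correlation(z_aug, r.flatten(start_dim=1))
    return loss
```

In the knowledge-distillation setting mentioned above, the embeddings of a frozen teacher (e.g., a CLIP ViT) would simply be passed in as one of the non-learnable representations, which is why the paper can treat distillation as a special case of this regularization.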


Results

| Task | Dataset | Model | Metric | Value | Global Rank |
| --- | --- | --- | --- | --- | --- |
| Unsupervised Image Classification | STL-10 | MV-MR | Accuracy | 89.67 | #2 |
| Self-Supervised Learning | STL-10 | MV-MR | Accuracy | 89.67 | #3 |
