ID	rn50_in1k_deepclusterv2_400ep_2x160_4x96
LR	0.3
Epochs	400
Layers	50
Classes	1000
Momentum	0.9
Weight Decay	0.0
Width Multiplier	1
DeepCluster Loss Temperature	0.1

ID	rn50_in1k_deepclusterv2_400ep_2x224
LR	0.3
Epochs	400
Layers	50
Classes	1000
Momentum	0.9
Weight Decay	0.0
Width Multiplier	1
DeepCluster Loss Temperature	0.1

ID	rn50_in1k_deepclusterv2_800ep_2x224_6x96
LR	0.3
Epochs	800
Layers	50
Classes	1000
Momentum	0.9
Weight Decay	0.0
Width Multiplier	1
DeepCluster Loss Temperature	0.1

DeepClusterV2

facebookresearch / vissl

Last updated on Feb 27, 2021

Parameters 26 Million

FLOPs 4 Billion

File Size 111.37 MB

Training Data ImageNet

Training Resources 8x NVIDIA V100 GPUs

Training Time

Training Techniques	DeepCluster, Weight Decay, SGD with Momentum
Architecture	1x1 Convolution, Bottleneck Residual Block, Batch Normalization, Convolution, Global Average Pooling, Residual Block, Residual Connection, ReLU, Max Pooling, Softmax
ID	rn50_in1k_deepclusterv2_400ep_2x160_4x96
LR	0.3
Epochs	400
Layers	50
Classes	1000
Momentum	0.9
Weight Decay	0.0
Width Multiplier	1
DeepCluster Loss Temperature	0.1
SHOW MORE
SHOW LESS

Parameters 26 Million

FLOPs 4 Billion

File Size 111.37 MB

Training Data ImageNet

Training Resources 8x NVIDIA V100 GPUs

Training Time

Training Techniques	DeepCluster, Weight Decay, SGD with Momentum
Architecture	1x1 Convolution, Bottleneck Residual Block, Batch Normalization, Convolution, Global Average Pooling, Residual Block, Residual Connection, ReLU, Max Pooling, Softmax
ID	rn50_in1k_deepclusterv2_400ep_2x224
LR	0.3
Epochs	400
Layers	50
Classes	1000
Momentum	0.9
Weight Decay	0.0
Width Multiplier	1
DeepCluster Loss Temperature	0.1
SHOW MORE
SHOW LESS

Parameters 26 Million

FLOPs 4 Billion

File Size 111.37 MB

Training Data ImageNet

Training Resources 8x NVIDIA V100 GPUs

Training Time

Training Techniques	DeepCluster, Weight Decay, SGD with Momentum
Architecture	1x1 Convolution, Bottleneck Residual Block, Batch Normalization, Convolution, Global Average Pooling, Residual Block, Residual Connection, ReLU, Max Pooling, Softmax
ID	rn50_in1k_deepclusterv2_800ep_2x224_6x96
LR	0.3
Epochs	800
Layers	50
Classes	1000
Momentum	0.9
Weight Decay	0.0
Width Multiplier	1
DeepCluster Loss Temperature	0.1
SHOW MORE
SHOW LESS

README.md

Summary

DeepClusterV2 is a self-supervision approach for learning image representations. DeepCluster iteratively groups the features with a standard clustering algorithm, k-means, and uses the subsequent assignments as supervision to update the weights of the network. The second version of DeepCluster is obtained by obtained by applying various training improvements introduced in other self-supervised learning papers. Among these improvements are the use of stronger data augmentation, MLP projection head, cosine learning rate schedule, temperature parameter, memory bank, and multi-clustering.

How do I train this model?

Get started with VISSL by trying one of the Colab tutorial notebooks.

Citation

@misc{caron2021unsupervised,
      title={Unsupervised Learning of Visual Features by Contrasting Cluster Assignments}, 
      author={Mathilde Caron and Ishan Misra and Julien Mairal and Priya Goyal and Piotr Bojanowski and Armand Joulin},
      year={2021},
      eprint={2006.09882},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Results

Image Classification on ImageNet

Image Classification

BENCHMARK	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
ImageNet	DeepClusterV2 ResNet-50 (800 epochs, 2x224+6x96)	Top 1 Accuracy	75.18%	# 226
ImageNet	DeepClusterV2 ResNet-50 (400 epochs, 2x160+4x96)	Top 1 Accuracy	74.32%	# 239
ImageNet	DeepClusterV2 ResNet-50 (400 epochs, 2x224)	Top 1 Accuracy	70.01%	# 272