TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK	REMOVE
Knowledge Distillation	ImageNet	HSAKD (T: ResNet-34 S:ResNet-18)	Top-1 accuracy %	72.39	# 20
Knowledge Distillation	ImageNet	HSAKD (T: ResNet-34 S:ResNet-18)	CRD training setting	✘	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/hierarchical-self-supervised-augmented/knowledge-distillation-on-imagenet)](https://paperswithcode.com/sota/knowledge-distillation-on-imagenet?p=hierarchical-self-supervised-augmented)`

Hierarchical Self-supervised Augmented Knowledge Distillation

29 Jul 2021 · Chuanguang Yang, Zhulin An, Linhang Cai, Yongjun Xu ·

Knowledge distillation often involves how to define and transfer knowledge from teacher to student effectively. Although recent self-supervised contrastive knowledge achieves the best performance, forcing the network to learn such knowledge may damage the representation learning of the original class recognition task. We therefore adopt an alternative self-supervised augmented task to guide the network to learn the joint distribution of the original recognition task and self-supervised auxiliary task. It is demonstrated as a richer knowledge to improve the representation power without losing the normal classification capability. Moreover, it is incomplete that previous methods only transfer the probabilistic knowledge between the final layers. We propose to append several auxiliary classifiers to hierarchical intermediate feature maps to generate diverse self-supervised knowledge and perform the one-to-one transfer to teach the student network thoroughly. Our method significantly surpasses the previous SOTA SSKD with an average improvement of 2.56\% on CIFAR-100 and an improvement of 0.77\% on ImageNet across widely used network pairs. Codes are available at https://github.com/winycg/HSAKD.

PDF Abstract

Code

Add Remove Mark official

winycg/HSAKD official

Tasks

Add Remove

Knowledge Distillation

Representation Learning

Datasets

ImageNet

Results from the Paper

Edit

Ranked #20 on Knowledge Distillation on ImageNet

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Result	Benchmark
Knowledge Distillation	ImageNet	HSAKD (T: ResNet-34 S:ResNet-18)	Top-1 accuracy %	72.39	# 20		Compare
Knowledge Distillation	ImageNet	HSAKD (T: ResNet-34 S:ResNet-18)	CRD training setting	✘	# 1		Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Hierarchical Self-supervised Augmented Knowledge Distillation

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove