Distilling Knowledge via Knowledge Review

Knowledge distillation transfers knowledge from a teacher network to a student network, with the goal of substantially improving the student's performance. Previous methods mostly focus on designing feature transformations and loss functions between features at the same level to improve effectiveness. We instead study the connection paths across levels between the teacher and student networks, and reveal their great importance. For the first time in knowledge distillation, cross-stage connection paths are proposed. The resulting review mechanism is effective and structurally simple. The final nested and compact framework requires negligible computational overhead and outperforms other methods on a variety of tasks. We apply our method to classification, object detection, and instance segmentation, and all of these tasks show significant improvement in student network performance. Code is available at https://github.com/Jia-Research-Lab/ReviewKD

CVPR 2021
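The abstract describes the core idea at a high level: instead of matching teacher and student features only at the same stage, the student's features are connected to teacher features across stages. Below is a minimal PyTorch sketch of that cross-stage review idea, assuming a simple additive fusion and an MSE loss in place of the paper's attention-based fusion (ABF) and hierarchical context loss (HCL); the module name ReviewSketchLoss and the mid_channels parameter are illustrative and not taken from the released code.

```python
# Simplified sketch of cross-stage review distillation (not the authors' exact
# implementation). Student features are fused from deep to shallow, and each
# fused feature is matched against the teacher feature at that stage.
import torch
import torch.nn as nn
import torch.nn.functional as F


class ReviewSketchLoss(nn.Module):
    def __init__(self, student_channels, teacher_channels, mid_channels=64):
        super().__init__()
        # Project every student stage into a shared feature space ...
        self.align = nn.ModuleList(
            nn.Conv2d(c, mid_channels, kernel_size=1) for c in student_channels
        )
        # ... and back out to the matching teacher stage's channel count.
        self.out = nn.ModuleList(
            nn.Conv2d(mid_channels, c, kernel_size=3, padding=1)
            for c in teacher_channels
        )

    def forward(self, student_feats, teacher_feats):
        # Feature lists are ordered shallow -> deep. Iterating from deep to
        # shallow lets each shallower stage "review" deeper knowledge, which
        # creates the cross-stage connection paths between student and teacher.
        loss, fused = 0.0, None
        stages = list(zip(student_feats, teacher_feats, self.align, self.out))
        for s_feat, t_feat, align, out in reversed(stages):
            x = align(s_feat)
            if fused is not None:
                # Crude stand-in for ABF: upsample the deeper fused feature
                # and add it to the current stage.
                fused = x + F.interpolate(fused, size=x.shape[-2:], mode="nearest")
            else:
                fused = x
            # Plain MSE against the frozen teacher feature stands in for HCL.
            y = out(fused)
            if y.shape[-2:] != t_feat.shape[-2:]:
                y = F.interpolate(y, size=t_feat.shape[-2:], mode="nearest")
            loss = loss + F.mse_loss(y, t_feat.detach())
        return loss


# Hypothetical usage for a 3-stage backbone pair:
# kd_loss = ReviewSketchLoss([64, 128, 256], [128, 256, 512])
# loss = kd_loss(student_feats, teacher_feats)  # lists of NCHW tensors, shallow -> deep
```

Fusing the student's features from deep to shallow means every teacher stage is compared against a student representation that already carries deeper knowledge, which is one compact way to realize cross-stage connections without a quadratic number of pairwise losses.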

Results from the Paper


Task                   | Dataset   | Model                                          | Metric Name        | Metric Value | Global Rank
Knowledge Distillation | CIFAR-100 | resnet8x4 (T: resnet32x4, S: resnet8x4)        | Top-1 Accuracy (%) | 75.63        | # 12
Knowledge Distillation | CIFAR-100 | vgg8 (T: vgg13, S: vgg8)                       | Top-1 Accuracy (%) | 74.84        | # 15
Knowledge Distillation | ImageNet  | Knowledge Review (T: ResNet-34, S: ResNet-18)  | Top-1 Accuracy (%) | 71.61        | # 33 (# 1 under the CRD training setting)

Methods


No methods listed for this paper.