torchdistill: A Modular, Configuration-Driven Framework for Knowledge Distillation

25 Nov 2020 · Yoshitomo Matsubara

While knowledge distillation (transfer) has been attracting attention from the research community, recent developments in the field have heightened the need for reproducible studies and well-generalized frameworks that lower the barriers to high-quality, reproducible deep learning research. Several researchers have voluntarily published the frameworks used in their knowledge distillation studies to help others reproduce their original work. Such frameworks, however, are usually neither well generalized nor actively maintained, so researchers still have to write a great deal of code to refactor or build on them when introducing new methods, models, and datasets or designing experiments. In this paper, we present an open-source framework built on PyTorch and dedicated to knowledge distillation studies. The framework lets users design experiments through declarative PyYAML configuration files and helps researchers complete the recently proposed ML Code Completeness Checklist. Using the framework, we demonstrate various efficient training strategies and implement a variety of knowledge distillation methods. We also reproduce some of the original experimental results on the ImageNet and COCO datasets presented at major machine learning conferences such as ICLR, NeurIPS, CVPR, and ECCV, including recent state-of-the-art methods. All the source code, configurations, log files, and trained model weights are publicly available at https://github.com/yoshitomo-matsubara/torchdistill .
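To illustrate the declarative, configuration-driven style the abstract describes, the following is a minimal sketch of loading a PyYAML experiment configuration in Python. The key names (`teacher_model`, `student_model`, `criterion`, and so on) are illustrative assumptions, not torchdistill's actual schema; see the configurations in the repository for the real format.

```python
import yaml

# A hypothetical experiment configuration in the declarative style the
# paper describes. The key names below are illustrative assumptions,
# not torchdistill's actual schema.
CONFIG = """
teacher_model:
  name: resnet34
  pretrained: true
student_model:
  name: resnet18
  pretrained: false
train:
  num_epochs: 100
  criterion:
    name: kd_loss        # vanilla knowledge distillation loss
    params:
      temperature: 4.0
      alpha: 0.5
"""

config = yaml.safe_load(CONFIG)
print(config["train"]["criterion"]["params"])
# {'temperature': 4.0, 'alpha': 0.5}
```

The appeal of this design is that swapping a teacher, student, loss, or hyperparameter becomes an edit to a version-controllable text file rather than a code change, which is what makes the experiments easy to reproduce and share.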

| Task | Dataset | Model | Metric | Value | Global Rank |
|---|---|---|---|---|---|
| Object Detection | COCO test-dev | Faster R-CNN (Bottleneck-injected ResNet-50, FPN) | box mAP | 35.9 | #218 |
| Object Detection | COCO test-dev | Mask R-CNN (Bottleneck-injected ResNet-50, FPN) | box mAP | 36.9 | #213 |
| Instance Segmentation | COCO test-dev | Mask R-CNN (Bottleneck-injected ResNet-50, FPN) | mask AP | 33.6 | #98 |
| Image Classification | ImageNet | ResNet-18 (KD w/ ResNet-34 teacher) | Top 1 Accuracy | 71.37% | #938 |
| Image Classification | ImageNet | ResNet-18 (L2 w/ ResNet-34 teacher) | Top 1 Accuracy | 71.08% | #941 |
| Image Classification | ImageNet | ResNet-18 (SSKD w/ ResNet-34 teacher) | Top 1 Accuracy | 70.09% | #950 |
| Image Classification | ImageNet | ResNet-18 (CRD w/ ResNet-34 teacher) | Top 1 Accuracy | 70.93% | #943 |
| Image Classification | ImageNet | ResNet-18 (FT w/ ResNet-34 teacher) | Top 1 Accuracy | 71.56% | #935 |
| Image Classification | ImageNet | ResNet-18 (PAD-L2 w/ ResNet-34 teacher) | Top 1 Accuracy | 71.71% | #932 |
| Image Classification | ImageNet | ResNet-18 (tf-KD w/ ResNet-18 teacher) | Top 1 Accuracy | 70.52% | #947 |
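For reference, the "KD" rows above use the vanilla knowledge distillation objective of Hinton et al. (2015), which blends hard-label cross-entropy with a temperature-scaled KL divergence against the teacher's softened outputs. Below is a minimal PyTorch sketch of that loss; the hyperparameter values are illustrative, not the settings used for the results above.

```python
import torch
import torch.nn.functional as F

def kd_loss(student_logits, teacher_logits, labels, temperature=4.0, alpha=0.5):
    """Vanilla knowledge distillation loss (Hinton et al., 2015).

    Combines hard-label cross-entropy with a KL divergence between the
    temperature-softened teacher and student distributions. The T**2
    factor keeps gradient magnitudes comparable across temperatures.
    """
    hard_loss = F.cross_entropy(student_logits, labels)
    soft_loss = F.kl_div(
        F.log_softmax(student_logits / temperature, dim=1),
        F.softmax(teacher_logits / temperature, dim=1),
        reduction="batchmean",
    ) * (temperature ** 2)
    return alpha * hard_loss + (1.0 - alpha) * soft_loss

# Example usage with random tensors standing in for a real batch:
student_logits = torch.randn(8, 1000)   # e.g., ResNet-18 outputs
teacher_logits = torch.randn(8, 1000)   # e.g., ResNet-34 outputs
labels = torch.randint(0, 1000, (8,))
print(kd_loss(student_logits, teacher_logits, labels))
```

The other rows (L2, SSKD, CRD, FT, PAD-L2, tf-KD) correspond to alternative distillation objectives from the literature that the framework also implements; their losses differ, but they plug into the same configuration-driven training loop.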
