TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Neural Architecture Search	CIFAR-10	DNA-c	Top-1 Error Rate	1.7%	# 2
Neural Architecture Search	CIFAR-100	DNA-c	Percentage Error	11.7	# 1
Neural Architecture Search	ImageNet	DNA-a	Top-1 Error Rate	22.9	# 72
Neural Architecture Search	ImageNet	DNA-a	Accuracy	77.1	# 58
Neural Architecture Search	ImageNet	DNA-a	FLOPs	348M	# 116
Neural Architecture Search	ImageNet	DNA-a	Params	4.2M	# 53
Neural Architecture Search	ImageNet	DNA-b	Top-1 Error Rate	22.5	# 62
Neural Architecture Search	ImageNet	DNA-b	Accuracy	77.5	# 50
Neural Architecture Search	ImageNet	DNA-b	FLOPs	406M	# 119
Neural Architecture Search	ImageNet	DNA-b	Params	4.9M	# 45
Neural Architecture Search	ImageNet	DNA-d	Top-1 Error Rate	21.6	# 47
Neural Architecture Search	ImageNet	DNA-d	Accuracy	78.4	# 36
Neural Architecture Search	ImageNet	DNA-d	FLOPs	611M	# 130
Neural Architecture Search	ImageNet	DNA-d	Params	6.4M	# 18
Neural Architecture Search	ImageNet	DNA-c	Top-1 Error Rate	22.2	# 58
Neural Architecture Search	ImageNet	DNA-c	Accuracy	77.8	# 46
Neural Architecture Search	ImageNet	DNA-c	FLOPs	466M	# 122
Neural Architecture Search	ImageNet	DNA-c	Params	5.3M	# 36

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/blockwisely-supervised-neural-architecture/neural-architecture-search-on-cifar-100-1)](https://paperswithcode.com/sota/neural-architecture-search-on-cifar-100-1?p=blockwisely-supervised-neural-architecture)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/blockwisely-supervised-neural-architecture/neural-architecture-search-on-cifar-10)](https://paperswithcode.com/sota/neural-architecture-search-on-cifar-10?p=blockwisely-supervised-neural-architecture)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/blockwisely-supervised-neural-architecture/neural-architecture-search-on-imagenet)](https://paperswithcode.com/sota/neural-architecture-search-on-imagenet?p=blockwisely-supervised-neural-architecture)`

Blockwisely Supervised Neural Architecture Search with Knowledge Distillation

29 Nov 2019 · Changlin Li, Jiefeng Peng, Liuchun Yuan, Guangrun Wang, Xiaodan Liang, Liang Lin, Xiaojun Chang ·

Neural Architecture Search (NAS), aiming at automatically designing network architectures by machines, is hoped and expected to bring about a new revolution in machine learning. Despite these high expectation, the effectiveness and efficiency of existing NAS solutions are unclear, with some recent works going so far as to suggest that many existing NAS solutions are no better than random architecture selection. The inefficiency of NAS solutions may be attributed to inaccurate architecture evaluation. Specifically, to speed up NAS, recent works have proposed under-training different candidate architectures in a large search space concurrently by using shared network parameters; however, this has resulted in incorrect architecture ratings and furthered the ineffectiveness of NAS. In this work, we propose to modularize the large search space of NAS into blocks to ensure that the potential candidate architectures are fully trained; this reduces the representation shift caused by the shared parameters and leads to the correct rating of the candidates. Thanks to the block-wise search, we can also evaluate all of the candidate architectures within a block. Moreover, we find that the knowledge of a network model lies not only in the network parameters but also in the network architecture. Therefore, we propose to distill the neural architecture (DNA) knowledge from a teacher model as the supervision to guide our block-wise architecture search, which significantly improves the effectiveness of NAS. Remarkably, the capacity of our searched architecture has exceeded the teacher model, demonstrating the practicability and scalability of our method. Finally, our method achieves a state-of-the-art 78.4\% top-1 accuracy on ImageNet in a mobile setting, which is about a 2.1\% gain over EfficientNet-B0. All of our searched models along with the evaluation code are available online.

PDF Abstract

Code

Add Remove Mark official

changlin31/DNA official

230

Tasks

Add Remove

Knowledge Distillation

Neural Architecture Search

Datasets

CIFAR-10

ImageNet

CIFAR-100

Results from the Paper

Edit

Ranked #1 on Neural Architecture Search on CIFAR-100

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Neural Architecture Search	CIFAR-10	DNA-c	Top-1 Error Rate	1.7%	# 2	Compare
Neural Architecture Search	CIFAR-100	DNA-c	Percentage Error	11.7	# 1	Compare
Neural Architecture Search	ImageNet	DNA-a	Top-1 Error Rate	22.9	# 72	Compare
			Accuracy	77.1	# 58	Compare
			FLOPs	348M	# 116	Compare
			Params	4.2M	# 53	Compare
Neural Architecture Search	ImageNet	DNA-b	Top-1 Error Rate	22.5	# 62	Compare
			Accuracy	77.5	# 50	Compare
			FLOPs	406M	# 119	Compare
			Params	4.9M	# 45	Compare
Neural Architecture Search	ImageNet	DNA-d	Top-1 Error Rate	21.6	# 47	Compare
			Accuracy	78.4	# 36	Compare
			FLOPs	611M	# 130	Compare
			Params	6.4M	# 18	Compare
Neural Architecture Search	ImageNet	DNA-c	Top-1 Error Rate	22.2	# 58	Compare
			Accuracy	77.8	# 46	Compare
			FLOPs	466M	# 122	Compare
			Params	5.3M	# 36	Compare

Methods

Add Remove

SPEED

Edit Social Preview

Blockwisely Supervised Neural Architecture Search with Knowledge Distillation

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove