TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Image Classification	CIFAR-10	Proxyless-G + c/o	Percentage correct	97.92	# 56
Image Classification	CIFAR-10	Proxyless-G + c/o	PARAMS	5.7M	# 197
Neural Architecture Search	CIFAR-10 Image Classification	Proxyless-G + c/o	Percentage error	2.08	# 6
Neural Architecture Search	CIFAR-10 Image Classification	Proxyless-G + c/o	Params	5.7M	# 13
Image Classification	ImageNet	Proxyless	Top 1 Accuracy	74.6%	# 902
Image Classification	ImageNet	Proxyless	Number of params	4.0M	# 377
Neural Architecture Search	ImageNet	ProxylesNAS	Top-1 Error Rate	24.9	# 113
Neural Architecture Search	ImageNet	ProxylesNAS	Accuracy	75.1	# 90
Neural Architecture Search	ImageNet	ProxylesNAS	Params	5.1M	# 41
Neural Architecture Search	ImageNet	ProxylesNAS	MACs	581M	# 124

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/proxylessnas-direct-neural-architecture/architecture-search-on-cifar-10-image)](https://paperswithcode.com/sota/architecture-search-on-cifar-10-image?p=proxylessnas-direct-neural-architecture)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/proxylessnas-direct-neural-architecture/image-classification-on-cifar-10)](https://paperswithcode.com/sota/image-classification-on-cifar-10?p=proxylessnas-direct-neural-architecture)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/proxylessnas-direct-neural-architecture/neural-architecture-search-on-imagenet)](https://paperswithcode.com/sota/neural-architecture-search-on-imagenet?p=proxylessnas-direct-neural-architecture)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/proxylessnas-direct-neural-architecture/image-classification-on-imagenet)](https://paperswithcode.com/sota/image-classification-on-imagenet?p=proxylessnas-direct-neural-architecture)`

ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware

ICLR 2019 · Han Cai, Ligeng Zhu, Song Han ·

Neural architecture search (NAS) has a great impact by automatically designing effective neural network architectures. However, the prohibitive computational demand of conventional NAS algorithms (e.g. $10^4$ GPU hours) makes it difficult to \emph{directly} search the architectures on large-scale tasks (e.g. ImageNet). Differentiable NAS can reduce the cost of GPU hours via a continuous representation of network architecture but suffers from the high GPU memory consumption issue (grow linearly w.r.t. candidate set size). As a result, they need to utilize~\emph{proxy} tasks, such as training on a smaller dataset, or learning with only a few blocks, or training just for a few epochs. These architectures optimized on proxy tasks are not guaranteed to be optimal on the target task. In this paper, we present \emph{ProxylessNAS} that can \emph{directly} learn the architectures for large-scale target tasks and target hardware platforms. We address the high memory consumption issue of differentiable NAS and reduce the computational cost (GPU hours and GPU memory) to the same level of regular training while still allowing a large candidate set. Experiments on CIFAR-10 and ImageNet demonstrate the effectiveness of directness and specialization. On CIFAR-10, our model achieves 2.08\% test error with only 5.7M parameters, better than the previous state-of-the-art architecture AmoebaNet-B, while using 6$\times$ fewer parameters. On ImageNet, our model achieves 3.1\% better top-1 accuracy than MobileNetV2, while being 1.2$\times$ faster with measured GPU latency. We also apply ProxylessNAS to specialize neural architectures for hardware with direct hardware metrics (e.g. latency) and provide insights for efficient CNN architecture design.

PDF Abstract ICLR 2019 PDF ICLR 2019 Abstract

Code

Add Remove Mark official

MIT-HAN-LAB/ProxylessNAS official

1,409

osmr/imgclsmob

2,917

mit-han-lab/once-for-all

↳ Quickstart in

Colab

1,836

mit-han-lab/ProxylessNAS

↳ Quickstart in

Colab

PyTorch Hub

1,409

mit-han-lab/amc

417

See all 23 implementations

Tasks

Add Remove

Image Classification

Neural Architecture Search

Datasets

CIFAR-10

ImageNet

Results from the Paper

Edit

Ranked #6 on Neural Architecture Search on CIFAR-10 Image Classification (using extra training data)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Image Classification	CIFAR-10	Proxyless-G + c/o	Percentage correct	97.92	# 56	Compare
Image Classification	CIFAR-10	Proxyless-G + c/o	PARAMS	5.7M	# 197	Compare
Neural Architecture Search	CIFAR-10 Image Classification	Proxyless-G + c/o	Percentage error	2.08	# 6	Compare
Neural Architecture Search	CIFAR-10 Image Classification	Proxyless-G + c/o	Params	5.7M	# 13	Compare
Image Classification	ImageNet	Proxyless	Top 1 Accuracy	74.6%	# 902	Compare
Image Classification	ImageNet	Proxyless	Number of params	4.0M	# 377	Compare
Neural Architecture Search	ImageNet	ProxylesNAS	Top-1 Error Rate	24.9	# 113	Compare
			Accuracy	75.1	# 90	Compare
			Params	5.1M	# 41	Compare
			MACs	581M	# 124	Compare

Methods

Add Remove

1x1 Convolution • Adam • Average Pooling • Batch Normalization • Convolution • Cutout • Depthwise Convolution • Depthwise Separable Convolution • Differentiable NAS • DropPath • Global Average Pooling • Inverted Residual Block • MobileNetV2 • Pointwise Convolution • ProxylessNAS • ProxylessNet-CPU • ProxylessNet-GPU • ProxylessNet-Mobile • REINFORCE

Edit Social Preview

ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove