TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Neural Architecture Search	CIFAR-10	PC-DARTS	Top-1 Error Rate	2.57%	# 27
Neural Architecture Search	CIFAR-10	PC-DARTS	Search Time (GPU days)	0.1	# 3
Neural Architecture Search	CIFAR-10	PC-DARTS	Parameters	3.6M	# 23
Neural Architecture Search	CIFAR-10	PC-DARTS-CIFAR	Top-1 Error Rate	2.51%	# 20
Neural Architecture Search	ImageNet	PC-DARTS (ImageNet)	Top-1 Error Rate	24.2	# 102
Neural Architecture Search	ImageNet	PC-DARTS (ImageNet)	Accuracy	75.8	# 81
Neural Architecture Search	ImageNet	PC-DARTS (ImageNet)	Params	5.3M	# 36
Neural Architecture Search	ImageNet	PC-DARTS (ImageNet)	MACs	597M	# 129

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/pc-darts-partial-channel-connections-for/neural-architecture-search-on-cifar-10)](https://paperswithcode.com/sota/neural-architecture-search-on-cifar-10?p=pc-darts-partial-channel-connections-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/pc-darts-partial-channel-connections-for/neural-architecture-search-on-imagenet)](https://paperswithcode.com/sota/neural-architecture-search-on-imagenet?p=pc-darts-partial-channel-connections-for)`

PC-DARTS: Partial Channel Connections for Memory-Efficient Architecture Search

ICLR 2020 · Yuhui Xu, Lingxi Xie, Xiaopeng Zhang, Xin Chen, Guo-Jun Qi, Qi Tian, Hongkai Xiong ·

Differentiable architecture search (DARTS) provided a fast solution in finding effective network architectures, but suffered from large memory and computing overheads in jointly training a super-network and searching for an optimal architecture. In this paper, we present a novel approach, namely, Partially-Connected DARTS, by sampling a small part of super-network to reduce the redundancy in exploring the network space, thereby performing a more efficient search without comprising the performance. In particular, we perform operation search in a subset of channels while bypassing the held out part in a shortcut. This strategy may suffer from an undesired inconsistency on selecting the edges of super-net caused by sampling different channels. We alleviate it using edge normalization, which adds a new set of edge-level parameters to reduce uncertainty in search. Thanks to the reduced memory cost, PC-DARTS can be trained with a larger batch size and, consequently, enjoys both faster speed and higher training stability. Experimental results demonstrate the effectiveness of the proposed method. Specifically, we achieve an error rate of 2.57% on CIFAR10 with merely 0.1 GPU-days for architecture search, and a state-of-the-art top-1 error rate of 24.2% on ImageNet (under the mobile setting) using 3.8 GPU-days for search. Our code has been made available at: https://github.com/yuhuixu1993/PC-DARTS.

PDF Abstract ICLR 2020 PDF ICLR 2020 Abstract

Code

Add Remove Mark official

yuhuixu1993/PC-DARTS official

432

chenxin061/pdarts

360

peteryuX/pcdarts-tf2

MS-Mind/MS-Code-06

ddghost/new_darts

See all 8 implementations

Tasks

Add Remove

Neural Architecture Search

Datasets

CIFAR-10

ImageNet

MS COCO

CIFAR-100

ssd

Results from the Paper

Edit

Ranked #20 on Neural Architecture Search on CIFAR-10

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Neural Architecture Search	CIFAR-10	PC-DARTS	Top-1 Error Rate	2.57%	# 27	Compare
			Search Time (GPU days)	0.1	# 3	Compare
			Parameters	3.6M	# 23	Compare
Neural Architecture Search	CIFAR-10	PC-DARTS-CIFAR	Top-1 Error Rate	2.51%	# 20	Compare
Neural Architecture Search	ImageNet	PC-DARTS (ImageNet)	Top-1 Error Rate	24.2	# 102	Compare
			Accuracy	75.8	# 81	Compare
			Params	5.3M	# 36	Compare
			MACs	597M	# 129	Compare

Methods

Add Remove

DARTS • SPEED

Edit Social Preview

PC-DARTS: Partial Channel Connections for Memory-Efficient Architecture Search

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove