TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Neural Architecture Search	ImageNet	BigNASModel-L	Top-1 Error Rate	20.5	# 30
Neural Architecture Search	ImageNet	BigNASModel-L	Accuracy	79.5	# 23
Neural Architecture Search	ImageNet	BigNASModel-L	Params	6.4M	# 18
Neural Architecture Search	ImageNet	BigNASModel-L	MACs	586M	# 126
Neural Architecture Search	ImageNet	BigNASModel-M	Top-1 Error Rate	21.1	# 39
Neural Architecture Search	ImageNet	BigNASModel-M	Accuracy	78.9	# 30
Neural Architecture Search	ImageNet	BigNASModel-M	Params	5.5M	# 30
Neural Architecture Search	ImageNet	BigNASModel-M	MACs	418M	# 113
Neural Architecture Search	ImageNet	BigNASModel-S	Top-1 Error Rate	23.5	# 81
Neural Architecture Search	ImageNet	BigNASModel-S	Accuracy	76.5	# 65
Neural Architecture Search	ImageNet	BigNASModel-S	Params	4.5M	# 50
Neural Architecture Search	ImageNet	BigNASModel-S	MACs	242M	# 80

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/bignas-scaling-up-neural-architecture-search/neural-architecture-search-on-imagenet)](https://paperswithcode.com/sota/neural-architecture-search-on-imagenet?p=bignas-scaling-up-neural-architecture-search)`

BigNAS: Scaling Up Neural Architecture Search with Big Single-Stage Models

ECCV 2020 · Jiahui Yu, Pengchong Jin, Hanxiao Liu, Gabriel Bender, Pieter-Jan Kindermans, Mingxing Tan, Thomas Huang, Xiaodan Song, Ruoming Pang, Quoc Le

Neural architecture search (NAS) has shown promising results in discovering models that are both accurate and fast. For NAS, training a one-shot model has become a popular strategy to rank the relative quality of different architectures (child models) using a single set of shared weights. However, while one-shot model weights can effectively rank different network architectures, the absolute accuracies from these shared weights are typically far below those obtained from stand-alone training. To compensate, existing methods assume that the weights must be retrained, finetuned, or otherwise post-processed after the search is completed. These steps significantly increase the compute requirements and complexity of the architecture search and model deployment. In this work, we propose BigNAS, an approach that challenges the conventional wisdom that post-processing of the weights is necessary to get good prediction accuracies. Without extra retraining or post-processing steps, we are able to train a single set of shared weights on ImageNet and use these weights to obtain child models whose sizes range from 200 to 1000 MFLOPs. Our discovered model family, BigNASModels, achieves top-1 accuracies ranging from 76.5% to 80.9%, surpassing state-of-the-art models in this range, including EfficientNets and Once-for-All networks, without extra retraining or post-processing. We present an ablation study and analysis to further understand the proposed BigNASModels.
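The idea of child models drawing on a single set of shared weights can be illustrated with a toy example. The sketch below is not the BigNAS implementation; it assumes, purely for illustration, that a child model uses the leading hidden units of a small two-layer network, and all names and sizes (`shared`, `child_forward`, `child_macs`, `MAX_HIDDEN`) are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes for a tiny two-layer network acting as the one-shot model.
IN_DIM, MAX_HIDDEN, NUM_CLASSES = 8, 16, 4

# One set of shared weights; every child model reads from these arrays.
shared = {
    "w1": rng.normal(size=(IN_DIM, MAX_HIDDEN)),
    "b1": np.zeros(MAX_HIDDEN),
    "w2": rng.normal(size=(MAX_HIDDEN, NUM_CLASSES)),
    "b2": np.zeros(NUM_CLASSES),
}


def child_forward(x, hidden):
    """Run a child model that uses only the first `hidden` units.

    Assumption for this sketch: a child is defined by slicing the
    leading channels of the shared weights, with no copy or retraining.
    """
    w1 = shared["w1"][:, :hidden]   # view into the shared array
    b1 = shared["b1"][:hidden]
    w2 = shared["w2"][:hidden, :]
    h = np.maximum(x @ w1 + b1, 0.0)  # ReLU
    return h @ w2 + shared["b2"]


def child_macs(hidden):
    """Multiply-accumulate count of one forward pass for a single input."""
    return IN_DIM * hidden + hidden * NUM_CLASSES


x = rng.normal(size=(2, IN_DIM))
for hidden in (4, 8, 16):
    logits = child_forward(x, hidden)
    print(f"hidden={hidden:2d}  MACs={child_macs(hidden):3d}  logits shape={logits.shape}")
```

Because the slices are views, all three child sizes read the same underlying parameters, which is the property the abstract relies on when it speaks of obtaining child models of different sizes without post-processing the weights.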

Code

GAIA-vision/GAIA-cv

Tasks

Neural Architecture Search

Datasets

ImageNet

Results from the Paper

Ranked #30 on Neural Architecture Search on ImageNet

Task	Dataset	Model	Metric Name	Metric Value	Global Rank
Neural Architecture Search	ImageNet	BigNASModel-L	Top-1 Error Rate	20.5	# 30
			Accuracy	79.5	# 23
			Params	6.4M	# 18
			MACs	586M	# 126
Neural Architecture Search	ImageNet	BigNASModel-M	Top-1 Error Rate	21.1	# 39
			Accuracy	78.9	# 30
			Params	5.5M	# 30
			MACs	418M	# 113
Neural Architecture Search	ImageNet	BigNASModel-S	Top-1 Error Rate	23.5	# 81
			Accuracy	76.5	# 65
			Params	4.5M	# 50
			MACs	242M	# 80
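The reported metrics can be cross-checked with a few lines of arithmetic. The following is a small sketch, not part of the paper or the leaderboard: it copies the values from the table, confirms that each Top-1 Error Rate equals 100 minus the Accuracy, and computes the accuracy gained per step up in model size. The helper names (`error_from_accuracy`, `marginal_gains`) are made up for this example.

```python
# (model, top-1 accuracy %, top-1 error %, params in millions, MACs in millions)
# Values copied from the results table.
results = [
    ("BigNASModel-S", 76.5, 23.5, 4.5, 242),
    ("BigNASModel-M", 78.9, 21.1, 5.5, 418),
    ("BigNASModel-L", 79.5, 20.5, 6.4, 586),
]


def error_from_accuracy(acc):
    """Top-1 error rate (%) implied by a top-1 accuracy (%)."""
    return round(100.0 - acc, 1)


def marginal_gains(rows):
    """Accuracy and MAC differences between consecutive models in `rows`."""
    gains = []
    for (n0, a0, _, _, m0), (n1, a1, _, _, m1) in zip(rows, rows[1:]):
        gains.append((n0, n1, round(a1 - a0, 1), m1 - m0))
    return gains


# The two reported metrics should agree for every model.
for name, acc, err, _, _ in results:
    assert error_from_accuracy(acc) == err, name

for src, dst, d_acc, d_macs in marginal_gains(results):
    print(f"{src} -> {dst}: +{d_acc} accuracy points for +{d_macs}M MACs")
```

On these numbers, the step from S to M buys considerably more accuracy per added MAC than the step from M to L.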
