TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Image Classification	CIFAR-10	NAS-RL	Percentage correct	96.4	# 106
Neural Architecture Search	CIFAR-10 Image Classification	NAS-RL-A + c/o	Percentage error	2.4	# 10
Neural Architecture Search	CIFAR-10 Image Classification	NAS-RL-A + c/o	Params	27.6M	# 17
Language Modelling	Penn Treebank (Character Level)	NAS-RL	Bit per Character (BPC)	1.214	# 13
Language Modelling	Penn Treebank (Character Level)	NAS-RL	Number of params	16.3M	# 5
Language Modelling	Penn Treebank (Word Level)	NAS-RL	Test perplexity	64.0	# 32
Language Modelling	Penn Treebank (Word Level)	NAS-RL	Params	25M	# 6

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/neural-architecture-search-with-reinforcement/architecture-search-on-cifar-10-image)](https://paperswithcode.com/sota/architecture-search-on-cifar-10-image?p=neural-architecture-search-with-reinforcement)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/neural-architecture-search-with-reinforcement/language-modelling-on-penn-treebank-character)](https://paperswithcode.com/sota/language-modelling-on-penn-treebank-character?p=neural-architecture-search-with-reinforcement)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/neural-architecture-search-with-reinforcement/language-modelling-on-penn-treebank-word)](https://paperswithcode.com/sota/language-modelling-on-penn-treebank-word?p=neural-architecture-search-with-reinforcement)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/neural-architecture-search-with-reinforcement/image-classification-on-cifar-10)](https://paperswithcode.com/sota/image-classification-on-cifar-10?p=neural-architecture-search-with-reinforcement)`

Neural Architecture Search with Reinforcement Learning

5 Nov 2016 · Barret Zoph, Quoc V. Le

Neural networks are powerful and flexible models that work well for many difficult learning tasks in image, speech and natural language understanding. Despite their success, neural networks are still hard to design. In this paper, we use a recurrent network to generate the model descriptions of neural networks and train this RNN with reinforcement learning to maximize the expected accuracy of the generated architectures on a validation set. On the CIFAR-10 dataset, our method, starting from scratch, can design a novel network architecture that rivals the best human-invented architecture in terms of test set accuracy. Our CIFAR-10 model achieves a test error rate of 3.65%, which is 0.09 percentage points better and 1.05x faster than the previous state-of-the-art model that used a similar architectural scheme. On the Penn Treebank dataset, our model can compose a novel recurrent cell that outperforms the widely used LSTM cell and other state-of-the-art baselines. Our cell achieves a test set perplexity of 62.4 on the Penn Treebank, which is 3.6 perplexity points better than the previous state-of-the-art model. The cell can also be transferred to the character language modeling task on PTB, where it achieves a state-of-the-art 1.214 bits per character.
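
The training loop described in the abstract (a controller samples an architecture description, the architecture is scored, and the score is used as a reinforcement learning reward) can be illustrated with a minimal REINFORCE sketch. Everything below is an assumption made for illustration: the toy `SEARCH_SPACE`, the `proxy_reward` function that stands in for validation accuracy (no child network is trained), and the use of independent softmax logits per decision in place of the recurrent controller. It is not the authors' implementation.

```python
import math
import random

# Hypothetical toy search space: one categorical choice per decision.
SEARCH_SPACE = {
    "filter_size": [1, 3, 5, 7],
    "num_filters": [24, 36, 48, 64],
    "stride": [1, 2],
}


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sample_architecture(logits, rng):
    """Sample one option index per decision from the current policy."""
    choice = {}
    for name, options in SEARCH_SPACE.items():
        probs = softmax(logits[name])
        choice[name] = rng.choices(range(len(options)), weights=probs)[0]
    return choice


def proxy_reward(choice):
    """Stand-in for validation accuracy (assumption, not a trained model).

    Rewards the fraction of decisions that match an arbitrary target:
    filter_size=3, num_filters=48, stride=1.
    """
    target = {"filter_size": 1, "num_filters": 2, "stride": 0}
    hits = sum(choice[k] == target[k] for k in target)
    return hits / len(target)


def train_controller(steps=2000, lr=0.1, decay=0.95, seed=0):
    """REINFORCE with an exponential-moving-average baseline."""
    rng = random.Random(seed)
    logits = {k: [0.0] * len(v) for k, v in SEARCH_SPACE.items()}
    baseline = 0.0
    for _ in range(steps):
        choice = sample_architecture(logits, rng)
        reward = proxy_reward(choice)
        advantage = reward - baseline
        baseline = decay * baseline + (1 - decay) * reward
        for name, idx in choice.items():
            probs = softmax(logits[name])
            for j in range(len(probs)):
                # Gradient of log softmax: one-hot(sampled index) - probs.
                grad = (1.0 if j == idx else 0.0) - probs[j]
                logits[name][j] += lr * advantage * grad
    return logits


def best_architecture(logits):
    """Return the highest-logit option for each decision."""
    return {
        name: SEARCH_SPACE[name][max(range(len(l)), key=l.__getitem__)]
        for name, l in logits.items()
    }
```

Calling `best_architecture(train_controller())` returns the option the policy currently favours for each decision. In a real search, `proxy_reward` would be replaced by training the sampled child network and measuring its validation accuracy, which is by far the most expensive step.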

Code

Repository	Stars
tensorflow/models (official)	76,589
carpedm20/ENAS-pytorch	2,684
titu1994/neural-architecture-search	422
DataCanvasIO/Hypernets	261
barisozmen/deepaugment	244

See all 11 implementations

Tasks

Image Classification

Language Modelling

Natural Language Understanding

Neural Architecture Search

reinforcement-learning

Reinforcement Learning (RL)

Datasets

CIFAR-10

Penn Treebank

Results from the Paper

Ranked #10 on Neural Architecture Search on CIFAR-10 Image Classification

