TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Image Classification	ImageNet	UniNet-B5	Top 1 Accuracy	87%	# 112
Image Classification	ImageNet	UniNet-B5	Number of params	72.9M	# 791
Image Classification	ImageNet	UniNet-B5	GFLOPs	20.4	# 368
Image Classification	ImageNet	UniNet-B6	Top 1 Accuracy	87.4%	# 93
Image Classification	ImageNet	UniNet-B6	Number of params	117M	# 876
Image Classification	ImageNet	UniNet-B6	GFLOPs	51	# 426
Neural Architecture Search	ImageNet	UniNet-B0	Top-1 Error Rate	19.2	# 12
Neural Architecture Search	ImageNet	UniNet-B0	FLOPs	555M	# 126
Neural Architecture Search	ImageNet	UniNet-B0	Params	11.5M	# 5
Image Classification	ImageNet	UniNet-B0	Top 1 Accuracy	80.8%	# 623
Image Classification	ImageNet	UniNet-B0	Number of params	11.5M	# 489
Image Classification	ImageNet	UniNet-B0	GFLOPs	0.555	# 57
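
UniNet-B0 appears on two leaderboards that report the same measurements in different conventions: top-1 error rate versus top-1 accuracy, and FLOPs in millions versus GFLOPs. The short sketch below checks that the rows above agree with each other. The function names are illustrative and are not taken from the paper or its code.

```python
# Convert between the metric conventions used by the two leaderboards.
# Function names are hypothetical; the numbers come from the UniNet-B0 rows.

def error_rate_to_accuracy(error_pct: float) -> float:
    """Top-1 error rate (percent) -> top-1 accuracy (percent)."""
    return round(100.0 - error_pct, 1)


def mflops_to_gflops(mflops: float) -> float:
    """FLOPs expressed in millions -> GFLOPs (1 GFLOP = 1000 MFLOPs)."""
    return mflops / 1000.0


# Neural Architecture Search rows for UniNet-B0.
nas_top1_error = 19.2   # Top-1 Error Rate
nas_mflops = 555.0      # FLOPs, "555M"

print(error_rate_to_accuracy(nas_top1_error))  # 80.8, matches Top 1 Accuracy
print(mflops_to_gflops(nas_mflops))            # 0.555, matches GFLOPs
```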

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/uninet-unified-architecture-search-with-1/neural-architecture-search-on-imagenet)](https://paperswithcode.com/sota/neural-architecture-search-on-imagenet?p=uninet-unified-architecture-search-with-1)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/uninet-unified-architecture-search-with-1/image-classification-on-imagenet)](https://paperswithcode.com/sota/image-classification-on-imagenet?p=uninet-unified-architecture-search-with-1)`

UniNet: Unified Architecture Search with Convolution, Transformer, and MLP

12 Jul 2022 · Jihao Liu, Xin Huang, Guanglu Song, Hongsheng Li, Yu Liu

Recently, transformer and multi-layer perceptron (MLP) architectures have achieved impressive results on various vision tasks. However, how to effectively combine these operators into high-performance hybrid visual architectures remains a challenge. In this work, we study the learnable combination of convolution, transformer, and MLP by proposing a novel unified architecture search approach. Our approach contains two key designs that make the search for high-performance networks possible. First, we model the very different searchable operators in a unified form, so that all operators can be characterized with the same set of configuration parameters. In this way, the overall search space size is significantly reduced, and the total search cost becomes affordable. Second, we propose context-aware downsampling modules (DSMs) to mitigate the gap between the different types of operators. The proposed DSMs better adapt features from different types of operators, which is important for identifying high-performance hybrid architectures. Finally, we integrate the configurable operators and DSMs into a unified search space and search it with a reinforcement-learning-based algorithm to fully explore the optimal combination of the operators. With this approach, we search for a baseline network and scale it up to obtain a family of models, named UniNets, which achieve much better accuracy and efficiency than previous ConvNets and Transformers. In particular, our UniNet-B5 achieves 84.9% top-1 accuracy on ImageNet, outperforming EfficientNet-B7 and BoTNet-T7 with 44% and 55% fewer FLOPs, respectively. With pretraining on ImageNet-21K, our UniNet-B6 achieves 87.4% top-1 accuracy, outperforming Swin-L with 51% fewer FLOPs and 41% fewer parameters. Code is available at https://github.com/Sense-X/UniNet.
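
To make the idea of a unified operator form more concrete, here is a minimal sketch of a search space in which convolution, transformer, and MLP stages are all described by the same configuration fields. Everything in it is an assumption made for illustration: the field names, the candidate values, and the number of stages are hypothetical and are not taken from the paper or the official repository. Uniform random sampling stands in for the reinforcement-learning controller, and the downsampling modules are left out.

```python
import random
from dataclasses import dataclass
from itertools import product

# Hypothetical candidate values, chosen only for illustration.
OPERATORS = ("conv", "transformer", "mlp")
EXPANSIONS = (2, 4, 6)
CHANNELS = (64, 128, 256)
REPEATS = (1, 2, 3, 4)


@dataclass(frozen=True)
class StageConfig:
    """One stage, described by the same fields whatever its operator type."""
    op: str
    expansion: int
    channels: int
    repeats: int


def stage_choices() -> list[StageConfig]:
    """Enumerate every configuration a single stage can take."""
    return [
        StageConfig(op, e, c, r)
        for op, e, c, r in product(OPERATORS, EXPANSIONS, CHANNELS, REPEATS)
    ]


def search_space_size(num_stages: int) -> int:
    """Stages are chosen independently, so the per-stage count is exponentiated."""
    return len(stage_choices()) ** num_stages


def sample_architecture(num_stages: int, rng: random.Random) -> list[StageConfig]:
    """Uniform random sampling; a stand-in for an RL-based controller."""
    choices = stage_choices()
    return [rng.choice(choices) for _ in range(num_stages)]


if __name__ == "__main__":
    rng = random.Random(0)
    print(len(stage_choices()))      # 3 * 3 * 3 * 4 = 108 options per stage
    print(search_space_size(5))      # 108 ** 5 candidate networks
    for stage in sample_architecture(5, rng):
        print(stage)
```

Because every operator type shares one set of fields, adding a new operator in this sketch multiplies the per-stage count by a constant instead of introducing a separate sub-space with its own parameters.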

Code

sense-x/uninet official

sense-x/tokenmix

Tasks

Image Classification

Neural Architecture Search

Datasets

ImageNet

MS COCO

Results from the Paper

Ranked #12 on Neural Architecture Search on ImageNet
