TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Image Classification	CIFAR-10	TResNet-XL	Percentage correct	99	# 19
Image Classification	CIFAR-100	TResNet-L-V2	Percentage correct	92.6	# 13
Image Classification	Flowers-102	TResNet-L	Accuracy	99.1%	# 13
Image Classification	ImageNet	TResNet-XL	Top 1 Accuracy	84.3%	# 305
Image Classification	ImageNet	TResNet-XL	Number of params	77M	# 803
Image Classification	ImageNet	TResNet-XL	Hardware Burden	None	# 1
Image Classification	ImageNet	TResNet-XL	Operations per network pass	None	# 1
Fine-Grained Image Classification	Oxford 102 Flowers	TResNet-L	Accuracy	99.1%	# 7

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/tresnet-high-performance-gpu-dedicated/fine-grained-image-classification-on-oxford)](https://paperswithcode.com/sota/fine-grained-image-classification-on-oxford?p=tresnet-high-performance-gpu-dedicated)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/tresnet-high-performance-gpu-dedicated/image-classification-on-cifar-100)](https://paperswithcode.com/sota/image-classification-on-cifar-100?p=tresnet-high-performance-gpu-dedicated)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/tresnet-high-performance-gpu-dedicated/image-classification-on-flowers-102)](https://paperswithcode.com/sota/image-classification-on-flowers-102?p=tresnet-high-performance-gpu-dedicated)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/tresnet-high-performance-gpu-dedicated/image-classification-on-cifar-10)](https://paperswithcode.com/sota/image-classification-on-cifar-10?p=tresnet-high-performance-gpu-dedicated)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/tresnet-high-performance-gpu-dedicated/image-classification-on-imagenet)](https://paperswithcode.com/sota/image-classification-on-imagenet?p=tresnet-high-performance-gpu-dedicated)`

TResNet: High Performance GPU-Dedicated Architecture

30 Mar 2020 · Tal Ridnik, Hussam Lawen, Asaf Noy, Emanuel Ben Baruch, Gilad Sharir, Itamar Friedman ·

Many deep learning models, developed in recent years, reach higher ImageNet accuracy than ResNet50, with fewer or comparable FLOPS count. While FLOPs are often seen as a proxy for network efficiency, when measuring actual GPU training and inference throughput, vanilla ResNet50 is usually significantly faster than its recent competitors, offering better throughput-accuracy trade-off. In this work, we introduce a series of architecture modifications that aim to boost neural networks' accuracy, while retaining their GPU training and inference efficiency. We first demonstrate and discuss the bottlenecks induced by FLOPs-optimizations. We then suggest alternative designs that better utilize GPU structure and assets. Finally, we introduce a new family of GPU-dedicated models, called TResNet, which achieve better accuracy and efficiency than previous ConvNets. Using a TResNet model, with similar GPU throughput to ResNet50, we reach 80.8 top-1 accuracy on ImageNet. Our TResNet models also transfer well and achieve state-of-the-art accuracy on competitive single-label classification datasets such as Stanford cars (96.0%), CIFAR-10 (99.0%), CIFAR-100 (91.5%) and Oxford-Flowers (99.1%). They also perform well on multi-label classification and object detection tasks. Implementation is available at: https://github.com/mrT23/TResNet.

PDF Abstract

Code

Add Remove Mark official

rwightman/pytorch-image-models official

29,648

mrT23/TResNet official

460

Alibaba-MIIL/TResNet

460

Tasks

Add Remove

Fine-Grained Image Classification

General Classification

Image Classification

Multi-Label Classification

object-detection

Object Detection

Vocal Bursts Intensity Prediction

Datasets

CIFAR-10

ImageNet

CIFAR-100

Oxford 102 Flower

Results from the Paper

Edit

Ranked #7 on Fine-Grained Image Classification on Oxford 102 Flowers (using extra training data)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Image Classification	CIFAR-10	TResNet-XL	Percentage correct	99	# 19	Compare
Image Classification	CIFAR-100	TResNet-L-V2	Percentage correct	92.6	# 13	Compare
Image Classification	Flowers-102	TResNet-L	Accuracy	99.1%	# 13	Compare
Image Classification	ImageNet	TResNet-XL	Top 1 Accuracy	84.3%	# 305	Compare
			Number of params	77M	# 803	Compare
			Hardware Burden	None	# 1	Compare
			Operations per network pass	None	# 1	Compare
Fine-Grained Image Classification	Oxford 102 Flowers	TResNet-L	Accuracy	99.1%	# 7	Compare

Methods

Add Remove

1x1 Convolution • Anti-Alias Downsampling • AutoAugment • Average Pooling • Batch Normalization • ColorJitter • Convolution • Cutout • Dense Connections • Global Average Pooling • InPlace-ABN • Label Smoothing • Leaky ReLU • LSTM • Mish • ReLU • Residual Connection • Sigmoid Activation • Softplus • Squeeze-and-Excitation Block • Tanh Activation • TResNet • Weight Decay

Edit Social Preview

TResNet: High Performance GPU-Dedicated Architecture

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove