TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Image Classification	CIFAR-10	ResNet-8 (Trainable Activations)	Percentage correct	86.5	# 201
Image Classification	CIFAR-10	ResNet-8 (Trainable Activations)	PARAMS	0.075M	# 1
Image Classification	CIFAR-10	ResNet-8 (Trainable Activations)	Top-1 Accuracy	86.5	# 34
Image Classification	CIFAR-10	ResNet-56 (Trainable Activations)	Percentage correct	88.8	# 195
Image Classification	CIFAR-10	ResNet-56 (Trainable Activations)	PARAMS	0.853M	# 179
Image Classification	CIFAR-10	ResNet-56 (Trainable Activations)	Top-1 Accuracy	88.8	# 31
Image Classification	CIFAR-10	ResNet-26 (Trainable Activations)	Percentage correct	91.1	# 178
Image Classification	CIFAR-10	ResNet-26 (Trainable Activations)	PARAMS	0.366M	# 168
Image Classification	CIFAR-10	ResNet-26 (Trainable Activations)	Top-1 Accuracy	91.1	# 26
Image Classification	CIFAR-10	ResNet-20 (Trainable Activations)	Percentage correct	90.4	# 187
Image Classification	CIFAR-10	ResNet-20 (Trainable Activations)	PARAMS	0.269M	# 166
Image Classification	CIFAR-10	ResNet-20 (Trainable Activations)	Top-1 Accuracy	90.4	# 29
Image Classification	CIFAR-10	ResNet-14 (Trainable Activations)	Percentage correct	89.0	# 193
Image Classification	CIFAR-10	ResNet-14 (Trainable Activations)	PARAMS	0.172M	# 164
Image Classification	CIFAR-10	ResNet-14 (Trainable Activations)	Top-1 Accuracy	89.0	# 30
Image Classification	CIFAR-10	ResNet-44 (Trainable Activations)	Percentage correct	90.5	# 185
Image Classification	CIFAR-10	ResNet-44 (Trainable Activations)	PARAMS	0.658M	# 176
Image Classification	CIFAR-10	ResNet-44 (Trainable Activations)	Top-1 Accuracy	90.5	# 28
Image Classification	CIFAR-10	ResNet-32 (Trainable Activations)	Percentage correct	90.9	# 179
Image Classification	CIFAR-10	ResNet-32 (Trainable Activations)	PARAMS	0.464M	# 170
Image Classification	CIFAR-10	ResNet-32 (Trainable Activations)	Top-1 Accuracy	90.9	# 27
Image Classification	MNIST	DNN-3 (Trainable Activations)	Percentage error	3.0	# 78
Image Classification	MNIST	DNN-3 (Trainable Activations)	Accuracy	97.0	# 29
Image Classification	MNIST	DNN-3 (Trainable Activations)	Trainable Parameters	80568	# 2
Image Classification	MNIST	DNN-2 (Trainable Activations)	Percentage error	3.6	# 79
Image Classification	MNIST	DNN-2 (Trainable Activations)	Accuracy	96.4	# 30
Image Classification	MNIST	DNN-2 (Trainable Activations)	Trainable Parameters	5500	# 1
Image Classification	MNIST	DNN-5 (Trainable Activations)	Percentage error	2.8	# 77
Image Classification	MNIST	DNN-5 (Trainable Activations)	Accuracy	97.2	# 28
Image Classification	MNIST	DNN-5 (Trainable Activations)	Trainable Parameters	175180	# 92

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/trainable-activations-for-image/image-classification-on-mnist)](https://paperswithcode.com/sota/image-classification-on-mnist?p=trainable-activations-for-image)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/trainable-activations-for-image/image-classification-on-cifar-10)](https://paperswithcode.com/sota/image-classification-on-cifar-10?p=trainable-activations-for-image)`

Trainable Activations for Image Classification

Preprints 2023 · Evgenii Pishchik

Non-linear activation functions are one of the main components of deep neural network architectures. The choice of activation function can affect model speed, performance, and convergence. Most popular activation functions have no trainable parameters and do not change during training. We propose several activation functions, some with and some without trainable parameters, each with its own advantages and disadvantages. We test the performance of these activation functions and compare the results with the widely used ReLU activation function. We hypothesize that activation functions with trainable parameters can outperform those without, because the trainable parameters allow the model to "select" the form of each activation function itself; however, this depends strongly on the architecture of the deep neural network and on the activation function itself.
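To make the idea of a model "selecting" its own activation more concrete, the following is a minimal sketch in plain Python. It is not the paper's implementation: the class `MixedActivation`, the choice of ReLU, sigmoid, and tanh as basis functions, the toy target, and the learning rate are all assumptions made for illustration. The activation is a weighted sum of fixed functions, and the weights are the trainable parameters, updated by full-batch gradient descent on a mean squared error.

```python
import math


def relu(x):
    return max(0.0, x)


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


# Fixed basis functions that the trainable weights mix together (illustrative choice).
BASIS = (relu, sigmoid, math.tanh)


class MixedActivation:
    """Hypothetical trainable activation: f(x) = sum_i w_i * basis_i(x)."""

    def __init__(self):
        # Start from a uniform mix; these weights are the trainable parameters.
        self.weights = [1.0 / len(BASIS)] * len(BASIS)

    def __call__(self, x):
        return sum(w * f(x) for w, f in zip(self.weights, BASIS))

    def grad_weights(self, x):
        # d f(x) / d w_i is simply basis_i(x), because f is linear in the weights.
        return [f(x) for f in BASIS]


def mse(act, xs, ys):
    return sum((act(x) - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


def train(act, xs, ys, lr=0.1, steps=200):
    """Full-batch gradient descent on the mixing weights only."""
    n = len(xs)
    for _ in range(steps):
        grads = [0.0] * len(act.weights)
        for x, y in zip(xs, ys):
            err = act(x) - y
            for i, g in enumerate(act.grad_weights(x)):
                grads[i] += 2.0 * err * g / n
        act.weights = [w - lr * g for w, g in zip(act.weights, grads)]
    return act


# Toy data: the target shape is tanh, so training should shift the mix toward it.
xs = [i / 10.0 for i in range(-30, 31)]
ys = [math.tanh(x) for x in xs]

act = MixedActivation()
loss_before = mse(act, xs, ys)
train(act, xs, ys)
loss_after = mse(act, xs, ys)
print("weights (relu, sigmoid, tanh):", [round(w, 3) for w in act.weights])
print("loss before/after:", loss_before, loss_after)
```

In a real network the mixing weights would be trained jointly with the layer weights through backpropagation rather than fitted to a fixed target as in this toy setup.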

Code

Pe4enIks/TrainableActivation official

Tasks

Image Classification

Datasets

CIFAR-10

MNIST

Results from the Paper

Ranked #77 on Image Classification on MNIST

Methods

1x1 Convolution • Average Pooling • Batch Normalization • Bottleneck Residual Block • Convolution • CosLU • DELU • Global Average Pooling • Kaiming Initialization • LinComb • Max Pooling • NormLinComb • ReLU • ReLU6 • ReLUN • Residual Block • Residual Connection • ResNet • ScaledSoftSign • SGD • ShiLU • Sigmoid Activation • SiLU • Softsign Activation • Tanh Activation
