PATS: A New Neural Network Activation Function with Parameter

16 Jun 2020 · Baoyou Zheng, Zhiping Wang ·

Activation function is crucial to the recent successes of deep neural networks. In this paper, we propose a new activation function with parameters, named PATS. Specifically, PATS is a non-monotonic function which combines arctangent function and sigmoid function. In the process of network model training, the parameter of PATS is a random number from the uniform distribution, which improves the flexibility of network model and reduces the risk of over fitting. In addition, PATS can be widely used in existing deep network models. We use several classic deep network models to test the performance of PATS. The experimental results on CIFAR-10 and CIFAR-100 datasets show that compared with other activation functions, PATS has better performance in improving the learning ability and robustness of deep network model.

PDF

Code

Add Remove Mark official

No code implementations yet. Submit your code now

Tasks

Add Remove

Datasets

Add Datasets introduced or used in this paper

Results from the Paper

Add Remove

Submit results from this paper to get state-of-the-art GitHub badges and help the community compare results to other papers.

Methods

Add Remove

Mish • Softplus • Tanh Activation

Edit Social Preview

PATS: A New Neural Network Activation Function with Parameter

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit Add Remove

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Add Remove

Methods

Add Remove