Tehnički Vjesnik (Jan 2023)

The Adaptive Quadratic Linear Unit (AQuLU): Adaptive Non Monotonic Piecewise Activation Function

  • Zhandong Wu
  • Haiye Yu
  • Lei Zhang
  • Yuanyuan Sui

DOI
https://doi.org/10.17559/TV-20230614000735
Journal volume & issue
Vol. 30, No. 5, pp. 1469–1485

Abstract


The activation function plays a key role in the performance and training dynamics of neural networks. Hundreds of activation functions are in wide use, such as rectified linear units (ReLUs), but most are applied to complex and large neural networks, which often suffer from exploding and vanishing gradients. By studying a variety of non-monotonic activation functions, we propose a method for constructing a non-monotonic activation function of the form x·Φ(x), where Φ(x) ∈ [0, 1]. By hardening Φ(x), we obtain an adaptive non-monotonic piecewise activation function, called the adaptive quadratic linear unit and abbreviated AQuLU, which ensures the sparsity of the input data and improves training efficiency. In image classification with different state-of-the-art neural network architectures, AQuLU shows significant advantages over various activation functions, especially for more complex and deeper architectures. An ablation study further validates the compatibility and stability of AQuLU across different depths, complexities, optimizers, learning rates, and batch sizes. We thus demonstrate the high efficiency, robustness, and simplicity of AQuLU.
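The abstract specifies only the general construction f(x) = x·Φ(x) with Φ(x) ∈ [0, 1] and a hardened, piecewise Φ; the exact AQuLU formula and its adaptive parameters are given in the full paper. As a rough illustration of the construction (not the authors' definition), the NumPy sketch below uses a hypothetical hard-clipped ramp for Φ with an assumed width parameter a, which yields a zero region, a quadratic segment, and a linear segment, consistent with the "quadratic linear" naming.

    import numpy as np

    def aqulu_sketch(x, a=1.0):
        # Hypothetical Phi(x): a hard-clipped linear ramp in [0, 1].
        # 'a' is an assumed (possibly learnable) width parameter,
        # not the paper's exact parameterization.
        phi = np.clip((x + a) / (2.0 * a), 0.0, 1.0)
        # f(x) = x * Phi(x): zero for x <= -a (sparsity), a quadratic
        # segment on (-a, a), and the identity for x >= a.
        return x * phi

    x = np.linspace(-3.0, 3.0, 7)
    print(aqulu_sketch(x))  # [0. 0. 0. 0. 1. 2. 3.]

On (-a, a) this sketch reduces to f(x) = x(x + a)/(2a), which dips below zero (minimum at x = -a/2) before rising, so the resulting unit is non-monotonic, as the abstract describes.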

Keywords