Adaptive Morphing Activation Function for Neural Networks

被引:3
作者
Herrera-Alcantara, Oscar [1 ]
Arellano-Balderas, Salvador [2 ]
机构
[1] Univ Autonoma Metropolitana, Dept Sistemas, Azcapotzalco 02200, Mexico
[2] Univ Autonoma Metropolitana, Dept Ciencias Bas, Azcapotzalco 02200, Mexico
关键词
fractional derivative; activation function; machine learning; wavelets; WAVELETS;
D O I
10.3390/fractalfract8080444
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
A novel morphing activation function is proposed, motivated by the wavelet theory and the use of wavelets as activation functions. Morphing refers to the gradual change of shape to mimic several apparently unrelated activation functions. The shape is controlled by the fractional order derivative, which is a trainable parameter to be optimized in the neural network learning process. Given the morphing activation function, and taking only integer-order derivatives, efficient piecewise polynomial versions of several existing activation functions are obtained. Experiments show that the performance of polynomial versions PolySigmoid, PolySoftplus, PolyGeLU, PolySwish, and PolyMish is similar or better than their counterparts Sigmoid, Softplus, GeLU, Swish, and Mish. Furthermore, it is possible to learn the best shape from the data by optimizing the fractional-order derivative with gradient descent algorithms, leading to the study of a more general formula based on fractional calculus to build and adapt activation functions with properties useful in machine learning.
引用
收藏
页数:32
相关论文
共 45 条
[1]  
Agarap A. F., 2018, arXiv, DOI DOI 10.48550/ARXIV.1803.08375
[2]  
Agostinelli F., 2015, P INT C LEARN REPR W
[3]  
Alijani Z, 2025, Arxiv, DOI arXiv:2310.11875
[4]   Fractional-Order Deep Backpropagation Neural Network [J].
Bao, Chunhui ;
Pu, Yifei ;
Zhang, Yi .
COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2018, 2018
[5]   TanhSoft-Dynamic Trainable Activation Functions for Faster Learning and Better Performance [J].
Biswas, Koushik ;
Kumar, Sandeep ;
Banerjee, Shilpak ;
Pandey, Ashish Kumar .
IEEE ACCESS, 2021, 9 :120613-120623
[6]   Deep learning with ExtendeD Exponential Linear Unit (DELU) [J].
Catalbas, Burak ;
Morgul, Oemer .
NEURAL COMPUTING & APPLICATIONS, 2023, 35 (30) :22705-22724
[7]  
Clevert Djork-Arne, 2016, INT C LEARN REPR ICL
[8]  
Cybenko G., 1989, Mathematics of Control, Signals, and Systems, V2, P303, DOI 10.1007/BF02551274
[9]  
Daubechies I., 1992, 10 LECT WAVELETS, DOI [10.1137/1.9781611970104, DOI 10.1137/1.9781611970104]
[10]  
Dubey Arun Kumar, 2019, Applications of Computing, Automation and Wireless Systems in Electrical Engineering. Proceedings of MARC 2018. Lecture Notes in Electrical Engineering (LNEE 553), P873, DOI 10.1007/978-981-13-6772-4_76