Optimal fractional-order PID controller based on fractional-order actor-critic algorithm

被引:0
作者
Raafat Shalaby
Mohammad El-Hossainy
Belal Abo-Zalam
Tarek A. Mahmoud
机构
[1] Menoufia University,Department of Industrial Electronics and Control Engineering, Faculty of Electronic Engineering
[2] Nile University,Department of Mechatronics Engineering, School of Engineering and Applied Science
[3] New Cairo Technological University,Department of New and Renewable Energy, Faculty of Industry and Energy Technology
来源
Neural Computing and Applications | 2023年 / 35卷
关键词
Fractional-order PID controller; Reinforcement learning; Actor-critic algorithm; Gray wolf optimization; Lyapunov theorem;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, an online optimization approach of a fractional-order PID controller based on a fractional-order actor-critic algorithm (FOPID-FOAC) is proposed. The proposed FOPID-FOAC scheme exploits the advantages of the FOPID controller and FOAC approaches to improve the performance of nonlinear systems. The proposed FOAC is built by developing a FO-based learning approach for the actor-critic neural network with adaptive learning rates. Moreover, a FO rectified linear unit (RLU) is introduced to enable the AC neural network to define and optimize its own activation function. By the means of the Lyapunov theorem, the convergence and the stability analysis of the proposed algorithm are investigated. The FO operators for the FOAC learning algorithm are obtained using the gray wolf optimization (GWO) algorithm. The effectiveness of the proposed approach is proven by extensive simulations based on the tracking problem of the two degrees of freedom (2-DOF) helicopter system and the stabilization issue of the inverted pendulum (IP) system. Moreover, the performance of the proposed algorithm is compared against optimized FOPID control approaches in different system conditions, namely when the system is subjected to parameter uncertainties and external disturbances. The performance comparison is conducted in terms of two types of performance indices, the error performance indices, and the time response performance indices. The first one includes the integral absolute error (IAE), and the integral squared error (ISE), whereas the second type involves the rising time, the maximum overshoot (Max. OS), and the settling time. The simulation results explicitly indicate the high effectiveness of the proposed FOPID-FOAC controller in terms of the two types of performance measurements under different scenarios compared with the other control algorithms.
引用
收藏
页码:2347 / 2380
页数:33
相关论文
共 203 条
[21]  
Acosta GG(2019)A novel structure of actor-critic learning based on an interval type-2 tsk fuzzy neural network IEEE Trans Fuzzy Syst 28 412-429
[22]  
Chen M(2010)Properties and inequalities of generalized k-gamma, beta and zeta functions Int J Contemp Math Sci 5 76-105
[23]  
Lam HK(2021)Deep convolutional neural network based on adaptive gradient optimizer for fault detection in scim ISA Trans 111 533-541
[24]  
Shi Q(2021)Voltage stability of solar dish-stirling based autonomous dc microgrid using grey wolf optimised fopid-controller Int J Sustain Energ 40 371-390
[25]  
Xiao B(2022)Deep reinforcement learning with shallow controllers: An experimental application to pid tuning Control Eng Pract 121 9034-9060
[26]  
Dwivedi P(2012)Reinforcement learning and feedback control: Using natural decision methods to design optimal adaptive controllers IEEE Control Syst Mag 32 153-167
[27]  
Pandey S(2020)New fractional derivative with sigmoid function as the kernel and its models Chin J Phys 68 98-111
[28]  
Junghare A(2020)Inquiry-based learning: development of an introductory manufacturing processes course based on a mobile inverted pendulum robot Int J Mech Eng Educ 48 46-61
[29]  
Fei J(2021)Direct adaptive control for nonlinear systems using a tsk fuzzy echo state network based on fractional-order learning algorithm J Franklin Inst 358 1940-1950
[30]  
Wang Z(2015)Dynamical behavior of fractional-order hastings-powell food chain model and its discretization Commun Nonlinear Sci Numer Simul 27 116704-116723