Optimal fractional-order PID controller based on fractional-order actor-critic algorithm

被引:0
作者
Raafat Shalaby
Mohammad El-Hossainy
Belal Abo-Zalam
Tarek A. Mahmoud
机构
[1] Menoufia University,Department of Industrial Electronics and Control Engineering, Faculty of Electronic Engineering
[2] Nile University,Department of Mechatronics Engineering, School of Engineering and Applied Science
[3] New Cairo Technological University,Department of New and Renewable Energy, Faculty of Industry and Energy Technology
来源
Neural Computing and Applications | 2023年 / 35卷
关键词
Fractional-order PID controller; Reinforcement learning; Actor-critic algorithm; Gray wolf optimization; Lyapunov theorem;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, an online optimization approach of a fractional-order PID controller based on a fractional-order actor-critic algorithm (FOPID-FOAC) is proposed. The proposed FOPID-FOAC scheme exploits the advantages of the FOPID controller and FOAC approaches to improve the performance of nonlinear systems. The proposed FOAC is built by developing a FO-based learning approach for the actor-critic neural network with adaptive learning rates. Moreover, a FO rectified linear unit (RLU) is introduced to enable the AC neural network to define and optimize its own activation function. By the means of the Lyapunov theorem, the convergence and the stability analysis of the proposed algorithm are investigated. The FO operators for the FOAC learning algorithm are obtained using the gray wolf optimization (GWO) algorithm. The effectiveness of the proposed approach is proven by extensive simulations based on the tracking problem of the two degrees of freedom (2-DOF) helicopter system and the stabilization issue of the inverted pendulum (IP) system. Moreover, the performance of the proposed algorithm is compared against optimized FOPID control approaches in different system conditions, namely when the system is subjected to parameter uncertainties and external disturbances. The performance comparison is conducted in terms of two types of performance indices, the error performance indices, and the time response performance indices. The first one includes the integral absolute error (IAE), and the integral squared error (ISE), whereas the second type involves the rising time, the maximum overshoot (Max. OS), and the settling time. The simulation results explicitly indicate the high effectiveness of the proposed FOPID-FOAC controller in terms of the two types of performance measurements under different scenarios compared with the other control algorithms.
引用
收藏
页码:2347 / 2380
页数:33
相关论文
共 203 条
  • [1] Aguilar CZ(2020)Fractional order neural networks for system identification Chaos, Solitons & Fractals 130 2561-2569
  • [2] Gómez-Aguilar J(2017)Rotary flexible joint control by fractional order controllers Int J Control Autom Syst 15 126-135
  • [3] Alvarado-Martínez V(2020)A novel cffopi-fopid controller for agc performance enhancement of single and multi-area electric power systems ISA Trans 100 2297-2301
  • [4] Romero-Ugalde H(2013)On tuning fractional order [proportional-derivative] controllers for a class of fractional order systems Automatica 49 4573-4584
  • [5] Al-Saggaf UM(2020)Nn reinforcement learning adaptive control for a class of nonstrict-feedback discrete-time systems IEEE Trans Cybernet 50 543-551
  • [6] Mehedi IM(2010)Tuning fuzzy pd and pi controllers using reinforcement learning ISA Trans 49 280-294
  • [7] Mansouri R(2020)An adaptive deep reinforcement learning approach for mimo pid control of mobile robots ISA Trans 102 2059-2063
  • [8] Bettayeb M(2019)Reinforcement learning-based control of nonlinear systems using lyapunov stability concept and fuzzy reward scheme IEEE Trans Circuits Syst II Exp Briefs 67 5121-5145
  • [9] Arya Y(2017)Performance analysis and experimental validation of 2-dof fractional-order controller for underactuated rotary inverted pendulum Arab J Sci Eng 42 167965-167974
  • [10] Badri V(2020)Multi-loop recurrent neural network fractional-order terminal sliding mode control of mems gyroscope IEEE Access 8 1900-1912