Reinforcement Learning Based Adaptive Predefined-Time Optimal Control for Strict-Feedback Nonlinear Systems

被引:0
作者
Jin, Yitong [1 ]
Wang, Fang [1 ]
Zhang, Xueyi [2 ]
机构
[1] Shandong Univ Sci & Technol, Coll Math & Syst Sci, Qingdao, Peoples R China
[2] Shandong Univ Sci & Technol, Coll Foreign Languages, Qingdao, Peoples R China
基金
中国国家自然科学基金;
关键词
actor-critic architecture; fuzzy logic system (FLS); optimal control; predefined-time control; reinforcement learning (RL); TRACKING CONTROL; ALGORITHM;
D O I
10.1002/acs.4044
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
An adaptive predefined-time optimal control method based on fuzzy logic system (FLS) is presented in this paper for strict-feedback nonlinear systems. The settling time of predefined-time control can be set in advance, which is independent of the design parameters and initial conditions. The optimal control relies on solving the Hamilton-Jacobi-Bellman (HJB) equation, which is difficult to calculate directly due to its inherent nonlinearity. To overcome this difficulty, the reinforcement learning (RL) strategy of actor-critic structure is used, where the actor and critic are used to achieve control behavior and evaluate system performance, respectively. In addition, the RL algorithm is simplified, and the two conditions of persistence of excitation and known dynamics required for the most optimal controls are eliminated. The main objective of this article is to ensure that the tracking error converges to a small neighborhood near the origin within a predefined time, while minimizing energy consumption. Finally, the efficiency of the suggested control strategy is demonstrated by using a practical simulation example.
引用
收藏
页数:13
相关论文
共 48 条
[1]   Finite-time and fixed-time synchronization of a class of inertial neural networks with multi-proportional delays and its application to secure communication [J].
Alimi, Adel M. ;
Aouiti, Chaouki ;
Assali, El Abed .
NEUROCOMPUTING, 2019, 332 :29-43
[2]   Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation [J].
Beard, RW ;
Saridis, GN ;
Wen, JT .
AUTOMATICA, 1997, 33 (12) :2159-2177
[3]   A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems [J].
Bhasin, S. ;
Kamalapurkar, R. ;
Johnson, M. ;
Vamvoudakis, K. G. ;
Lewis, F. L. ;
Dixon, W. E. .
AUTOMATICA, 2013, 49 (01) :82-92
[4]  
Sánchez-Tones JD, 2015, P AMER CONTR CONF, P5842, DOI 10.1109/ACC.2015.7172255
[5]   A class of predefined-time stable dynamical systems [J].
Diego Sanchez-Torres, Juan ;
Gomez-Gutierrez, David ;
Lopez, Esteban ;
Loukianov, Alexander G. .
IMA JOURNAL OF MATHEMATICAL CONTROL AND INFORMATION, 2018, 35 :1-29
[6]   Boundary Feedback Control of a Nonhomogeneous Wind Turbine Tower With Exogenous Disturbances [J].
Han, Zhiji ;
Liu, Zhijie ;
Kang, Wen ;
He, Wei .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2022, 67 (04) :1952-1959
[7]   Reinforcement learning neural-network-based controller for nonlinear discrete-time systems with input constraints [J].
He, Pingan ;
Jagannathan, S. .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2007, 37 (02) :425-436
[8]   Trajectory Tracking Control for a Three-Dimensional Flexible Wing [J].
He, Wei ;
Tang, Xinyue ;
Wang, Tingting ;
Liu, Zhijie .
IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2022, 30 (05) :2243-2250
[9]   Prescribed-Time Output-Feedback Control of Stochastic Nonlinear Systems [J].
Li, Wuquan ;
Krstic, Miroslav .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (03) :1431-1446
[10]   Adaptive Optimized Backstepping Control-Based RL Algorithm for Stochastic Nonlinear Systems With State Constraints and Its Application [J].
Li, Yongming ;
Fan, Yanli ;
Li, Kewen ;
Liu, Wei ;
Tong, Shaocheng .
IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (10) :10542-10555