Reinforcement Learning Based Adaptive Predefined-Time Optimal Control for Strict-Feedback Nonlinear Systems

被引:0
作者
Jin, Yitong [1 ]
Wang, Fang [1 ]
Zhang, Xueyi [2 ]
机构
[1] Shandong Univ Sci & Technol, Coll Math & Syst Sci, Qingdao, Peoples R China
[2] Shandong Univ Sci & Technol, Coll Foreign Languages, Qingdao, Peoples R China
基金
中国国家自然科学基金;
关键词
actor-critic architecture; fuzzy logic system (FLS); optimal control; predefined-time control; reinforcement learning (RL); TRACKING CONTROL; ALGORITHM;
D O I
10.1002/acs.4044
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
An adaptive predefined-time optimal control method based on fuzzy logic system (FLS) is presented in this paper for strict-feedback nonlinear systems. The settling time of predefined-time control can be set in advance, which is independent of the design parameters and initial conditions. The optimal control relies on solving the Hamilton-Jacobi-Bellman (HJB) equation, which is difficult to calculate directly due to its inherent nonlinearity. To overcome this difficulty, the reinforcement learning (RL) strategy of actor-critic structure is used, where the actor and critic are used to achieve control behavior and evaluate system performance, respectively. In addition, the RL algorithm is simplified, and the two conditions of persistence of excitation and known dynamics required for the most optimal controls are eliminated. The main objective of this article is to ensure that the tracking error converges to a small neighborhood near the origin within a predefined time, while minimizing energy consumption. Finally, the efficiency of the suggested control strategy is demonstrated by using a practical simulation example.
引用
收藏
页数:13
相关论文
共 48 条
[11]   Adaptive Neural Network Finite-Time Control for Multi-Input and Multi-Output Nonlinear Systems With Positive Powers of Odd Rational Numbers [J].
Li, Yongming ;
Li, Kewen ;
Tong, Shaocheng .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (07) :2532-2543
[12]   Online identifier-actor-critic algorithm for optimal control of nonlinear systems [J].
Lin, Hanquan ;
Wei, Qinglai ;
Liu, Derong .
OPTIMAL CONTROL APPLICATIONS & METHODS, 2017, 38 (03) :317-335
[13]   Predefined-time backstepping control for a nonlinear strict-feedback system [J].
Liu, Bojun ;
Hou, Mingshan ;
Wu, Cihang ;
Wang, Wencong ;
Wu, Zhonghua ;
Huang, Bing .
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2021, 31 (08) :3354-3372
[14]   Neural-network-observer-based optimal control for unknown nonlinear systems using adaptive dynamic programming [J].
Liu, Derong ;
Huang, Yuzhu ;
Wang, Ding ;
Wei, Qinglai .
INTERNATIONAL JOURNAL OF CONTROL, 2013, 86 (09) :1554-1566
[15]   Barrier Lyapunov Functions-based adaptive control for a class of nonlinear pure-feedback systems with full state constraints [J].
Liu, Yan-Jun ;
Tong, Shaocheng .
AUTOMATICA, 2016, 64 :70-75
[16]  
Miller W. T., 1995, "A Menu of Designs for Reinforcement Learning Over Time, 67-95,"
[17]   Optimal tracking control of nonlinear partially-unknown constrained-input systems using integral reinforcement learning [J].
Modares, Hamidreza ;
Lewis, Frank L. .
AUTOMATICA, 2014, 50 (07) :1780-1792
[18]   Adaptive Optimal Control of Unknown Constrained-Input Systems Using Policy Iteration and Neural Networks [J].
Modares, Hamidreza ;
Lewis, Frank L. ;
Naghibi-Sistani, Mohammad-Bagher .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2013, 24 (10) :1513-1525
[19]   Predefined-time control of cooperative manipulators [J].
Munoz-Vazquez, Aldo Jonathan ;
Sanchez-Torres, Juan Diego .
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2020, 30 (17) :7295-7306
[20]   Global Predefined Time and Accuracy Adaptive Neural Network Control for Uncertain Strict-Feedback Systems With Output Constraint and Dead Zone [J].
Ni, Junkang ;
Shi, Peng .
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2021, 51 (12) :7903-7918