Adaptive output feedback reinforcement learning control for continuous time switched stochastic nonlinear systems with unknown control coefficients and full-state constraints

被引:5
作者
Li, Hongyao [1 ]
Wang, Fuli [2 ,3 ]
机构
[1] Northeastern Univ, Coll Informat Sci & Engn, Shenyang, Liaoning, Peoples R China
[2] Northeastern Univ, Coll Informat Sci & Engn, State Key Lab Synthet Automat Proc Ind, Shenyang, Liaoning, Peoples R China
[3] Northeastern Univ, Coll Informat Sci & Engn, State Key Lab Synthet Automat Proc Ind, Shenyang 110000, Liaoning, Peoples R China
关键词
Adaptive control; ADT; backstepping; NN; RL; switched stochastic nonlinear systems; PRESCRIBED PERFORMANCE; TRACKING CONTROL; NEURAL-CONTROL; STABILITY; DESIGN; STABILIZATION;
D O I
10.1080/00207721.2023.2272217
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, the problem of adaptive neural network (NN) reinforcement learning (RL) tracking control is investigated for the continuous time (CT) switched stochastic nonlinear systems with unknown control coefficients and full-state constraints. First, a set of reconstructed states are defined to handle the unknown control coefficients, and switched state observers are developed to estimate unmeasurable reconstructed states. Then, to improve the tracking performance, based on the minimal learning parameter (MLP) method and the RL control design technique, the adaptive RL controller is developed by the backstepping method. Finally, the boundedness of the tracking error and all signals is demonstrated via the average dwell time (ADT) method and tangent type time-varying barrier multiple Lyapunov functions. The effectiveness of the proposed scheme is verified by two examples.
引用
收藏
页码:332 / 354
页数:23
相关论文
共 46 条
[1]   NN Reinforcement Learning Adaptive Control for a Class of Nonstrict-Feedback Discrete-Time Systems [J].
Bai, Weiwei ;
Li, Tieshan ;
Tong, Shaocheng .
IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (11) :4573-4584
[2]   Quasi-Time-Dependent Output Control for Discrete-Time Switched System With Mode-Dependent Average Dwell Time [J].
Fei, Zhongyang ;
Shi, Shuang ;
Wang, Zhenhuan ;
Wu, Ligang .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2018, 63 (08) :2647-2653
[3]   Adaptive Neural Control Using Tangent Time-Varying BLFs for a Class of Uncertain Stochastic Nonlinear Systems With Full State Constraints [J].
Gao, Tingting ;
Liu, Yan-Jun ;
Li, Dapeng ;
Tong, Shaocheng ;
Li, Tieshan .
IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (04) :1943-1953
[4]   Integral Reinforcement Learning-Based Adaptive NN Control for Continuous-Time Nonlinear MIMO Systems With Unknown Control Directions [J].
Guo, Xinxin ;
Yan, Weisheng ;
Cui, Rongxin .
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (11) :4068-4077
[5]   Adaptive neural control for a class of switched nonlinear systems [J].
Han, Thanh-Trung ;
Ge, Shuzhi Sam ;
Lee, Tong Heng .
SYSTEMS & CONTROL LETTERS, 2009, 58 (02) :109-118
[6]   Reinforcement learning-based output feedback control of nonlinear systems with input constraints [J].
He, P ;
Jagannathan, S .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2005, 35 (01) :150-154
[7]   Adaptive Neural Network Control of an Uncertain Robot With Full-State Constraints [J].
He, Wei ;
Chen, Yuhao ;
Yin, Zhao .
IEEE TRANSACTIONS ON CYBERNETICS, 2016, 46 (03) :620-629
[8]  
Hespanha J. P., 1999, Proceedings of the 38th IEEE Conference on Decision and Control (Cat. No.99CH36304), P2655, DOI 10.1109/CDC.1999.831330
[9]   On modeling and secure control of cyber-physical systems with attacks/faults changing system dynamics: An average dwell-time approach [J].
Huang, Xin ;
Dong, Jiuxiang .
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2019, 29 (16) :5481-5498
[10]   Improved ADT Method for the Switched Nonlinear System: Stability Analysis and Application [J].
Jin, Cui-Li ;
Wang, Qing-Guo ;
Wang, Rui ;
Wu, Di .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (08) :3580-3584