Adaptive output-feedback control for a class of nonlinear systems based on optimized backstepping technique

被引:4
作者
Shen, Fei [1 ]
Wang, Xinjun [2 ]
Li, Haotian [1 ]
Yin, Xinghui [1 ]
机构
[1] Hohai Univ, Sch Comp & Informat, Nanjing, Peoples R China
[2] Shandong Normal Univ, Coll Informat Sci & Engn, Jinan 250000, Peoples R China
关键词
actor-critic architecture; neural networks (NNs); optimized backstepping (OB); output-feedback control; reinforcement learning (RL);
D O I
10.1002/acs.3397
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this article, an adaptive output-feedback control for a class of strict-feedback nonlinear systems is developed based on optimized backstepping technique. Neural networks are utilized to approximate unknown functions, while a state observer is designed to estimate the unmeasurable system state signals. Since the presented optimized control scheme requires training the adaptive parameters for reinforcement learning (RL), it will be more challenging for designing control algorithms and deriving the adaptive update rates. In general, optimization control is designed based on the solution of Hamilton-Jacobi-Bellman equation, but solving the equation is very difficult due to the inherent nonlinearity and intractability. So, RL strategy of actor-critic architecture is used. According to the Lyapunov stability theory, it is proved that all signals of the closed-loop systems are semi-global uniformly ultimately bounded. Finally, the results of the simulation cases are provided to show the effectiveness of the designed controller scheme.
引用
收藏
页码:1077 / 1097
页数:21
相关论文
共 30 条
[1]   NN Reinforcement Learning Adaptive Control for a Class of Nonstrict-Feedback Discrete-Time Systems [J].
Bai, Weiwei ;
Li, Tieshan ;
Tong, Shaocheng .
IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (11) :4573-4584
[2]   DYNAMIC PROGRAMMING [J].
BELLMAN, R .
SCIENCE, 1966, 153 (3731) :34-&
[3]   Hamilton-Jacobi-Bellman Equation and Feedback Synthesis for Impulsive Control [J].
Fraga, Sergio Loureiro ;
Pereira, Fernando Lobo .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2012, 57 (01) :244-249
[4]   Direct adaptive NN control of a class of nonlinear systems [J].
Ge, SS ;
Wang, C .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2002, 13 (01) :214-221
[5]   Backstepping Control for Nonlinear Systems With Time Delays and Applications to Chemical Reactor Systems [J].
Hua, Changchun ;
Liu, Peter X. ;
Guan, Xinping .
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2009, 56 (09) :3723-3732
[6]  
Ito K., 1999, OPTIMAL CONTROL
[7]   A dynamic recurrent neural-network-based adaptive observer for a class of nonlinear systems [J].
Kim, YH ;
Lewis, FL ;
Abdallah, CT .
AUTOMATICA, 1997, 33 (08) :1539-1543
[8]  
Kokotovic PV, 1991, ADAPTIVE FEEDBACK LI, V160, P309
[9]  
Kristic M., 1995, NONLINEAR ADAPTIVE C
[10]   Adaptive Fuzzy Output-Feedback Stabilization Control for a Class of Switched Nonstrict-Feedback Nonlinear Systems [J].
Li, Yongming ;
Tong, Shaocheng .
IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (04) :1007-1016