Reinforcement learning-based optimized backstepping control of nonlinear strict feedback system with unknown control gain function

被引：7

作者：

Zhou, Ranran ^{[1
]}

Wen, Guoxing ^{[1
,2
]}

Li, Bin ^{[1
]}

机构：

[1] Qilu Univ Technol, Sch Math & Stat, Shandong Acad Sci, Jinan, Peoples R China

[2] Binzhou Univ, Coll Sci, Binzhou 256600, Peoples R China

来源：

OPTIMAL CONTROL APPLICATIONS & METHODS | 2022年 / 43卷 / 05期

基金：

中国国家自然科学基金;

关键词：

critic-actor architecture; neural network; nonlinear system; optimal control; reinforcement learning (RL); TRACKING CONTROL; INPUT; STATE;

D O I：

10.1002/oca.2895

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The article develops an optimized control via learning from the design idea of optimized backstepping (OB) technique for the nonlinear strict feedback systems containing the unknown dynamic and control gain functions. OB technique requires to deal with the actual and virtual controls of backstepping as the optimized solutions of corresponding subsystems so that the entire backstepping control is optimized. In the work, for achieving the optimization control, reinforcement learning (RL) of critic-actor structure is constructed in every backstepping step on the basis of the neural network approximation of the Hamilton-Jacobi-Bellman equation's solution. Since the unknown nonlinear control gain function is considered, the complexity of control algorithm is greatly increased. However, the proposed RL is with the simple training laws, it can greatly alleviate the algorithm complexity for the optimized control. Finally, the feasibility of the method is demonstrated by both theory and simulation.

引用

页码：1358 / 1378

页数：21

共 35 条

[1] DYNAMIC PROGRAMMING [J].

BELLMAN, R .

SCIENCE, 1966, 153 (3731) :34-&

[2] Learning-Based Adaptive Optimal Tracking Control of Strict-Feedback Nonlinear Systems [J].

Gao, Weinan ;

Jiang, Zhong-Ping .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (06) :2614-2624

[3] Direct adaptive NN control of a class of nonlinear systems [J].

Ge, SS ;

Wang, C .

IEEE TRANSACTIONS ON NEURAL NETWORKS, 2002, 13 (01) :214-221

[4] Adaptive neural network control of nonlinear systems by state and output feedback [J].

Ge, SS ;

Hang, CC ;

Zhang, T .

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 1999, 29 (06) :818-828

[5] Adaptive Neural Network Control of an Uncertain Robot With Full-State Constraints [J].

He, Wei ;

Chen, Yuhao ;

Yin, Zhao .

IEEE TRANSACTIONS ON CYBERNETICS, 2016, 46 (03) :620-629

[6]

Kiumarsikhomartash B., 2015, OPTIMAL TRACKING CON

[7] Observer-Based Adaptive Fuzzy Fault-Tolerant Optimal Control for SISO Nonlinear Systems [J].

Li, Yongming ;

Sun, Kangkang ;

Tong, Shaocheng .

IEEE TRANSACTIONS ON CYBERNETICS, 2019, 49 (02) :649-661

[8] Event-triggered adaptive backstepping control for parametric strict-feedback nonlinear systems [J].

Li, Yuan-Xin ;

Yang, Guang-Hong .

INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2018, 28 (03) :976-1000

[9] Adaptive Tracking Control for Perturbed Strict-Feedback Nonlinear Systems Based on Optimized Backstepping Technique [J].

Liu, Yongchao ;

Zhu, Qidan ;

Wen, Guoxing .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (02) :853-865

[10] Optimal tracking control of nonlinear partially-unknown constrained-input systems using integral reinforcement learning [J].

Modares, Hamidreza ;

Lewis, Frank L. .

AUTOMATICA, 2014, 50 (07) :1780-1792

← 1 2 3 4 →