Adaptive reinforcement learning optimal tracking control for strict-feedback nonlinear systems with prescribed performance

被引：40

作者：

Huang, Zongsheng ^{[1
]}

Bai, Weiwei ^{[1
]}

Li, Tieshan ^{[1
,2
,3
,5
,6
,7
]}

Long, Yue ^{[1
]}

Chen, C. L. Philip ^{[1
,4
]}

Liang, Hongjing ^{[1
]}

Yang, Hanqing ^{[1
]}

机构：

[1] Univ Elect Sci & Technol China, Sch Automation Engn, Chengdu 611731, Sichuan, Peoples R China

[2] Univ Elect Sci & Technol China, Yangtze Delta Reg Inst, Huzhou 313000, Peoples R China

[3] Lab Electromagnet Space Cognit & Intelligent Contr, Beijing 100089, Peoples R China

[4] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510641, Guangdong, Peoples R China

[5] Univ Elect Sci & Technol China, Sch Automation Engn, Chengdu 611731, Sichuan, Peoples R China

[6] Univ Elect Sci & Technol China, Yangtze Delta Reg Inst, Huzhou 313000, Peoples R China

[7] Lab Electromagnet Space Cognit & Intelligent Contr, Beijing 100089, Peoples R China

来源：

INFORMATION SCIENCES | 2023年 / 621卷

基金：

中国国家自然科学基金;

关键词：

Reinforcement learning; Prescribed performance control; Adaptive dynamic programming; Tracking control; Strict-feedback nonlinear systems; OPTIMAL-CONTROL DESIGN; NEURAL-CONTROL;

D O I：

10.1016/j.ins.2022.11.109

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The reinforcement learning-based prescribed performance optimal tracking control prob-lem is considered for a class of strict-feedback nonlinear systems in this paper. The unknown nonlinearities and cost function are approximated by radial-basis-function (RBF) neural network (NN). The overall controller consists of an adaptive controller and an optimal compensation term. Firstly, the adaptive controller is designed by backstepping control method. Subsequently, the optimal compensation term is derived via policy itera-tion by minimizing cost function. In addition, depending on the prescribed performance control, the tracking error can be limited in the prescribed area. Therefore, the whole con-trol scheme can effectively guarantee that the tracking error converges to a bound with prescribed performance while the cost function is minimized. The stability analysis shows that all signals in the closed-loop system are bounded. Finally, the effectiveness and advan-tages of the designed control strategy are illustrated by the simulation examples.(c) 2022 Elsevier Inc. All rights reserved.

引用

页码：407 / 423

页数：17

共 46 条

[1] Event-Triggered Multigradient Recursive Reinforcement Learning Tracking Control for Multiagent Systems [J].

Bai, Weiwei ;

Li, Tieshan ;

Long, Yue ;

Chen, C. L. Philip .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (01) :366-379

[2] Prescribed Performance Adaptive Control for Multi-Input Multi-Output Affine in the Control Nonlinear Systems [J].

Bechlioulis, Charalampos P. ;

Rovithakis, George A. .

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2010, 55 (05) :1220-1226

[3] Adaptive control with guaranteed transient and steady state tracking error bounds for strict feedback systems [J].

Bechlioulis, Charalampos P. ;

Rovithakis, George A. .

AUTOMATICA, 2009, 45 (02) :532-538

[4] H∞ optimal control for semi-Markov jump linear systems via TP-free temporal difference (λ) learning [J].

Chen, Yaogang ;

Wen, Jiwei ;

Luan, Xiaoli ;

Liu, Fei .

INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2021, 31 (14) :6905-6916

[5] Two-loop reinforcement learning algorithm for finite-horizon optimal control of continuous-time affine nonlinear systems [J].

Chen, Zhe ;

Xue, Wenqian ;

Li, Ning ;

Lewis, Frank L. .

INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2022, 32 (01) :393-420

[6] Constrained Decoupling Adaptive Dynamic Programming for A Partially Uncontrollable Time-Delayed Model of Energy Systems [J].

Chen, Zitao ;

Chen, Si-Zhe ;

Chen, Kairui ;

Zhang, Yun .

INFORMATION SCIENCES, 2022, 608 :1352-1374

[7] Fixed-Time Prescribed Performance Adaptive Trajectory Tracking Control for a QUAV [J].

Cui, Guozeng ;

Yang, Wei ;

Yu, Jinpeng ;

Li, Ze ;

Tao, Chongben .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (02) :494-498

[8] Event-triggered output-feedback adaptive tracking control of autonomous underwater vehicles using reinforcement learning [J].

Deng, Yingjie ;

Liu, Tao ;

Zhao, Dingxuan .

APPLIED OCEAN RESEARCH, 2021, 113

[9] Broad learning system-based adaptive optimal control design for dynamic positioning of marine vessels [J].

Gao, Xiaoyang ;

Bai, Weiwei ;

Li, Tieshan ;

Yuan, Liang'en ;

Long, Yue .

NONLINEAR DYNAMICS, 2021, 105 (02) :1593-1609

[10] Adaptive neural control of uncertain MIMO nonlinear systems [J].

Ge, SS ;

Wang, C .

IEEE TRANSACTIONS ON NEURAL NETWORKS, 2004, 15 (03) :674-692

← 1 2 3 4 5 →