Robust control for a class of nonlinear systems with input constraints based on actor-critic learning

被引:8
作者
Li, Dongdong [1 ,2 ,3 ]
Dong, Jiuxiang [1 ,2 ,3 ]
机构
[1] Northeastern Univ, Coll Informat Sci & Engn, Shenyang 110819, Peoples R China
[2] Northeastern Univ, Key Lab Vibrat & Control Aeroprop Syst, Minist Educ China, Shenyang, Peoples R China
[3] Northeastern Univ, State Key Lab Synthet Automat Proc Ind, Shenyang, Peoples R China
基金
中国国家自然科学基金;
关键词
actor-critic neural networks; adaptive dynamic programming; input constraints; robust control; OPTIMAL TRACKING CONTROL; ADAPTIVE OPTIMAL-CONTROL; SLIDING-MODE CONTROL; DESIGN; SUBJECT; STATE; ROBOT;
D O I
10.1002/rnc.6190
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article focuses on establishing a general robust actor-critic online learning control structure for disturbed nonlinear continuous systems with input constraints. It enriches the existing studies for the robustness of input constraint systems. First, the problem of robust controller design is successfully transformed into optimal controller design, and this process is proven, in which a particular nonquadratic discount cost function is defined. Then, build two neural networks (NNs) to estimate the cost function together and update each other. In the update process of actor NN, a robust term related to the state is introduced, which can guarantee the system's stability during the online learning process, and the state information is more fully utilized. Furthermore, using Lyapunov's direct method, it is proved that the estimated weights of the closed-loop optimal control system and the actor-critic NNs are uniformly ultimately bounded (UUB). It also provides extended discussions and a simulation example to demonstrate the robustness verification results of the novel algorithm.
引用
收藏
页码:7635 / 7654
页数:20
相关论文
共 51 条
[1]   Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach [J].
Abu-Khalaf, M ;
Lewis, FL .
AUTOMATICA, 2005, 41 (05) :779-791
[2]   Adaptive Reinforcement Learning Neural Network Control for Uncertain Nonlinear System With Input Saturation [J].
Bai, Weiwei ;
Zhou, Qi ;
Li, Tieshan ;
Li, Hongyi .
IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (08) :3433-3443
[3]   An Event-Triggered Approach for Load Frequency Control With Supplementary ADP [J].
Dong, Lu ;
Tang, Yufei ;
He, Haibo ;
Sun, Changyin .
IEEE TRANSACTIONS ON POWER SYSTEMS, 2017, 32 (01) :581-589
[4]   Robust tracking control of quadrotor via on-policy adaptive dynamic programming [J].
Dou, Liqian ;
Su, Xiaotong ;
Zhao, Xinyi ;
Zong, Qun ;
He, Lei .
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2021, 31 (07) :2509-2525
[5]   Adaptive Actor-Critic Design-Based Integral Sliding-Mode Control for Partially Unknown Nonlinear Systems With Input Disturbances [J].
Fan, Quan-Yong ;
Yang, Guang-Hong .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 27 (01) :165-177
[6]   Learning-Based Adaptive Optimal Tracking Control of Strict-Feedback Nonlinear Systems [J].
Gao, Weinan ;
Jiang, Zhong-Ping .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (06) :2614-2624
[7]   Output-feedback adaptive optimal control of interconnected systems based on robust adaptive dynamic programming [J].
Gao, Weinan ;
Jiang, Yu ;
Jiang, Zhong-Ping ;
Chai, Tianyou .
AUTOMATICA, 2016, 72 :37-45
[8]   Neural-network control of nonaffine nonlinear system with zero dynamics by state output feedback [J].
Ge, SS ;
Zhang, J .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2003, 14 (04) :900-918
[9]   Event-triggered guaranteed cost fault-tolerant optimal tracking control for uncertain nonlinear system via adaptive dynamic programming [J].
Guo, Zijie ;
Yao, Deyin ;
Bai, Weiwei ;
Li, Hongyi ;
Lu, Renquan .
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2021, 31 (07) :2572-2592
[10]   An event-triggered integer-mixed adaptive dynamic programming for switched nonlinear systems with bounded inputs [J].
Han, Xiumei ;
Qin, Haiqin ;
Wang, Zhitao ;
Xu, Ning ;
Zhao, Xudong ;
Zhao, Jinfeng .
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2021, 31 (15) :7280-7297