Hierarchical Sliding-Mode Surface-Based Adaptive Actor-Critic Optimal Control for Switched Nonlinear Systems With Unknown Perturbation

被引:124
作者
Zhang, Haoyan [1 ]
Zhao, Xudong [1 ,2 ]
Wang, Huanqing [3 ]
Zong, Guangdeng [4 ]
Xu, Ning [5 ]
机构
[1] Bohai Univ, Coll Control Sci & Engn, Jinzhou 121013, Peoples R China
[2] Dalian Univ Technol, Fac Elect Informat & Elect Engn, Dalian 116024, Peoples R China
[3] Bohai Univ, Coll Math Sci, Jinzhou 121013, Peoples R China
[4] Tiangong Univ, Coll Control Sci & Engn, Tianjin 300387, Peoples R China
[5] Bohai Univ, Coll Informat Sci & Technol, Jinzhou 121013, Peoples R China
基金
中国国家自然科学基金;
关键词
Optimal control; Switches; Perturbation methods; Adaptive systems; Artificial neural networks; Control systems; Uncertainty; Actor-critic (AC) neural networks (NNs) architecture; adaptive optimal control; hierarchical sliding-mode surface (HSMS); switched nonlinear systems; unknown perturbation; APPROXIMATE OPTIMAL-CONTROL; TRAJECTORY TRACKING; DESIGN;
D O I
10.1109/TNNLS.2022.3183991
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article studies the hierarchical sliding-mode surface (HSMS)-based adaptive optimal control problem for a class of switched continuous-time (CT) nonlinear systems with unknown perturbation under an actor-critic (AC) neural networks (NNs) architecture. First, a novel perturbation observer with a nested parameter adaptive law is designed to estimate the unknown perturbation. Then, by constructing an especial cost function related to HSMS, the original control issue is further converted into the problem of finding a series of optimal control policies. The solution to the HJB equation is identified by the HSMS-based AC NNs, where the actor and critic updating laws are developed to implement the reinforcement learning (RL) strategy simultaneously. The critic update law is designed via the gradient descent approach and the principle of standardization, such that the persistence of excitation (PE) condition is no longer needed. Based on the Lyapunov stability theory, all the signals of the closed-loop switched nonlinear systems are strictly proved to be bounded in the sense of uniformly ultimate boundedness (UUB). Finally, the simulation results are presented to verify the validity of the proposed adaptive optimal control scheme.
引用
收藏
页码:1559 / 1571
页数:13
相关论文
共 45 条
[1]   Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach [J].
Abu-Khalaf, M ;
Lewis, FL .
AUTOMATICA, 2005, 41 (05) :779-791
[2]   A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems [J].
Bhasin, S. ;
Kamalapurkar, R. ;
Johnson, M. ;
Vamvoudakis, K. G. ;
Lewis, F. L. ;
Dixon, W. E. .
AUTOMATICA, 2013, 49 (01) :82-92
[3]   Dynamic programming for constrained optimal control of discrete-time linear hybrid systems [J].
Borrelli, F ;
Baotic, M ;
Bemporad, A ;
Morari, M .
AUTOMATICA, 2005, 41 (10) :1709-1721
[4]   Semi-global adaptive backstepping control for parametric strict-feedback systems with non-triangular structural uncertainties? [J].
Cai, Jianping ;
Mei, Congli ;
Yan, Qiuzhen .
ISA TRANSACTIONS, 2022, 126 :180-189
[5]   Adaptive Sliding Mode Control of Multi-Input Nonlinear Systems With Perturbations to Achieve Asymptotical Stability [J].
Chang, Yaote .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2009, 54 (12) :2863-2869
[6]   Command filtering-based adaptive neural network control for uncertain switched nonlinear systems using event-triggered communication [J].
Chen, Zhongyu ;
Niu, Ben ;
Zhang, Liang ;
Zhao, Jinfeng ;
Ahmad, Adil M. ;
Alassafi, Madini O. .
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2022, 32 (11) :6507-6522
[7]   Adaptive Actor-Critic Design-Based Integral Sliding-Mode Control for Partially Unknown Nonlinear Systems With Input Disturbances [J].
Fan, Quan-Yong ;
Yang, Guang-Hong .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 27 (01) :165-177
[8]   Integral Reinforcement Learning-Based Adaptive NN Control for Continuous-Time Nonlinear MIMO Systems With Unknown Control Directions [J].
Guo, Xinxin ;
Yan, Weisheng ;
Cui, Rongxin .
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (11) :4068-4077
[9]   Adaptive-Critic Design for Decentralized Event-Triggered Control of Constrained Nonlinear Interconnected Systems Within an Identifier-Critic Framework [J].
Huo, Xin ;
Karimi, Hamid Reza ;
Zhao, Xudong ;
Wang, Bohui ;
Zong, Guangdeng .
IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (08) :7478-7491
[10]   Adaptive Fuzzy Hierarchical Sliding-Mode Control for the Trajectory Tracking of Uncertain Underactuated Nonlinear Dynamic Systems [J].
Hwang, Chih-Lyang ;
Chiang, Chiang-Cheng ;
Yeh, Yao-Wei .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2014, 22 (02) :286-299