Distributed Fault-Tolerant Containment Control Protocols for the Discrete-Time Multiagent Systems via Reinforcement Learning Method

被引:141
作者
Li, Tieshan [1 ,2 ,3 ]
Bai, Weiwei [4 ]
Liu, Qi [5 ]
Long, Yue [4 ]
Chen, C. L. Philip [1 ,6 ]
机构
[1] Univ Elect Sci & Technol China, Sch Automat Engn, Chengdu 611731, Peoples R China
[2] Univ Elect Sci & Technol China, Yangtze Delta Reg Inst, Huzhou 313000, Peoples R China
[3] Dalian Maritime Univ, Nav Coll, Dalian 116026, Peoples R China
[4] Univ Elect Sci & Technol China, Sch Automat Engn, Chengdu 611731, Peoples R China
[5] Dalian Maritime Univ, Nav Coll, Dalian 116026, Peoples R China
[6] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510641, Peoples R China
基金
中国国家自然科学基金;
关键词
Actuators; Fault tolerant systems; Fault tolerance; Control systems; Protocols; Artificial neural networks; Reinforcement learning; Containment control; fault-tolerant control (FTC); multiagent systems (MASs); reinforcement learning (RL); OPTIMAL CONSENSUS CONTROL; FEEDBACK; ALGORITHM; DYNAMICS;
D O I
10.1109/TNNLS.2021.3121403
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article investigates the model-free fault-tolerant containment control problem for multiagent systems (MASs) with time-varying actuator faults. Depending on the relative state information of neighbors, a distributed containment control method based on reinforcement learning (RL) is adopted to achieve containment control objective without prior knowledge on the system dynamics. First, based on the information of agent itself and its neighbors, a containment error system is established. Then, the optimal containment control problem is transformed into an optimal regulation problem for the containment error system. Furthermore, the RL-based policy iteration method is employed to deal with the corresponding optimal regulation problem, and the nominal controller is proposed for the original fault-free system. Based on the nominal controller, a fault-tolerant controller is further developed to compensate for the influence of actuator faults on MAS. Meanwhile, the uniform boundedness of the containment errors can be guaranteed by using the presented control scheme. Finally, numerical simulations are given to show the effectiveness and advantages of the proposed method.
引用
收藏
页码:3979 / 3991
页数:13
相关论文
共 41 条
[1]   NN Reinforcement Learning Adaptive Control for a Class of Nonstrict-Feedback Discrete-Time Systems [J].
Bai, Weiwei ;
Li, Tieshan ;
Tong, Shaocheng .
IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (11) :4573-4584
[2]   Event-Triggered Multigradient Recursive Reinforcement Learning Tracking Control for Multiagent Systems [J].
Bai, Weiwei ;
Li, Tieshan ;
Long, Yue ;
Chen, C. L. Philip .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (01) :366-379
[3]   Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation [J].
Beard, RW ;
Saridis, GN ;
Wen, JT .
AUTOMATICA, 1997, 33 (12) :2159-2177
[4]   Distributed adaptive fault-tolerant containment control for a class of multi-agent systems with non-identical matching non-linear functions [J].
Deng, Chao ;
Yang, Guang-Hong .
IET CONTROL THEORY AND APPLICATIONS, 2016, 10 (03) :273-281
[5]   Periodic Event-Triggered Synchronization for Discrete-Time Complex Dynamical Networks [J].
Ding, Sanbo ;
Wang, Zhanshan ;
Xie, Xiangpeng .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (08) :3622-3633
[6]   Containment control of heterogeneous linear multi-agent systems [J].
Haghshenas, Hamed ;
Badamchizadeh, Mohammad Ali ;
Baradarannia, Mandi .
AUTOMATICA, 2015, 54 :210-216
[7]   Reinforcement learning neural-network-based controller for nonlinear discrete-time systems with input constraints [J].
He, Pingan ;
Jagannathan, S. .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2007, 37 (02) :425-436
[8]   Containment control in mobile networks [J].
Ji, M. ;
Ferrari-Trecate, G. ;
Egerstedt, M. ;
Buffa, A. .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2008, 53 (08) :1972-1975
[9]   Leader-follower containment control over directed random graphs [J].
Kan, Zhen ;
Shea, John M. ;
Dixon, Warren E. .
AUTOMATICA, 2016, 66 :56-62
[10]   Distributed Fault-Tolerant Control of Multiagent Systems: An Adaptive Learning Approach [J].
Khalili, Mohsen ;
Zhang, Xiaodong ;
Cao, Yongcan ;
Polycarpou, Marios M. ;
Parisini, Thomas .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (02) :420-432