Reinforcement learning intermittent optimal formation control for multi-agent systems with disturbances

Cited by: 0
Authors
Liu, Erliang [1]
Miao, Guoying [1]
Hu, Jingyu [1]
Affiliations
[1] Nanjing Univ Informat Sci & Technol, Sch Automat, Nanjing 210044, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
formation control; multi-agent systems; disturbance observer; intermittent event-triggered; ADP; NONLINEAR-SYSTEMS; CONSENSUS; SYNCHRONIZATION; ALGORITHM; TRACKING; DESIGN;
DOI
10.1088/1361-6501/ad7a18
Chinese Library Classification
T [Industrial Technology]
Discipline classification code
08
Abstract
This paper investigates disturbance-resistant intermittent event-triggered optimal formation control of second-order multi-agent systems using reinforcement learning, taking into account network damage, including denial-of-service (DoS) and deception attacks, as well as stochastic noise and unknown external disturbances. First, we propose a novel adaptive-control-based disturbance observer that estimates unknown external disturbances under an event-triggered mechanism. Second, using the disturbance estimates, an innovative intermittent event-triggered optimal formation algorithm is given. By applying Lyapunov stability and stochastic stability theory, sufficient conditions are derived that guarantee all agents achieve the desired formation in the mean-square sense. Additionally, in the model-free case, the optimal controller is solved using the least squares method, which is computationally less complex than some existing approaches. Finally, the theoretical results are validated through simulation examples.
Pages: 17