Optimal Incremental-containment Control of Two-order Swarm System Based on Reinforcement Learning

被引:1
作者
Chen, Haipeng [1 ]
Fu, Wenxing [1 ]
Liu, Junmin [2 ]
Yu, Dengxiu [3 ]
Chen, Kang [1 ]
机构
[1] Northwestern Polytech Univ, Sch Astronaut, Xian, Peoples R China
[2] Inner Mongolia North Heavy Ind Grp Co Ltd, Baotou, Peoples R China
[3] Northwestern Polytech Univ, Unmanned Syst Res Inst, Xian 710072, Peoples R China
基金
中国国家自然科学基金;
关键词
Backstepping; Lypunov function; optimal incremental-containment control; reinforcement learning; swarm system; MULTIAGENT SYSTEMS;
D O I
10.1007/s12555-022-0710-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, the optimal incremental-containment control of two-order swarm system based on reinforcement learning (RL) is proposed to avoid the dilemma that the number of agents in a swarm system is immutable, which is essential for a swarm system that cannot meet the containment demands and need more agents to expand the containment range. Notably, the number of agents in a swarm system with a traditional containment controller is immutable, which limits the containment range that the swarm system can achieve. Besides, in traditional optimal control theory, it is obtained by solving the Hamilton-Jacobi-Bellman (HJB) equation, which is difficult to solve due to the unknown nonlinearity. To overcome these problems, several contributions are made in this paper. Firstly, in order to overcome the dilemma that the number of agents in the swarm system is immutable, the incremental-containment control is proposed. Secondly, considering the error and control input as the optimization goal, the optimal cost function is introduced and the optimal incremental-containment control is proposed to reduce resource waste and increase hardware service life. Furthermore, based on the proposed optimal incremental-containment control, the controller is designed by a new RL based on the backstepping method. The Lyapunov function is used to prove the stability of controller. The simulation show the efficiency of the proposed controller.
引用
收藏
页码:3443 / 3455
页数:13
相关论文
共 39 条
[1]   DYNAMIC PROGRAMMING [J].
BELLMAN, R .
SCIENCE, 1966, 153 (3731) :34-&
[2]   Simplified optimized finite-time containment control for a class of multi-agent systems with actuator faults [J].
Cui, Jingpeng ;
Pan, Yingnan ;
Xue, Hong ;
Tan, Lihua .
NONLINEAR DYNAMICS, 2022, 109 (04) :2799-2816
[3]   Theory and Experiment on Formation-Containment Control of Multiple Multirotor Unmanned Aerial Vehicle Systems [J].
Dong, Xiwang ;
Hua, Yongzhao ;
Zhou, Yan ;
Ren, Zhang ;
Zhong, Yisheng .
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2019, 16 (01) :229-240
[4]   A tutorial on optimal control and reinforcement learning methods for quantum technologies [J].
Giannelli, Luigi ;
Sgroi, Sofia ;
Brown, Jonathon ;
Paraoanu, Gheorghe Sorin ;
Paternostro, Mauro ;
Paladino, Elisabetta ;
Falci, Giuseppe .
PHYSICS LETTERS A, 2022, 434
[5]   Swarm control for large-scale omnidirectional mobile robots within incremental behavior [J].
Jin, Xiaoyue ;
Wang, Zhen ;
Zhao, Junsheng ;
Yu, Dengxiu .
INFORMATION SCIENCES, 2022, 614 :35-50
[6]   Optimal consensus control for unknown second-order multi-agent systems: Using model-free reinforcement learning method [J].
Li, Jun ;
Ji, Lianghao ;
Li, Huaqing .
APPLIED MATHEMATICS AND COMPUTATION, 2021, 410
[7]   Distributed Fault-Tolerant Containment Control Protocols for the Discrete-Time Multiagent Systems via Reinforcement Learning Method [J].
Li, Tieshan ;
Bai, Weiwei ;
Liu, Qi ;
Long, Yue ;
Chen, C. L. Philip .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (08) :3979-3991
[8]   Containment Control of Semi-Markovian Multiagent Systems With Switching Topologies [J].
Liang, Hongjing ;
Zhang, Linchuang ;
Sun, Yonghui ;
Huang, Tingwen .
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2021, 51 (06) :3889-3899
[9]  
Lions PL, 1992, ICIAM 91 P 2 INT C I, V61, P182
[10]   Distributed adaptive fuzzy control approach for prescribed-time containment of uncertain nonlinear multi-agent systems with unknown hysteresis [J].
Liu, Dacai ;
Liu, Zhi ;
Chen, C. L. Philip ;
Zhang, Yun .
NONLINEAR DYNAMICS, 2021, 105 (01) :257-275