Research on Efficient Multiagent Reinforcement Learning for Multiple UAVs' Distributed Jamming Strategy

被引：0

作者：

Ran, Weizhi ^{[1
]}

Luo, Rong ^{[2
]}

Zhang, Funing ^{[3
]}

Luo, Renwei ^{[1
]}

Xu, Yang ^{[1
]}

机构：

[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China

[2] PLA, Naval Res Acad, Beijing 100161, Peoples R China

[3] Xian Univ Architecture & Technol, Sch Informat & Control Engn, Xian 710055, Peoples R China

来源：

ELECTRONICS | 2023年 / 12卷 / 18期

关键词：

multiagent reinforcement learning; IPPO learning algorithm; multiple UAVs distributed jamming strategy; LEVEL; GAME; GO;

D O I：

10.3390/electronics12183874

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

To support Unmanned Aerial Vehicle (UAV) joint electromagnetic countermeasure decisions in real time, coordinating multiple UAVs for efficiently jamming distributed hostile radar stations requires complex and highly flexible strategies. However, with the nature of the high complexity dimension and partial observation of the electromagnetic battleground, no such strategy can be generated by pre-coded software or decided by a human commander. In this paper, an initial effort is made to integrate multiagent reinforcement learning, which has been proven to be effective in game strategy generation, into the distributed airborne electromagnetic countermeasures domain. The key idea is to design a training simulator which close to a real electromagnetic countermeasure strategy game, so that we can easily collect huge valuable training data other than in the real battle ground which is sparse and far less than sufficient. In addition, this simulator is able to simulate all the necessary decision factors for multiple UAV coordination, so that multiagents can freely search for their optimal joint strategies with our improved Independent Proximal Policy Optimization (IPPO) learning algorithm which suits the game well. In the last part, a typical domain scenario is built to test, and the use case and experiment results manifest that the design is efficient in coordinating a group of UAVs equipped with lightweight jamming devices. Their coordination strategies are not only capable of handling given jamming tasks for the dynamic jamming of hostile radar stations but also beat expectations. The reinforcement learning algorithm can do some heuristic searches to help the group find the tactical vulnerabilities of the enemies and improve the multiple UAVs' jamming performance.

引用

页数：12

共 26 条

[11] Multi-Agent Deep Reinforcement Learning for Multi-Robot Applications: A Survey
Orr, James
Dutta, Ayan
[J]. SENSORS, 2023, 23 (07)
[12] Schulman J, 2017, Arxiv, DOI arXiv:1707.06347
[13] Silver D, 2014, PR MACH LEARN RES, V32
[14] Mastering the game of Go without human knowledge
Silver, David
Schrittwieser, Julian
Simonyan, Karen
Antonoglou, Ioannis
Huang, Aja
Guez, Arthur
Hubert, Thomas
Baker, Lucas
Lai, Matthew
Bolton, Adrian
Chen, Yutian
Lillicrap, Timothy
Hui, Fan
Sifre, Laurent
van den Driessche, George
Graepel, Thore
Hassabis, Demis
[J]. NATURE, 2017, 550 (7676) : 354 - +
[15] Mastering the game of Go with deep neural networks and tree search
Silver, David
Huang, Aja
Maddison, Chris J.
Guez, Arthur
Sifre, Laurent
van den Driessche, George
Schrittwieser, Julian
Antonoglou, Ioannis
Panneershelvam, Veda
Lanctot, Marc
Dieleman, Sander
Grewe, Dominik
Nham, John
Kalchbrenner, Nal
Sutskever, Ilya
Lillicrap, Timothy
Leach, Madeleine
Kavukcuoglu, Koray
Graepel, Thore
Hassabis, Demis
[J]. NATURE, 2016, 529 (7587) : 484 - +
[16] Soleyman S., 2020, P AAAI S 2 WORKSH DE
[17] Sutton RS, 2018, ADAPT COMPUT MACH LE, P1
[18] Grandmaster level in StarCraft II using multi-agent reinforcement learning
Vinyals, Oriol
Babuschkin, Igor
Czarnecki, Wojciech M.
Mathieu, Michael
Dudzik, Andrew
Chung, Junyoung
Choi, David H.
Powell, Richard
Ewalds, Timo
Georgiev, Petko
Oh, Junhyuk
Horgan, Dan
Kroiss, Manuel
Danihelka, Ivo
Huang, Aja
Sifre, Laurent
Cai, Trevor
Agapiou, John P.
Jaderberg, Max
Vezhnevets, Alexander S.
Leblond, Remi
Pohlen, Tobias
Dalibard, Valentin
Budden, David
Sulsky, Yury
Molloy, James
Paine, Tom L.
Gulcehre, Caglar
Wang, Ziyu
Pfaff, Tobias
Wu, Yuhuai
Ring, Roman
Yogatama, Dani
Wunsch, Dario
McKinney, Katrina
Smith, Oliver
Schaul, Tom
Lillicrap, Timothy
Kavukcuoglu, Koray
Hassabis, Demis
Apps, Chris
Silver, David
[J]. NATURE, 2019, 575 (7782) : 350 - +
[19] Wang X., 2023, P 2023 IEEE T AUT SC
[20] Wu KD, 2016, 2016 IEEE CHINESE GUIDANCE, NAVIGATION AND CONTROL CONFERENCE (CGNCC), P930, DOI 10.1109/CGNCC.2016.7828910

← 1 2 3 →