Hierarchical Multi-Agent Training Based on Reinforcement Learning

被引：1

作者：

Wang, Guanghua ^{[1
]}

Li, Wenjie ^{[2
]}

Wu, Zhanghua ^{[3
]}

Guo, Xian ^{[1
]}

机构：

[1] Nankai Univ, Inst Robot & Automat Informat Syst, Tianjin, Peoples R China

[2] State Grid Tianjin Elect Power Co, Tianjin, Peoples R China

[3] Jiangsu Automat Res Inst, Lianyungang, Jiangsu, Peoples R China

来源：

2024 9TH ASIA-PACIFIC CONFERENCE ON INTELLIGENT ROBOT SYSTEMS, ACIRS | 2024年

关键词：

Multi-Agent Systems; Reinforcement Learning; Multi-Agent Proximal Policy Optimization Algorithm; Formation Confrontation;

D O I：

10.1109/ACIRS62330.2024.10684909

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

In the current multi-UAV adversarial games, issues exist such as the instability and difficulty in learning distributed strategies, as well as a lack of coordinated formation UAVs. In this paper, a hierarchical multi-agent training framework is proposed to solve these problems, which categorizes UAV formations into two types of intelligent agents: virtual centroid agents and UAVs within the formation. The centroid agents are responsible for controlling the overall movement of the formation. In contrast, the UAVs within the formation are capable of flexibly adjusting their speed and heading on this basis. By constructing a confrontation scenario involving multiple formations and types of UAVs, the effectiveness of the hierarchical training framework is experimentally validated. The average winning rate against UAVs controlled by strategy methods based on rule construction reaches 97%, enabling both formation variations and tactical evolutions.

引用

页码：11 / 18

页数：8

共 8 条

[1] A coordination architecture for spacecraft formation control
Beard, RW
Lawton, J
Hadaegh, FY
[J]. IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2001, 9 (06) : 777 - 790
[2] Burgin G. H., 1988, Rule-based air combat simulation
[3] De la Cruz C, 2006, IEEE IND ELEC, P1543
[4] Geng WX, 2014, CHIN CONT DECIS CONF, P135, DOI 10.1109/CCDC.2014.6852132
[5] Nguyen Khanh, arXiv
[6] Beyond-Visual-Range Air Combat Tactics Auto-Generation by Reinforcement Learning
Piao, Haiyin
Sun, Zhixiao
Meng, Guanglei
Chen, Hechang
Qu, Bohao
Lang, Kuijun
Sun, Yang
Yang, Shengqi
Peng, Xuanqi
[J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
[7] [施伟 Shi Wei], 2021, [自动化学报, Acta Automatica Sinica], V47, P1610
[8] [杨爱武 Yang Aiwu], 2020, [飞行力学, Flight Dynamics], V38, P87

← 1 →