Hierarchical Multi-Agent Training Based on Reinforcement Learning

被引:1
作者
Wang, Guanghua [1 ]
Li, Wenjie [2 ]
Wu, Zhanghua [3 ]
Guo, Xian [1 ]
机构
[1] Nankai Univ, Inst Robot & Automat Informat Syst, Tianjin, Peoples R China
[2] State Grid Tianjin Elect Power Co, Tianjin, Peoples R China
[3] Jiangsu Automat Res Inst, Lianyungang, Jiangsu, Peoples R China
来源
2024 9TH ASIA-PACIFIC CONFERENCE ON INTELLIGENT ROBOT SYSTEMS, ACIRS | 2024年
关键词
Multi-Agent Systems; Reinforcement Learning; Multi-Agent Proximal Policy Optimization Algorithm; Formation Confrontation;
D O I
10.1109/ACIRS62330.2024.10684909
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
In the current multi-UAV adversarial games, issues exist such as the instability and difficulty in learning distributed strategies, as well as a lack of coordinated formation UAVs. In this paper, a hierarchical multi-agent training framework is proposed to solve these problems, which categorizes UAV formations into two types of intelligent agents: virtual centroid agents and UAVs within the formation. The centroid agents are responsible for controlling the overall movement of the formation. In contrast, the UAVs within the formation are capable of flexibly adjusting their speed and heading on this basis. By constructing a confrontation scenario involving multiple formations and types of UAVs, the effectiveness of the hierarchical training framework is experimentally validated. The average winning rate against UAVs controlled by strategy methods based on rule construction reaches 97%, enabling both formation variations and tactical evolutions.
引用
收藏
页码:11 / 18
页数:8
相关论文
共 8 条
  • [1] A coordination architecture for spacecraft formation control
    Beard, RW
    Lawton, J
    Hadaegh, FY
    [J]. IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2001, 9 (06) : 777 - 790
  • [2] Burgin G. H., 1988, Rule-based air combat simulation
  • [3] De la Cruz C, 2006, IEEE IND ELEC, P1543
  • [4] Geng WX, 2014, CHIN CONT DECIS CONF, P135, DOI 10.1109/CCDC.2014.6852132
  • [5] Nguyen Khanh, arXiv
  • [6] Beyond-Visual-Range Air Combat Tactics Auto-Generation by Reinforcement Learning
    Piao, Haiyin
    Sun, Zhixiao
    Meng, Guanglei
    Chen, Hechang
    Qu, Bohao
    Lang, Kuijun
    Sun, Yang
    Yang, Shengqi
    Peng, Xuanqi
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [7] [施伟 Shi Wei], 2021, [自动化学报, Acta Automatica Sinica], V47, P1610
  • [8] [杨爱武 Yang Aiwu], 2020, [飞行力学, Flight Dynamics], V38, P87