Dynamic Attention Network for Multi-UAV Reinforcement Learning

Authors
Xu, Dongsheng [1]
Wu, Shang [1]
Affiliations
[1] Natl Univ Def Technol, Sci & Technol Parallel & Distributed Proc Lab, Coll Comp, Changsha, Hunan, Peoples R China
Source
INTERNATIONAL CONFERENCE ON ALGORITHMS, HIGH PERFORMANCE COMPUTING, AND ARTIFICIAL INTELLIGENCE (AHPCAI 2021) | 2021 / Vol. 12156
Keywords
MADDPG; Transfer learning; Attention; Reinforcement learning; LEVEL;
DOI
10.1117/12.2626437
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Recent methods for multi-agent reinforcement learning use deep neural networks and achieve state-of-the-art performance through dedicated network architectures and extensive training tricks. However, these deep reinforcement learning methods suffer from reproducibility issues, especially in transfer learning. Because the network input has a fixed size, it is difficult for existing network structures to transfer strategies learned at a small scale to a large scale. We argue that proper network architecture design is crucial to cross-scale reinforcement transfer learning. In this paper, we use transfer training with an attention network to solve multi-agent combat problems in unmanned aerial vehicle (UAV) aerial combat scenarios, and extend small-scale learning to large-scale complex scenarios. We combine the attention neural network with the MADDPG algorithm to process each agent's observations. Training starts from a small-scale multi-UAV combat scenario, and the number of UAVs is gradually increased. The experimental results show that methods for multi-agent UAV combat problems trained with attention transfer learning reach the target performance faster and perform better than the method without attention transfer learning.
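The fixed-input-size problem described in the abstract can be illustrated with a small sketch. This is not the authors' architecture: it assumes plain scaled dot-product attention, and every name in it (`attention_pool`, `w_query`, `w_key`, `w_value`, `OBS_DIM`, `EMBED_DIM`) is hypothetical. It only shows why attention pooling gives a policy network an input whose size does not depend on how many other UAVs are observed, which is the property that makes small-to-large transfer possible.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def matvec(matrix, vector):
    """Multiply a (rows x cols) matrix by a length-cols vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def attention_pool(own_obs, other_obs, w_query, w_key, w_value):
    """Pool any number (>= 1) of per-entity observations into one vector.

    Assumption: scaled dot-product attention with the agent's own
    observation as the query; the paper may use a different variant.
    """
    query = matvec(w_query, own_obs)
    keys = [matvec(w_key, obs) for obs in other_obs]
    values = [matvec(w_value, obs) for obs in other_obs]
    scale = math.sqrt(len(query))
    scores = [sum(q * k for q, k in zip(query, key)) / scale for key in keys]
    weights = softmax(scores)
    dim = len(values[0])
    # Weighted sum of values: the length is `dim` regardless of len(other_obs).
    return [sum(w * v[i] for w, v in zip(weights, values)) for i in range(dim)]


def random_matrix(rows, cols, rng):
    """Hypothetical stand-in for learned projection weights."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


def random_obs(count, dim, rng):
    """Hypothetical per-UAV observation vectors."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(dim)] for _ in range(count)]


rng = random.Random(0)
OBS_DIM, EMBED_DIM = 4, 8
w_query = random_matrix(EMBED_DIM, OBS_DIM, rng)
w_key = random_matrix(EMBED_DIM, OBS_DIM, rng)
w_value = random_matrix(EMBED_DIM, OBS_DIM, rng)

own = random_obs(1, OBS_DIM, rng)[0]
small_scene = random_obs(3, OBS_DIM, rng)   # small-scale scenario: 3 other UAVs
large_scene = random_obs(10, OBS_DIM, rng)  # larger scenario: 10 other UAVs

pooled_small = attention_pool(own, small_scene, w_query, w_key, w_value)
pooled_large = attention_pool(own, large_scene, w_query, w_key, w_value)
print(len(pooled_small), len(pooled_large))  # 8 8
```

Because the same three projection matrices serve both scenes, weights trained with few UAVs could in principle be reused when more are added, which is the kind of transfer the abstract describes.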
Pages: 6