Research on multi-UAV task decision-making based on improved MADDPG algorithm and transfer learning

被引:13
|
作者
Li, Bo [1 ]
Liang, Shiyang [1 ]
Gan, Zhigang [1 ]
Chen, Daqing [2 ]
Gao, Peixin [1 ]
机构
[1] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710072, Peoples R China
[2] London South Bank Univ, Sch Engn, London SE1 0AA, England
关键词
multi-UAV task decision; improved MADDPG algorithm; two-layer experience pool; transfer learning;
D O I
10.1504/IJBIC.2021.118087
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
At present, the intelligent algorithms of multi-UAV task decision-making have been suffering some major issues, such as, slow learning speed and poor generalisation capability, and these issues have made it difficult to obtain expected learning results within a reasonable time and to apply a trained model in a new environment. To address these problems, an improved algorithm, namely PMADDPG, based on multi-agent deep deterministic policy gradient (MADDPG) is proposed in this paper. This algorithm adopts a two-layer experience pool structure in order to achieve the priority experience replay. Experiences are stored in an experience pool of the first layer, and then, experiences more conducive to training and learning are selected according to priority criteria and put into an experience pool of the second layer. Furthermore, the experiences from the experience pool of the second layer are selected for model training based on PMADDPG algorithm. In addition, a model-based environment transfer learning method is designed to improve the generalisation capability of the algorithm. Comparative experiments have shown that, compared with MADDPG algorithm, proposed algorithms can scientifically improve the learning speed, task success rate and generalisation capability.
引用
收藏
页码:82 / 91
页数:10
相关论文
共 50 条
  • [31] Decision-making of multi-UAV combat game via enhanced competitive learning pigeon-inspired optimization
    Lei Y.
    Duan H.
    Zhongguo Kexue Jishu Kexue/Scientia Sinica Technologica, 2024, 54 (01): : 136 - 148
  • [32] A Method of Multi-UAV Cooperative Task Assignment Based on Reinforcement Learning
    Zhao, Xiaohu
    Jiang, Hanli
    An, Chenyang
    Wu, Ruocheng
    Guo, Yijun
    Yang, Daquan
    MOBILE INFORMATION SYSTEMS, 2022, 2022
  • [33] Decision-Making Method of Multi-UAV Cooperate Air Combat Under Uncertain Environment
    Jian, Jialong
    Chen, Yong
    Li, Qiuni
    Li, Hongbo
    Zheng, Xiaokang
    Han, Chongchong
    IEEE JOURNAL ON MINIATURIZATION FOR AIR AND SPACE SYSTEMS, 2024, 5 (03): : 138 - 148
  • [34] Multi-UAV cooperative task assignment based on multi-strategy improved DBO
    Ran Zhang
    Xiao Chen
    Maoyuan Li
    Cluster Computing, 2025, 28 (3)
  • [35] Cooperative mapping task assignment of heterogeneous multi-UAV using an improved genetic algorithm
    Li, Jiaxuan
    Yang, Xuerong
    Yang, Yajun
    Liu, Xianglin
    KNOWLEDGE-BASED SYSTEMS, 2024, 296
  • [36] An Improved Chaotic Self-Adapting Monkey Algorithm for Multi-UAV Task Assignment
    Cui, Yujuan
    IEEE JOURNAL ON MINIATURIZATION FOR AIR AND SPACE SYSTEMS, 2024, 5 (01): : 9 - 15
  • [37] A UAV Maneuver Decision-Making Algorithm for Autonomous Airdrop Based on Deep Reinforcement Learning
    Li, Ke
    Zhang, Kun
    Zhang, Zhenchong
    Liu, Zekun
    Hua, Shuai
    He, Jianliang
    SENSORS, 2021, 21 (06)
  • [38] Dynamic Selection Method for Cooperative Decision-Making Center of Multi-UAV System based on Cloud Trust Model
    Xu, Jie
    Guo, Qing
    Li, Zhaoxia
    PROCEEDINGS OF 2018 IEEE 3RD ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC 2018), 2018, : 922 - 926
  • [39] Maneuvering decision-making of multi-UAV attack-defence confrontation based on PER-MATD3
    Fu X.
    Xu Z.
    Zhu J.
    Wang N.
    Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2023, 44 (07):
  • [40] Enhancing multi-UAV air combat decision making via hierarchical reinforcement learning
    Wang, Huan
    Wang, Jintao
    SCIENTIFIC REPORTS, 2024, 14 (01)