Research on multi-UAV task decision-making based on improved MADDPG algorithm and transfer learning

被引:13
|
作者
Li, Bo [1 ]
Liang, Shiyang [1 ]
Gan, Zhigang [1 ]
Chen, Daqing [2 ]
Gao, Peixin [1 ]
机构
[1] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710072, Peoples R China
[2] London South Bank Univ, Sch Engn, London SE1 0AA, England
关键词
multi-UAV task decision; improved MADDPG algorithm; two-layer experience pool; transfer learning;
D O I
10.1504/IJBIC.2021.118087
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
At present, the intelligent algorithms of multi-UAV task decision-making have been suffering some major issues, such as, slow learning speed and poor generalisation capability, and these issues have made it difficult to obtain expected learning results within a reasonable time and to apply a trained model in a new environment. To address these problems, an improved algorithm, namely PMADDPG, based on multi-agent deep deterministic policy gradient (MADDPG) is proposed in this paper. This algorithm adopts a two-layer experience pool structure in order to achieve the priority experience replay. Experiences are stored in an experience pool of the first layer, and then, experiences more conducive to training and learning are selected according to priority criteria and put into an experience pool of the second layer. Furthermore, the experiences from the experience pool of the second layer are selected for model training based on PMADDPG algorithm. In addition, a model-based environment transfer learning method is designed to improve the generalisation capability of the algorithm. Comparative experiments have shown that, compared with MADDPG algorithm, proposed algorithms can scientifically improve the learning speed, task success rate and generalisation capability.
引用
收藏
页码:82 / 91
页数:10
相关论文
共 50 条
  • [41] Enhancing multi-UAV air combat decision making via hierarchical reinforcement learning
    Huan Wang
    Jintao Wang
    Scientific Reports, 14
  • [42] Research on Multi-UAV Loading Multi-type Sensors Cooperative Reconnaissance Task Planning Based on Genetic Algorithm
    Li, Ji-Ting
    Zhang, Sheng
    Zheng, Zhan
    Xing, Li-Ning
    He, Ren-Jie
    INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2017, PT I, 2017, 10361 : 485 - 500
  • [43] Improved Multi-Objective Particle Swarm Optimization Algorithm Based on Area Division With Application in Multi-UAV Task Assignment
    Wang, Yafei
    Zhang, Liang
    IEEE ACCESS, 2023, 11 : 123519 - 123530
  • [44] Radar Anti-Jamming Decision-Making Method Based on DDPG-MADDPG Algorithm
    Wei, Jingjing
    Wei, Yinsheng
    Yu, Lei
    Xu, Rongqing
    REMOTE SENSING, 2023, 15 (16)
  • [45] Multi-UAV Formation Control in Complex Conditions Based on Improved Consistency Algorithm
    Tao, Canhui
    Zhang, Ru
    Song, Zhiping
    Wang, Baoshou
    Jin, Yang
    DRONES, 2023, 7 (03)
  • [46] Track planning of multi-UAV cooperative reconnaissance based on improved genetic algorithm
    Li W.
    Hu Y.
    Pang Q.
    Li Y.
    Jia H.
    Hu, Yongjiang (huyongjiang_jxxy@163.com), 1600, Editorial Department of Journal of Chinese Inertial Technology (28): : 248 - 255
  • [47] PHM-Based Multi-UAV Task Assignment
    de Medeiros, Ivo Paixao
    Rodrigues, Leonardo Ramos
    Shiguemori, Elcio Hideiti
    Santos, Rafael
    Nascimento Junior, Cairo Lucio
    2014 8TH ANNUAL IEEE SYSTEMS CONFERENCE (SYSCON), 2014, : 42 - 49
  • [48] Research on Optimization Method of Multi-UAV Collaborative Task Planning
    Cao Ze-ling
    Wang Qi
    Yang Ye-qing
    2018 IEEE CSAA GUIDANCE, NAVIGATION AND CONTROL CONFERENCE (CGNCC), 2018,
  • [49] Multi-UAV Task Reallocation Based on Dynamic Window Consensus-Based Bundle Algorithm
    Shen, Junyi
    Zhou, Jiuli
    Bi, Wenhao
    2023 ASIA-PACIFIC INTERNATIONAL SYMPOSIUM ON AEROSPACE TECHNOLOGY, VOL I, APISAT 2023, 2024, 1050 : 352 - 361
  • [50] Multi-Dimensional Decision-Making for UAV Air Combat Based on Hierarchical Reinforcement Learning
    Zhang J.
    Wang D.
    Yang Q.
    Shi G.
    Lu Y.
    Zhang Y.
    Binggong Xuebao/Acta Armamentarii, 2023, 44 (06): : 1547 - 1563