Research on multi-UAV task decision-making based on improved MADDPG algorithm and transfer learning

被引:13
|
作者
Li, Bo [1 ]
Liang, Shiyang [1 ]
Gan, Zhigang [1 ]
Chen, Daqing [2 ]
Gao, Peixin [1 ]
机构
[1] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710072, Peoples R China
[2] London South Bank Univ, Sch Engn, London SE1 0AA, England
关键词
multi-UAV task decision; improved MADDPG algorithm; two-layer experience pool; transfer learning;
D O I
10.1504/IJBIC.2021.118087
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
At present, the intelligent algorithms of multi-UAV task decision-making have been suffering some major issues, such as, slow learning speed and poor generalisation capability, and these issues have made it difficult to obtain expected learning results within a reasonable time and to apply a trained model in a new environment. To address these problems, an improved algorithm, namely PMADDPG, based on multi-agent deep deterministic policy gradient (MADDPG) is proposed in this paper. This algorithm adopts a two-layer experience pool structure in order to achieve the priority experience replay. Experiences are stored in an experience pool of the first layer, and then, experiences more conducive to training and learning are selected according to priority criteria and put into an experience pool of the second layer. Furthermore, the experiences from the experience pool of the second layer are selected for model training based on PMADDPG algorithm. In addition, a model-based environment transfer learning method is designed to improve the generalisation capability of the algorithm. Comparative experiments have shown that, compared with MADDPG algorithm, proposed algorithms can scientifically improve the learning speed, task success rate and generalisation capability.
引用
收藏
页码:82 / 91
页数:10
相关论文
共 50 条
  • [1] An Improved PSO Algorithm for Solving multi-UAV Cooperative Reconnaissance Task Decision-Making Problem
    Zhang Yao-zhong
    Li Ji-wei
    Hu Bo
    Zhang Jian-dong
    2016 IEEE/CSAA INTERNATIONAL CONFERENCE ON AIRCRAFT UTILITY SYSTEMS (AUS), 2016, : 434 - 437
  • [2] Research on autonomous formation of Multi-UAV based on MADDPG algorithm
    Zhang, Yaozhong
    Wu, Zhuoran
    Ma, Yunhong
    Sun, Ruiyang
    Xu, Zixiang
    2022 IEEE 17TH INTERNATIONAL CONFERENCE ON CONTROL & AUTOMATION, ICCA, 2022, : 249 - 254
  • [3] Research on heterogeneous multi-UAV collaborative decision-making method based on improved PPO
    Xu, Lin
    Zhang, Xinmiao
    Xiao, Dong
    Liu, Beihong
    Liu, Aixue
    APPLIED INTELLIGENCE, 2024, 54 (20) : 9892 - 9905
  • [4] Optimal Task Decision-Making for Heterogeneous Multi-UAV Cooperation Reconnaissance
    1600, Northwestern Polytechnical University (35):
  • [5] Collaborative Decision-Making Method for Multi-UAV Based on Multiagent Reinforcement Learning
    Li, Shaowei
    Jia, Yuhong
    Yang, Fan
    Qin, Qingyang
    Gao, Hui
    Zhou, Yaoming
    IEEE ACCESS, 2022, 10 : 91385 - 91396
  • [6] Multi-UAV Task Assignment based on Satisficing Decision Algorithm
    Ye, Xinning
    Lei, Zhongkui
    Liu, Kun
    2021 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS AND COMPUTER ENGINEERING (ICCECE), 2021, : 439 - 442
  • [7] MULTI-UAV Task Allocation Based on Improved Genetic Algorithm
    Wu, Xueli
    Yin, Yanan
    Xu, Lei
    Wu, Xiaojing
    Meng, Fanhua
    Zhen, Ran
    IEEE ACCESS, 2021, 9 : 100369 - 100379
  • [8] Multi-UAV Cooperative Reconnaissance Decision-Making Based on Lingo
    Zhang Yao-zhong
    Xie Song-yan
    2017 2ND ASIA-PACIFIC CONFERENCE ON INTELLIGENT ROBOT SYSTEMS (ACIRS), 2017, : 210 - 213
  • [9] Multi-UAV cooperative maneuver decision-making for pursuitevasion using improved MADRL
    Delin Luo
    Zihao Fan
    Ziyi Yang
    Yang Xu
    Defence Technology , 2024, (05) : 187 - 197
  • [10] Research on Multi-UAV Task Assignment Based on a Multi-Objective, Improved Brainstorming Optimization Algorithm
    Wang, Xiaofang
    Yin, Shi
    Luo, Lianyong
    Qiao, Xin
    APPLIED SCIENCES-BASEL, 2024, 14 (06):