Research on Multi UAV Algorithm Based on Evolutionary Reinforcement Learning

Cited by: 0
Authors
Huang, Jingyi [1]
Cui, Yujie [1]
Wu, Shuying [1]
Yang, Ziyi [1]
Li, Bo [1]
Wang, Geng [1]
Affiliations
[1] Northwestern Polytech Univ, 127 Youyi West Rd, Xian, Shaanxi, Peoples R China
Source
INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2024, PT II | 2025 / Vol. 15202
Keywords
Reinforcement Learning; IDQN; Evolutionary Learning; Generalization; Sparse Rewards;
DOI
10.1007/978-981-96-0774-7_33
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper investigates the weak exploration and generalization performance common to multi-agent reinforcement learning algorithms in multi-UAV cooperative reconnaissance. It proposes applying the principle of evolutionary learning to improve these algorithms. Traditional deep reinforcement learning typically struggles with sparse-reward tasks, whereas evolutionary approaches do well in this setting because they reduce the risk of premature convergence. A key advantage is the inherent ability of evolutionary methods to incorporate prior knowledge, which significantly improves the algorithm's search and generalization capabilities. By integrating these evolutionary mechanisms, the research aims to improve the robustness and adaptability of IDQN algorithms. AirSim is used as the simulation environment to meet the requirements of complex dynamic environments. The experimental results show that evolutionary reinforcement learning effectively improves the reconnaissance performance of the UAV model and achieves more effective decision-making in complex dynamic environments.
Pages: 447-459
Page count: 13