Research on Multi UAV Algorithm Based on Evolutionary Reinforcement Learning

被引:0
作者
Huang, Jingyi [1 ]
Cui, Yujie [1 ]
Wu, Shuying [1 ]
Yang, Ziyi [1 ]
Li, Bo [1 ]
Wang, Geng [1 ]
机构
[1] Northwestern Polytech Univ, 127 Youyi West Rd, Xian, Shaanxi, Peoples R China
来源
INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2024, PT II | 2025年 / 15202卷
关键词
Reinforcement Learning; IDQN; Evolutionary Learning; Generalization; Sparse Rewards;
D O I
10.1007/978-981-96-0774-7_33
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper explores and investigates the lack of exploration performance and generalization performance common to multi-intelligence reinforcement learning algorithms during multi-UAV cooperative reconnaissance to investigate the problem. The principle of evolutionary learning is proposed to improve the performance of the algorithms. Unlike traditional deep reinforcement learning, which typically struggles with tasks that have few rewards, evolutionary approaches excel in this context by reducing the risk of premature convergence. The key advantage is the inherent ability of evolutionary methods to incorporate prior knowledge, which significantly improves the algorithm's search and generalization capabilities. By integrating these evolutionary mechanisms, this research aims to improve the robustness and adaptability of IDQN algorithms. In this study, Airsim is used as a simulation experiment environment to meet the requirements of complex dynamic environments, and the experimental results show that evolutionary reinforcement learning effectively improves the performance of UAV model reconnaissance and achieves more effective decision-making in complex dynamic environments.
引用
收藏
页码:447 / 459
页数:13
相关论文
共 50 条
  • [31] Reinforcement learning guided multi-objective differential evolutionary algorithm for product change paths
    Song, Xian-Fang
    Yang, Yang
    Zhang, Yong
    Zheng, Rui-Zhao
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2025, 42 (01): : 109 - 117
  • [32] Dynamic Attention Network for Multi-UAV Reinforcement Learning
    Xu, Dongsheng
    Wu, Shang
    INTERNATIONAL CONFERENCE ON ALGORITHMS, HIGH PERFORMANCE COMPUTING, AND ARTIFICIAL INTELLIGENCE (AHPCAI 2021), 2021, 12156
  • [33] A Multi-agent Reinforcement Learning Algorithm Based on Stackelberg Game
    Cheng, Chi
    Zhu, Zhangqing
    Xin, Bo
    Chen, Chunlin
    2017 6TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS (DDCLS), 2017, : 727 - 732
  • [34] An Improved Multi-objective Optimization Algorithm Based on Reinforcement Learning
    Liu, Jun
    Zhou, Yi
    Qiu, Yimin
    Li, Zhongfeng
    ADVANCES IN SWARM INTELLIGENCE, ICSI 2022, PT I, 2022, : 501 - 513
  • [35] Multi-Agent Reinforcement Learning Algorithm Based on Action Prediction
    童亮
    陆际联
    Journal of Beijing Institute of Technology(English Edition), 2006, (02) : 133 - 137
  • [36] Research on supply chain efficiency optimization algorithm based on reinforcement learning
    Zhou, Tao
    Xie, Lihua
    Zou, Chunbin
    Tian, Yong
    ADVANCES IN CONTINUOUS AND DISCRETE MODELS, 2024, 2024 (01):
  • [37] A novel modified search and rescue optimization algorithm based on reinforcement learning for UAV path planning
    Zhou W.-J.
    Zhang C.-Q.
    Tang W.-D.
    Yi Y.-H.
    Liu W.-W.
    Qin W.-D.
    Kongzhi yu Juece/Control and Decision, 2024, 39 (04): : 1203 - 1211
  • [38] Research on Underwater Gliders Path Tracking Based on Reinforcement Learning Algorithm
    Shi Q.
    Zhang R.
    Zhang L.
    Lan S.
    Zhongguo Jixie Gongcheng/China Mechanical Engineering, 2023, 34 (09): : 1100 - 1110
  • [39] A novelty-search-based evolutionary reinforcement learning algorithm for continuous optimization problems
    Chengyu Hu
    Rui Qiao
    Wenyin Gong
    Xuesong Yan
    Ling Wang
    Memetic Computing, 2022, 14 : 451 - 460
  • [40] A novelty-search-based evolutionary reinforcement learning algorithm for continuous optimization problems
    Hu, Chengyu
    Qiao, Rui
    Gong, Wenyin
    Yan, Xuesong
    Wang, Ling
    MEMETIC COMPUTING, 2022, 14 (04) : 451 - 460