Research on Multi UAV Algorithm Based on Evolutionary Reinforcement Learning

被引:0
作者
Huang, Jingyi [1 ]
Cui, Yujie [1 ]
Wu, Shuying [1 ]
Yang, Ziyi [1 ]
Li, Bo [1 ]
Wang, Geng [1 ]
机构
[1] Northwestern Polytech Univ, 127 Youyi West Rd, Xian, Shaanxi, Peoples R China
来源
INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2024, PT II | 2025年 / 15202卷
关键词
Reinforcement Learning; IDQN; Evolutionary Learning; Generalization; Sparse Rewards;
D O I
10.1007/978-981-96-0774-7_33
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper explores and investigates the lack of exploration performance and generalization performance common to multi-intelligence reinforcement learning algorithms during multi-UAV cooperative reconnaissance to investigate the problem. The principle of evolutionary learning is proposed to improve the performance of the algorithms. Unlike traditional deep reinforcement learning, which typically struggles with tasks that have few rewards, evolutionary approaches excel in this context by reducing the risk of premature convergence. The key advantage is the inherent ability of evolutionary methods to incorporate prior knowledge, which significantly improves the algorithm's search and generalization capabilities. By integrating these evolutionary mechanisms, this research aims to improve the robustness and adaptability of IDQN algorithms. In this study, Airsim is used as a simulation experiment environment to meet the requirements of complex dynamic environments, and the experimental results show that evolutionary reinforcement learning effectively improves the performance of UAV model reconnaissance and achieves more effective decision-making in complex dynamic environments.
引用
收藏
页码:447 / 459
页数:13
相关论文
共 50 条
  • [21] Research on Vehicle Control Algorithm Based on Distributed Reinforcement Learning
    Liu W.
    Xiang Z.
    Liu W.
    Qi D.
    Wang Z.
    Qiche Gongcheng/Automotive Engineering, 2023, 45 (09): : 1637 - 1645
  • [22] UAV Resource Cooperation Based on Reinforcement Learning
    Shan, Mingang
    Xiong, Jian
    Liu, Bo
    Shi, Zhiping
    Li, Xia
    Miao, Ningjie
    2021 IEEE INTERNATIONAL SYMPOSIUM ON BROADBAND MULTIMEDIA SYSTEMS AND BROADCASTING (BMSB), 2021,
  • [23] Evolutionary computation and reinforcement learning integrated algorithm for distributed
    Li, Rui
    Wang, Ling
    Gong, Wenyin
    Chen, Jingfang
    Pan, Zixiao
    Wu, Yuting
    Yu, Yang
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 135
  • [24] Transformer-Based Reinforcement Learning for Scalable Multi-UAV Area Coverage
    Chen, Dezhi
    Qi, Qi
    Fu, Qianlong
    Wang, Jingyu
    Liao, Jianxin
    Han, Zhu
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (08) : 10062 - 10077
  • [25] Multi-Agent Reinforcement Learning-Based Resource Allocation for UAV Networks
    Cui, Jingjing
    Liu, Yuanwei
    Nallanathan, Arumugam
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (02) : 729 - 743
  • [26] Reinforcement Learning based Approach for Multi-UAV Cooperative Searching in Unknown Environments
    Yue, Wei
    Guan, Xianhe
    Xi, Yun
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 2018 - 2023
  • [27] Multi-Agent Reinforcement Learning Based UAV Swarm Communications Against Jamming
    Lv, Zefang
    Xiao, Liang
    Du, Yousong
    Niu, Guohang
    Xing, Chengwen
    Xu, Wenyuan
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (12) : 9063 - 9075
  • [28] Optimal formation tracking control based on reinforcement learning for multi-UAV systems
    Wang, Weizhen
    Chen, Xin
    Jia, Jiangbo
    Wu, Kaili
    Xie, Mingyang
    CONTROL ENGINEERING PRACTICE, 2023, 141
  • [29] Reinforcement learning-based multi-strategy cuckoo search algorithm for 3D UAV path planning
    Yu, Xiaobing
    Luo, Wenguan
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 223
  • [30] Securing UAV Communication Based on Multi-Agent Deep Reinforcement Learning in the Presence of Smart UAV Eavesdropper
    Wen, Chaoyang
    Fang, Yuan
    Qiu, Ling
    2022 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2022, : 1164 - 1169