Research on Multi UAV Algorithm Based on Evolutionary Reinforcement Learning

Cited by: 0

Authors:

Huang, Jingyi ^{[1]}

Cui, Yujie ^{[1]}

Wu, Shuying ^{[1]}

Yang, Ziyi ^{[1]}

Li, Bo ^{[1]}

Wang, Geng ^{[1]}

Affiliations:

[1] Northwestern Polytech Univ, 127 Youyi West Rd, Xian, Shaanxi, Peoples R China

Source:

INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2024, PT II | 2025 / Vol. 15202

Keywords:

Reinforcement Learning; IDQN; Evolutionary Learning; Generalization; Sparse Rewards;

DOI:

10.1007/978-981-96-0774-7_33

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper investigates the weak exploration and generalization performance common to multi-agent reinforcement learning algorithms in multi-UAV cooperative reconnaissance. Evolutionary learning is applied to improve the performance of these algorithms. Traditional deep reinforcement learning typically struggles with sparse-reward tasks, whereas evolutionary approaches perform well in this setting because they reduce the risk of premature convergence. Their key advantage is an inherent ability to incorporate prior knowledge, which significantly improves the algorithm's search and generalization capabilities. By integrating these evolutionary mechanisms, this research aims to improve the robustness and adaptability of IDQN algorithms. AirSim is used as the simulation environment to meet the requirements of complex dynamic environments. The experimental results show that evolutionary reinforcement learning improves the reconnaissance performance of the UAV model and yields more effective decision-making in complex dynamic environments.


Pages: 447-459

Number of pages: 13
