Research on Multi UAV Algorithm Based on Evolutionary Reinforcement Learning

被引：0

作者：

Huang, Jingyi ^{[1
]}

Cui, Yujie ^{[1
]}

Wu, Shuying ^{[1
]}

Yang, Ziyi ^{[1
]}

Li, Bo ^{[1
]}

Wang, Geng ^{[1
]}

机构：

[1] Northwestern Polytech Univ, 127 Youyi West Rd, Xian, Shaanxi, Peoples R China

来源：

INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2024, PT II | 2025年 / 15202卷

关键词：

Reinforcement Learning; IDQN; Evolutionary Learning; Generalization; Sparse Rewards;

D O I：

10.1007/978-981-96-0774-7_33

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper explores and investigates the lack of exploration performance and generalization performance common to multi-intelligence reinforcement learning algorithms during multi-UAV cooperative reconnaissance to investigate the problem. The principle of evolutionary learning is proposed to improve the performance of the algorithms. Unlike traditional deep reinforcement learning, which typically struggles with tasks that have few rewards, evolutionary approaches excel in this context by reducing the risk of premature convergence. The key advantage is the inherent ability of evolutionary methods to incorporate prior knowledge, which significantly improves the algorithm's search and generalization capabilities. By integrating these evolutionary mechanisms, this research aims to improve the robustness and adaptability of IDQN algorithms. In this study, Airsim is used as a simulation experiment environment to meet the requirements of complex dynamic environments, and the experimental results show that evolutionary reinforcement learning effectively improves the performance of UAV model reconnaissance and achieves more effective decision-making in complex dynamic environments.

引用

页码：447 / 459

页数：13

共 50 条

[31] Reinforcement learning guided multi-objective differential evolutionary algorithm for product change paths
Song, Xian-Fang
Yang, Yang
Zhang, Yong
Zheng, Rui-Zhao
Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2025, 42 (01): : 109 - 117
[32] Dynamic Attention Network for Multi-UAV Reinforcement Learning
Xu, Dongsheng
Wu, Shang
INTERNATIONAL CONFERENCE ON ALGORITHMS, HIGH PERFORMANCE COMPUTING, AND ARTIFICIAL INTELLIGENCE (AHPCAI 2021), 2021, 12156
[33] A Multi-agent Reinforcement Learning Algorithm Based on Stackelberg Game
Cheng, Chi
Zhu, Zhangqing
Xin, Bo
Chen, Chunlin
2017 6TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS (DDCLS), 2017, : 727 - 732
[34] An Improved Multi-objective Optimization Algorithm Based on Reinforcement Learning
Liu, Jun
Zhou, Yi
Qiu, Yimin
Li, Zhongfeng
ADVANCES IN SWARM INTELLIGENCE, ICSI 2022, PT I, 2022, : 501 - 513
[35] Multi-Agent Reinforcement Learning Algorithm Based on Action Prediction
童亮
陆际联
Journal of Beijing Institute of Technology(English Edition), 2006, (02) : 133 - 137
[36] Research on supply chain efficiency optimization algorithm based on reinforcement learning
Zhou, Tao
Xie, Lihua
Zou, Chunbin
Tian, Yong
ADVANCES IN CONTINUOUS AND DISCRETE MODELS, 2024, 2024 (01):
[37] A novel modified search and rescue optimization algorithm based on reinforcement learning for UAV path planning
Zhou W.-J.
Zhang C.-Q.
Tang W.-D.
Yi Y.-H.
Liu W.-W.
Qin W.-D.
Kongzhi yu Juece/Control and Decision, 2024, 39 (04): : 1203 - 1211
[38] Research on Underwater Gliders Path Tracking Based on Reinforcement Learning Algorithm
Shi Q.
Zhang R.
Zhang L.
Lan S.
Zhongguo Jixie Gongcheng/China Mechanical Engineering, 2023, 34 (09): : 1100 - 1110
[39] A novelty-search-based evolutionary reinforcement learning algorithm for continuous optimization problems
Chengyu Hu
Rui Qiao
Wenyin Gong
Xuesong Yan
Ling Wang
Memetic Computing, 2022, 14 : 451 - 460
[40] A novelty-search-based evolutionary reinforcement learning algorithm for continuous optimization problems
Hu, Chengyu
Qiao, Rui
Gong, Wenyin
Yan, Xuesong
Wang, Ling
MEMETIC COMPUTING, 2022, 14 (04) : 451 - 460

← 1 2 3 4 5 →