A novel reinforcement learning based grey wolf optimizer algorithm for unmanned aerial vehicles (UAVs) path planning

Cited by: 223

Authors:

Qu, Chengzhi [1]

Gai, Wendong [1]

Zhong, Maiying [1]

Zhang, Jing [1]

Affiliations:

[1] Shandong Univ Sci & Technol, Qingdao 266590, Peoples R China

Source:

APPLIED SOFT COMPUTING | 2020 / Vol. 89

Keywords:

Unmanned aerial vehicles (UAVs); Three-dimensional path planning; Reinforcement learning; Grey wolf optimizer;

DOI:

10.1016/j.asoc.2020.106099

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Unmanned aerial vehicles (UAVs) are used in a wide range of areas, and their applications require a high-quality path planning method. However, many algorithms reported in the literature may not be feasible or efficient, especially in complex three-dimensional flight environments. This paper presents a novel reinforcement learning based grey wolf optimizer algorithm, called RLGWO, to solve this problem. In the proposed algorithm, reinforcement learning is embedded so that each individual switches operations adaptively according to its accumulated performance. Because the algorithm is designed for UAV path planning, four operations are introduced for each individual: exploration, exploitation, geometric adjustment, and optimal adjustment. In addition, a cubic B-spline curve is used to smooth the generated flight route and make the planned path suitable for UAVs. Simulation results show that the RLGWO algorithm can successfully acquire a feasible and effective route in complicated environments. (C) 2020 Elsevier B.V. All rights reserved.


Pages: 12
