Q-Learning-based Multi-UAV Cooperative Path Planning Method

被引：0

作者：

Yin Y. ^{[1
,2
]}

Wang X. ^{[1
]}

Zhou J. ^{[3
]}

机构：

[1] School of Aerospace Engineering, Beijing Institute of Technology, Beijing

[2] Beijing Institute of Electronic System Engineering, Beijing

[3] Xi'an Modern Control Technology Research Institute, Shaanxi, Xi'an

来源：

Binggong Xuebao/Acta Armamentarii | 2023年 / 44卷 / 02期

关键词：

collision avoidance; multiple UAVs; path planning; Q-learning; time coordination;

D O I：

10.12382/bgxb.2021.0606

中图分类号：

学科分类号：

摘要：

To solve the path planning problem of multiple UAVs' synchronous arrival at the target, the battlefield environment model and the Markov decision process model of the path planning for a single UAV is established, and the optimal path is calculated based on the Q-learning algorithm. With this algorithm, the Q-table is obtained and used to calculate the shortest path of each UAV and the cooperative range. Then the time-coordinated paths is obtained by adjusting the action selection strategy of the circumventing UAVs. Considering the collision avoidance problem of multiple UAVs, the partical replanning area is determined by designing retreat parameters, and based on the deep reinforcement learning theory, the neural network is used to replace Q-table to re-plan the partical path for UAVs, which can avoid the problem of dimensional explosion. As for the previously unexplored obstacles, the obstacle matrix is designed based on the idea of the artificial potential field theory, which is then superimposed on the original Q-table to realize collision avoidance for the unexplored obstacle. The simulation results verify that with the proposed reinforcement learning path planning method, the coordinated paths with time coordination and collision avoidance can be obtained, and the previously unexplored obstacles in the simulation can be avoided as well. Compared with A* algorithm, the proposed method can achieve higher efficiency for online application problems. © 2023 China Ordnance Society. All rights reserved.

引用

页码：484 / 495

页数：11

共 20 条

[1] CHEN Z Y, WEI W S, CHEN W C., Reinforcement learning-based intelligent guidance law for cooperative attack of multiple missiles [J], Acta Armamentarii, 42, 8, pp. 1638-1647, (2021)
[2] LUO W, TANG Q, FU C H, Et al., Deep-sarsa based multi-UAV path planning and obstacle avoidance in a dynamic environment [C], Proceedings of International Conference on Swarm Intelligence, pp. 102-111, (2018)
[3] CHEN X, AI Y D., Multi-UAV Path planning based on improved neural network[C], Proceedings of Chinese Control and Decision Conference, pp. 354-359, (2018)
[4] LIU X J, GU Q, YANG C L., Path planning of multi-cruise missile based on particle swarm optimization [C], Proceedings of 2019 International Conference on Sensing, Diagnostics, Prognostics, and Control, pp. 910-912, (2019)
[5] DU Y, PENG Y, SHAO S K, Et al., Cooperative path planning of multi-unmanned aerial vehicle based on improved particle swarm optimization [J], Science Technology and Engineering, 20, 32, pp. 13258-13264, (2020)
[6] WANG H B, HAO C, ZHANG P, Et al., Path planning of mobile robots based on A<sup>*</sup> algorithm and artificial potential field algorithm, China Mechanical Engineering, 30, 20, pp. 2489-2496, (2019)
[7] CAUSA F, FASANO G, GRASSI M., Multi-UAV path planning for autonomous missions in mixed GNSS coverage scenarios [J], Sensors, 18, 12, (2018)
[8] LAI D C, DAI J Y., Research on multi-UAV path planning and obstacle avoidance based on improved artificial potential field method, Proceedings of the 2020 3rd International Conference on Mechatronics, Robotics and Automation, pp. 84-88, (2020)
[9] DU N N, CHEN J, MA B, Et al., Optimization method for coverage path planning of multi-solar powered UAVs, Acta Aeronautica et Astronautica Sinica, 42, 6, (2021)
[10] WANG L S, ZHANG X L, DENG P Y, Et al., An energy-balanced path planning algorithm for multiple ferrying UAVs based on GA, International Journal of Aerospace Engineering, 20, pp. 1-15, (2020)

← 1 2 →