UAV Path Planning Employing MPC-Reinforcement Learning Method Considering Collision Avoidance

Cited by: 11

Authors:

Ramezani, Mahya [1]

Habibi, Hamed [1]

Sanchez-Lopez, Jose Luis [1]

Voos, Holger [1]

Affiliations:

[1] Univ Luxembourg, Ctr Secur Reliabil & Trust, Luxembourg, Luxembourg

Source:

2023 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS, ICUAS | 2023

Funding:

European Union Horizon 2020;

Keywords:

path planning; reinforcement learning; model predictive control; LSTM network modeling; improved DDPG;

DOI:

10.1109/ICUAS57906.2023.10156232

Chinese Library Classification (CLC) number:

V [Aviation, Aerospace];

Discipline classification codes:

08 ; 0825 ;

Abstract:

In this paper, we tackle the problem of Unmanned Aerial Vehicle (UAV) path planning in complex and uncertain environments by designing a Model Predictive Control (MPC) scheme based on a Long Short-Term Memory (LSTM) network and integrating it into the Deep Deterministic Policy Gradient (DDPG) algorithm. In the proposed solution, LSTM-MPC operates as a deterministic policy within the DDPG network, and it leverages a predicting pool to store predicted future states and actions for improved robustness and efficiency. The use of the predicting pool also enables the initialization of the critic network, leading to improved convergence speed and reduced failure rate compared to traditional reinforcement learning and deep reinforcement learning methods. The effectiveness of the proposed solution is evaluated by numerical simulations.
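The idea of a predicting pool fed by a receding-horizon planner can be illustrated with a minimal sketch. This is not the authors' implementation: the LSTM dynamics model is replaced by a hand-written 2-D grid model (`predict`), the MPC optimizer is replaced by simple random shooting, and the DDPG actor and critic are omitted entirely. All names (`predict`, `stage_cost`, `mpc_plan`, `run_episode`), the obstacle radius, and the reward definition are assumptions made for illustration only.

```python
import math
import random
from collections import deque

# Hypothetical discrete action set: unit moves on a 2-D grid.
ACTIONS = [(1, 0), (-1, 0), (0, 1), (0, -1)]


def predict(state, action):
    """Stand-in for the learned LSTM model (assumption): simple kinematics."""
    return (state[0] + action[0], state[1] + action[1])


def stage_cost(state, goal, obstacle, radius=1.5):
    """Distance to the goal plus a large penalty near the obstacle."""
    penalty = 100.0 if math.dist(state, obstacle) < radius else 0.0
    return math.dist(state, goal) + penalty


def mpc_plan(state, goal, obstacle, horizon=4, samples=200, rng=None):
    """Random-shooting planner: return the cheapest sampled rollout.

    Each rollout is a list of predicted (state, action, reward, next_state)
    tuples, where reward is the negative stage cost.
    """
    if rng is None:
        rng = random.Random(0)
    best_cost, best_rollout = math.inf, None
    for _ in range(samples):
        s, total, rollout = state, 0.0, []
        for _ in range(horizon):
            a = rng.choice(ACTIONS)
            s_next = predict(s, a)
            c = stage_cost(s_next, goal, obstacle)
            rollout.append((s, a, -c, s_next))
            total += c
            s = s_next
        if total < best_cost:
            best_cost, best_rollout = total, rollout
    return best_rollout


def run_episode(start, goal, obstacle, max_steps=40, seed=0):
    """Apply the first planned action each step; keep all predictions in a pool."""
    rng = random.Random(seed)
    pool = deque(maxlen=1000)  # "predicting pool" of predicted transitions
    state, path = start, [start]
    for _ in range(max_steps):
        rollout = mpc_plan(state, goal, obstacle, rng=rng)
        pool.extend(rollout)                   # store the whole predicted horizon
        state = predict(state, rollout[0][1])  # execute only the first action
        path.append(state)
        if state == goal:
            break
    return path, pool


if __name__ == "__main__":
    path, pool = run_episode(start=(0, 0), goal=(5, 0), obstacle=(2, 0))
    print("steps taken:", len(path) - 1, "| pooled predictions:", len(pool))
```

In a full DDPG setup, tuples such as those collected in `pool` could serve as initial training targets for a critic before any real interaction data exists, which is one plausible reading of the critic initialization mentioned in the abstract.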

Citation

Pages: 507-514

Page count: 8

References: 26 in total

[1] Bohlin R., 2000, Proceedings 2000 ICRA. Millennium Conference. IEEE International Conference on Robotics and Automation. Symposia Proceedings (Cat. No.00CH37065), P521, DOI 10.1109/ROBOT.2000.844107
[2] Boyan Xin, 2022, 2022 7th International Conference on Control and Robotics Engineering (ICCRE), P102, DOI 10.1109/ICCRE55123.2022.9770257
[3] Bruce J, 2002, 2002 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-3, PROCEEDINGS, P2383, DOI 10.1109/IRDS.2002.1041624
[4] Cai, Wenqi; Kordabad, Arash B.; Esfahani, Hossein N.; Lekkas, Anastasios M.; Gros, Sebastien. MPC-based Reinforcement Learning for a Simplified Freight Mission of Autonomous Surface Vehicles. 2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021: 2990-2995
[5] Darby, Mark L.; Nikolaou, Michael. MPC: Current practice and challenges. CONTROL ENGINEERING PRACTICE, 2012, 20 (04): 328-342
[6] Faryadi, Saba; Mohammadpour Velni, Javad. A reinforcement learning-based approach for modeling and coverage of an unknown field using a team of autonomous ground vehicles. INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2021, 36 (02): 1069-1084
[7] Maciel-Pearson BG, 2019, Arxiv, DOI arXiv:1912.05684
[8] Gros S, 2021, P AMER CONTR CONF, P1947, DOI 10.23919/ACC50511.2021.9482765
[9] Gros, Sebastien; Zanon, Mario. Data-Driven Economic NMPC Using Reinforcement Learning. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2020, 65 (02): 636-648
[10] Hoffmann G., 2008, AIAA GUID NAV CONTR, P7410, DOI 10.2514/6.2008-7410
