Design of Anti-Interference Path Planning for Cellular-Connected UAVs Based on Improved DDPG

Cited by: 0

Authors:

Zhou, Quanxi [1]

Wang, Yongjing [2]

Affiliations:

[1] Beihang Univ, Sch Cyber Sci & Technol, Beijing, Peoples R China

[2] Zhejiang Univ, Coll Control Sci & Engn, Hangzhou, Zhejiang, Peoples R China

Source:

PROCEEDINGS OF THE 2024 IEEE 10TH IEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE AND SMART COMPUTING, HPSC 2024 | 2024

Keywords:

UAV; Path Planning; Reinforcement Learning; Transmission Outage Probability; DDPG; Post Decision State; COMMUNICATION; NAVIGATION;

DOI:

10.1109/HPSC62738.2024.00020

Chinese Library Classification (CLC) number:

TP39 [Computer applications];

Discipline classification codes:

081203 ; 0835 ;

Abstract:

The flight and communication security of cellular-connected unmanned aerial vehicles (UAVs) is an important and active research direction. Because the environment is complex, UAVs face a complex and ever-changing task space. In recent years, reinforcement learning has advanced rapidly and has been widely applied to path planning problems in complex scenarios. However, methods that rely on a discrete action space are limited in accuracy. To address these problems, this paper proposes a new UAV path planning method based on deep reinforcement learning. Specifically, it adopts an improved Deep Deterministic Policy Gradient (DDPG) method within the Actor-Critic framework, which improves accuracy. To further enhance the algorithm's precision and training speed, the paper introduces the Post-Decision State method, which leverages experience for prediction to optimize the training results and enable UAVs to adapt to the ever-changing environment. Simulation experiments show that the improved method increases training speed and significantly improves path performance.

Citation

Pages: 71-76

Page count: 6

