Design of Anti-Interference Path Planning for Cellular-Connected UAVs Based on Improved DDPG

被引:0
作者
Zhou, Quanxi [1 ]
Wang, Yongjing [2 ]
机构
[1] Beihang Univ, Sch Cyber Sci & Technol, Beijing, Peoples R China
[2] Zhejiang Univ, Coll Control Sci & Engn, Hangzhou, Zhejiang, Peoples R China
来源
PROCEEDINGS OF THE 2024 IEEE 10TH IEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE AND SMART COMPUTING, HPSC 2024 | 2024年
关键词
UAV; Path Planning; Reinforcement Learning; Transmission Outage Probability; DDPG; Post Decision State; COMMUNICATION; NAVIGATION;
D O I
10.1109/HPSC62738.2024.00020
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The flight and communication security of the Cellular-Connected Unmanned Aerial Vehicles (UAVs) is an important and popular research direction. Due to the complexity of the environmental space, UAVs face a complex and ever-changing task space. In recent years, reinforcement learning has rapidly advanced and widely applied in complex scenarios path planning problems. However, due to the discrete action space, their accuracy is limited. To address aforementioned problems, a new method for UAV path planning based on Deep Reinforcement Learning has been proposed in this paper. Specifically, this paper adopts an improved DDPG method with Actor-Critic framework, which can improve the accuracy. To further enhance the algorithm's precision and training speed, this paper introduces Post-Decision State method, which leverages experience for prediction to optimize the training results and enable UAVs to adapt to the ever-changing environment. Simulation experiments have proved that the improved method can increase training speed and make significant improvements in path performance.
引用
收藏
页码:71 / 76
页数:6
相关论文
共 41 条
  • [1] UAV and obstacle sensing techniques - a perspective
    Aswini, N.
    Kumar, E. Krishna
    Uma, S. V.
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT UNMANNED SYSTEMS, 2018, 6 (01) : 32 - 46
  • [2] Efficient Nakagami-m fading channel simulation
    Beaulieu, NC
    Cheng, C
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2005, 54 (02) : 413 - 424
  • [3] UAV-guided navigation for ground robot tele-operation in a military reconnaissance environment
    Chen, Jessie Y. C.
    [J]. ERGONOMICS, 2010, 53 (08) : 940 - 950
  • [4] Drummond C.D., 2015, AUSTRALASIAN COASTS, P267
  • [5] Throughput Maximization for Periodic Real-Time Systems under the Maximal Temperature Constraint
    Huang, Huang
    Chaturvedi, Vivek
    Quan, Gang
    Fan, Jeffrey
    Qiu, Meikang
    [J]. ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2014, 13
  • [6] Idris Ahmad Syahrin, 2015, 2015 20th Microoptics Conference (MOC), P1, DOI 10.1109/MOC.2015.7416432
  • [7] Reinforcement Learning for UAV Attitude Control
    Koch, William
    Mancuso, Renato
    West, Richard
    Bestavros, Azer
    [J]. ACM TRANSACTIONS ON CYBER-PHYSICAL SYSTEMS, 2019, 3 (02)
  • [8] Development and Testing of a Two-UAV Communication Relay System
    Li, Boyang
    Jiang, Yifan
    Sun, Jingxuan
    Cai, Lingfeng
    Wen, Chih-Yung
    [J]. SENSORS, 2016, 16 (10)
  • [9] Li C. M., 2019, REINFORCEMENT LEARNI
  • [10] Wireless secure communication involving UAV: an overview of physical layer security
    Li, Jiawei
    Cheng, Ruixia
    Zhu, Junwen
    Tian, Yu
    Zhang, Yiwen
    [J]. 2020 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE COMMUNICATION AND NETWORK SECURITY (CSCNS2020), 2021, 336