Path Planning for Cellular-Connected UAV: A DRL Solution With Quantum-Inspired Experience Replay

被引:48
作者
Li, Yuanjian [1 ]
Aghvami, A. Hamid [1 ]
Dong, Daoyi [2 ]
机构
[1] Kings Coll London, Ctr Telecommun Res CTR, London WC2R 2LS, England
[2] Univ New South Wales, Sch Engn & Informat Technol, Canberra, ACT 2600, Australia
关键词
Autonomous aerial vehicles; Wireless communication; Optimization; Navigation; Trajectory; Antenna radiation patterns; Reinforcement learning; Drone; trajectory design; deep reinforcement learning; quantum-inspired experience replay; INTERFERENCE CANCELLATION; TRAJECTORY OPTIMIZATION; COMMUNICATION; NETWORKS;
D O I
10.1109/TWC.2022.3162749
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In cellular-connected unmanned aerial vehicle (UAV) network, a minimization problem on the weighted sum of time cost and expected outage duration is considered. Taking advantage of UAV's adjustable mobility, a UAV navigation approach is formulated to achieve the aforementioned optimization goal. Conventional offline optimization techniques suffer from inefficiency in accomplishing the formulated UAV navigation task due to the practical consideration of local building distribution and directional antenna radiation pattern. Alternatively, after mapping the navigation task into a Markov decision process (MDP), a deep reinforcement learning (DRL)-aided solution is proposed to help the UAV find the optimal flying direction within each time slot, and thus the designed trajectory towards the destination can be generated. To help the DRL agent commit a better trade-off between sampling priority and diversity, a novel quantum-inspired experience replay (QiER) framework is proposed, via relating experienced transition's importance to its associated quantum bit (qubit) and applying Grover iteration based amplitude amplification technique. Compared to several representative DRL-related and non-learning baselines, the effectiveness and supremacy of the proposed DRL-QiER solution are demonstrated and validated in numerical results.
引用
收藏
页码:7897 / 7912
页数:16
相关论文
共 15 条
  • [11] UAV Path Planning Based on the Average TD3 Algorithm With Prioritized Experience Replay
    Luo, Xuqiong
    Wang, Qiyuan
    Gong, Hongfang
    Tang, Chao
    IEEE ACCESS, 2024, 12 : 38017 - 38029
  • [12] Optimizing UAV Base Station Positioning through Quantum-Inspired Solution Workflow
    Saravanan, M.
    Pathmanaban, Viswanath
    2024 20TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING IN SMART SYSTEMS AND THE INTERNET OF THINGS, DCOSS-IOT 2024, 2024, : 347 - 352
  • [13] 3D Global Path Planning Optimization for Cellular-Connected UAVs under Link Reliability Constraint
    Behjati, Mehran
    Nordin, Rosdiadee
    Zulkifley, Muhammad Aidiel
    Abdullah, Nor Fadzilah
    SENSORS, 2022, 22 (22)
  • [14] Cellular Connected UAV Anti-Interference Path Planning Based on PDS-DDPG and TOPEM
    Zhou, Quanxi
    Wang, Yongjing
    Shen, Ruiyu
    Nakazato, Jin
    Tsukada, Manabu
    Guan, Zhenyu
    IEEE JOURNAL ON MINIATURIZATION FOR AIR AND SPACE SYSTEMS, 2025, 6 (01): : 2 - 18
  • [15] A Multi-Objective Quantum-Inspired Seagull Optimization Algorithm Based on Decomposition for Unmanned Aerial Vehicle Path Planning
    Wang, Peng
    Deng, Zhiliang
    IEEE ACCESS, 2022, 10 : 110497 - 110511