Drone-Cell Trajectory Planning and Resource Allocation for Highly Mobile Networks: A Hierarchical DRL Approach

被引:53
|
作者
Shi, Weisen [1 ]
Li, Junling [1 ,2 ]
Wu, Huaqing [1 ]
Zhou, Conghao [1 ]
Cheng, Nan [3 ]
Shen, Xuemin [1 ]
机构
[1] Univ Waterloo, Dept Elect & Comp Engn, Waterloo, ON N2L 3G1, Canada
[2] Shenzhen Inst Artificial Intelligence & Robot Soc, Shenzhen, Guangdong, Peoples R China
[3] Xidian Univ, Sch Telecommun, Xian 710071, Peoples R China
来源
IEEE INTERNET OF THINGS JOURNAL | 2021年 / 8卷 / 12期
基金
加拿大自然科学与工程研究理事会; 中国国家自然科学基金;
关键词
Trajectory; Planning; Resource management; Throughput; Internet of Things; Radio access networks; Real-time systems; Drone cell; drone-assisted radio access network (RAN); space-air-ground integration; trajectory planning; VEHICULAR NETWORKS; DESIGN; 5G; UAVS;
D O I
10.1109/JIOT.2020.3020067
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Drone cell (DC) is envisioned to enable the dynamic service provisioning for radio access networks (RANs), in response to the spatial and temporal unevenness of user traffic. In this article, we propose a hierarchical deep reinforcement learning (DRL)-based multi-DC trajectory planning and resource allocation (HDRLTPRA) scheme for high-mobility users. The objective is to maximize the accumulative network throughput while satisfying user fairness, DC power consumption, and DC-to-ground link quality constraints. To address the high uncertainties of the environment, we decouple the multi-DC TPRA problem into two hierarchical subproblems, i.e., the higher level global trajectory planning (GTP) subproblem and the lower level local TPRA (LTPRA) subproblem. First, the GTP subproblem is to address trajectory planning for multiple DCs in the RAN over a long time period. To solve the subproblem, we propose a multiagent DRL-based GTP (MARL-GTP) algorithm in which the nonstationary state space caused by the multi-DC environment is addressed by the multiagent fingerprint technique. Second, based on the GTP results, each DC solves the LTPRA subproblem independently to control the movement and transmit power allocation based on the real-time user traffic variations. A deep deterministic policy gradient (DEP)-based LTPRA (DEP-LTPRA) algorithm is then proposed to solve the LTPRA subproblem. With the two algorithms addressing both subproblems at different decision granularities, the multi-DC TPRA problem can be resolved by the HDRLTPRA scheme. Simulation results show that 40% network throughput improvement can be achieved by the proposed HDRLTPRA scheme over the nonlearning-based TPRA scheme.
引用
收藏
页码:9800 / 9813
页数:14
相关论文
共 23 条
  • [1] Resource Allocation and Trajectory Optimization in Multi-UAV Collaborative Vehicular Networks: An Extended Multiagent DRL Approach
    Zhang, Wenqian
    Tan, Lu
    Huang, Tao
    Huang, Xiaowen
    Huang, Mengting
    Zhang, Guanglin
    IEEE INTERNET OF THINGS JOURNAL, 2025, 12 (08): : 9391 - 9404
  • [2] Joint Optimization of Trajectory Control, Resource Allocation, and User Association Based on DRL for Multi-Fixed-Wing UAV Networks
    Yin, Baolin
    Fang, Xuming
    Wang, Xianbin
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (10) : 13330 - 13343
  • [3] Information freshness-oriented trajectory planning and resource allocation for UAV-assisted vehicular networks
    Gai, Hao
    Zhang, Haixia
    Guo, Shuaishuai
    Yuan, Dongfeng
    CHINA COMMUNICATIONS, 2023, 20 (05) : 244 - 262
  • [4] DRL-Based Resource Allocation and Trajectory Planning for NOMA-Enabled Multi-UAV Collaborative Caching 6G Network
    Qin, Peng
    Fu, Yang
    Zhang, Jing
    Geng, Suiyan
    Liu, Jiayan
    Zhao, Xiongwen
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (06) : 8750 - 8764
  • [5] Resource Allocation Using Deep Learning in Mobile Small Cell Networks
    Zafar, Saniya
    Jangsher, Sobia
    Al-Dweik, Arafat
    IEEE TRANSACTIONS ON GREEN COMMUNICATIONS AND NETWORKING, 2022, 6 (03): : 1903 - 1915
  • [6] Blockchain-Integrated UAV-Assisted Mobile Edge Computing: Trajectory Planning and Resource Allocation
    Wang, Die
    Jia, Yunjian
    Dong, Mianxiong
    Ota, Kaoru
    Liang, Liang
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (01) : 1263 - 1275
  • [7] A Joint Optimization Algorithm for Trajectory Planning and Resource Allocation of Vehicle Mobile Base Stations for On-Demand Coverage Networks
    Zhao, Lingyu
    Zhu, Xiaorong
    PROCESSES, 2024, 12 (02)
  • [8] Resource Allocation in Multi-Cell Integrated Sensing and Communication Systems: A DRL Approach
    Wang, Xiaoming
    Wu, Huiling
    Xu, Youyun
    Cao, Haotong
    Kumar, Neeraj
    Rodrigues, Joel J. P. C.
    ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 3210 - 3215
  • [9] Distributed Resource Allocation for Data Center Networks: A Hierarchical Game Approach
    Zhang, Huaqing
    Xiao, Yong
    Bu, Shengrong
    Yu, Richard
    Niyato, Dusit
    Han, Zhu
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2020, 8 (03) : 778 - 789
  • [10] Energy-efficient trajectory planning and resource allocation in UAV communication networks under imperfect channel prediction
    Sheng, Min
    Zhao, Chenxi
    Liu, Junyu
    Teng, Wei
    Dai, Yanpeng
    Li, Jiandong
    SCIENCE CHINA-INFORMATION SCIENCES, 2022, 65 (12)