Drone-Cell Trajectory Planning and Resource Allocation for Highly Mobile Networks: A Hierarchical DRL Approach

Cited by: 59
Authors
Shi, Weisen [1]
Li, Junling [1, 2]
Wu, Huaqing [1]
Zhou, Conghao [1]
Cheng, Nan [3]
Shen, Xuemin [1]
Affiliations
[1] Univ Waterloo, Dept Elect & Comp Engn, Waterloo, ON N2L 3G1, Canada
[2] Shenzhen Inst Artificial Intelligence & Robot Soc, Shenzhen, Guangdong, Peoples R China
[3] Xidian Univ, Sch Telecommun, Xian 710071, Peoples R China
Funding
National Natural Science Foundation of China; Natural Sciences and Engineering Research Council of Canada
Keywords
Trajectory; Planning; Resource management; Throughput; Internet of Things; Radio access networks; Real-time systems; Drone cell; drone-assisted radio access network (RAN); space-air-ground integration; trajectory planning; VEHICULAR NETWORKS; DESIGN; 5G; UAVS;
DOI
10.1109/JIOT.2020.3020067
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Drone cell (DC) is envisioned to enable the dynamic service provisioning for radio access networks (RANs), in response to the spatial and temporal unevenness of user traffic. In this article, we propose a hierarchical deep reinforcement learning (DRL)-based multi-DC trajectory planning and resource allocation (HDRLTPRA) scheme for high-mobility users. The objective is to maximize the accumulative network throughput while satisfying user fairness, DC power consumption, and DC-to-ground link quality constraints. To address the high uncertainties of the environment, we decouple the multi-DC TPRA problem into two hierarchical subproblems, i.e., the higher level global trajectory planning (GTP) subproblem and the lower level local TPRA (LTPRA) subproblem. First, the GTP subproblem is to address trajectory planning for multiple DCs in the RAN over a long time period. To solve the subproblem, we propose a multiagent DRL-based GTP (MARL-GTP) algorithm in which the nonstationary state space caused by the multi-DC environment is addressed by the multiagent fingerprint technique. Second, based on the GTP results, each DC solves the LTPRA subproblem independently to control the movement and transmit power allocation based on the real-time user traffic variations. A deep deterministic policy gradient (DEP)-based LTPRA (DEP-LTPRA) algorithm is then proposed to solve the LTPRA subproblem. With the two algorithms addressing both subproblems at different decision granularities, the multi-DC TPRA problem can be resolved by the HDRLTPRA scheme. Simulation results show that 40% network throughput improvement can be achieved by the proposed HDRLTPRA scheme over the nonlearning-based TPRA scheme.
Pages: 9800-9813
Number of pages: 14
Related papers
36 in total
[11]  
Jain R., 1998, ACM T COMPUT SYST
[12]   OPTIMIZING SPACE-AIR-GROUND INTEGRATED NETWORKS BY ARTIFICIAL INTELLIGENCE [J].
Kato, Nei ;
Fadlullah, Zubair Md. ;
Tang, Fengxiao ;
Mao, Bomin ;
Tani, Shigenori ;
Okamura, Atsushi ;
Liu, Jiajia .
IEEE WIRELESS COMMUNICATIONS, 2019, 26 (04) :140-147
[13]  
Kulkarni TD, 2016, ADV NEUR IN, V29
[14]   UAV Communications for 5G and Beyond: Recent Advances and Future Trends [J].
Li, Bin ;
Fei, Zesong ;
Zhang, Yan .
IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (02) :2241-2263
[15]   Energy-Efficient UAV-Assisted Mobile Edge Computing: Resource Allocation and Trajectory Optimization [J].
Li, Mushu ;
Cheng, Nan ;
Gao, Jie ;
Wang, Yinlu ;
Zhao, Lian ;
Shen, Xuemin .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (03) :3424-3438
[16]   Energy-Efficient UAV Control for Effective and Fair Communication Coverage: A Deep Reinforcement Learning Approach [J].
Liu, Chi Harold ;
Chen, Zheyu ;
Tang, Jian ;
Xu, Jie ;
Piao, Chengzhe .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2018, 36 (09) :2059-2070
[17]   Space-Air-Ground Integrated Network: A Survey [J].
Liu, Jiajia ;
Shi, Yongpeng ;
Fadlullah, Zubair Md. ;
Kato, Nei .
IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2018, 20 (04) :2714-2741
[18]  
Lu X., 2019, UAV AIDED 5G COMMUNI
[19]   Beyond 5G With UAVs: Foundations of a 3D Wireless Cellular Network [J].
Mozaffari, Mohammad ;
Kasgari, Ali Taleb Zadeh ;
Saad, Walid ;
Bennis, Mehdi ;
Debbah, Merouane .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2019, 18 (01) :357-372
[20]   Mobile Unmanned Aerial Vehicles (UAVs) for Energy-Efficient Internet of Things Communications [J].
Mozaffari, Mohammad ;
Saad, Walid ;
Bennis, Mehdi ;
Debbah, Merouane .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2017, 16 (11) :7574-7589