Enabling Efficient Scheduling in Large-Scale UAV-Assisted Mobile-Edge Computing via Hierarchical Reinforcement Learning

被引:81
作者
Ren, Tao [1 ,2 ]
Niu, Jianwei [1 ,2 ,3 ]
Dai, Bin [1 ,2 ]
Liu, Xuefeng [1 ,2 ]
Hu, Zheyuan [4 ]
Xu, Mingliang [5 ]
Guizani, Mohsen [6 ]
机构
[1] Beihang Univ, Hangzhou Innovat Inst, Hangzhou 310051, Peoples R China
[2] Beihang Univ, Sch Comp Sci & Engn, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China
[3] Zhengzhou Univ, Res Inst Ind Technol, Sch Informat Engn, Zhengzhou 450001, Peoples R China
[4] Beihang Univ, Sch Comp Sci & Engn, Beijing 100191, Peoples R China
[5] Zhengzhou Univ, Sch Informat Engn, Zhengzhou 450001, Peoples R China
[6] Qatar Univ, Dept Comp Sci & Engn, Doha, Qatar
关键词
Optimization; Processor scheduling; Task analysis; Dynamic scheduling; Reinforcement learning; Computational modeling; Real-time systems; Computation offloading; hierarchical reinforcement learning (HRL); mobile edge computing (MEC); trajectory optimization; unmanned aerial vehicle; RESOURCE-ALLOCATION;
D O I
10.1109/JIOT.2021.3071531
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Due to the high maneuverability and flexibility, unmanned aerial vehicles (UAVs) have been considered as a promising paradigm to assist mobile edge computing (MEC) in many scenarios including disaster rescue and field operation. Most existing research focuses on the study of trajectory and computation-offloading scheduling for UAV-assisted MEC in stationary environments, and could face challenges in dynamic environments where the locations of UAVs and mobile devices (MDs) vary significantly. Some latest research attempts to develop scheduling policies for dynamic environments by means of reinforcement learning (RL). However, as these need to explore in high-dimensional state and action space, they may fail to cover in large-scale networks where multiple UAVs serve numerous MDs. To address this challenge, we leverage the idea of "divide-and-conquer" and propose HT3O, a scalable scheduling approach for large-scale UAV-assisted MEC. First, HT3O is built with neural networks via deep RL to obtain real-time scheduling policies for MEC in dynamic environments. More importantly, to make HT3O more scalable, we decompose the scheduling problem into two-layered subproblems and optimize them alternately via hierarchical RL. This not only substantially reduces the complexity of each subproblem, but also improves the convergence efficiency. Experimental results show that HT3O can achieve promising performance improvements over state-of-the-art approaches.
引用
收藏
页码:7095 / 7109
页数:15
相关论文
共 44 条
[1]   Mobile Edge Computing: A Survey [J].
Abbas, Nasir ;
Zhang, Yan ;
Taherkordi, Amir ;
Skeie, Tor .
IEEE INTERNET OF THINGS JOURNAL, 2018, 5 (01) :450-465
[2]  
[Anonymous], 2015, ABS150902971 CORR
[3]   Energy-Optimized Partial Computation Offloading in Mobile-Edge Computing With Genetic Simulated-Annealing-Based Particle Swarm Optimization [J].
Bi, Jing ;
Yuan, Haitao ;
Duanmu, Shuaifei ;
Zhou, MengChu ;
Abusorrah, Abdullah .
IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (05) :3774-3785
[4]  
Chen XF, 2018, IEEE VTS VEH TECHNOL
[5]   An Efficient Computation Offloading Management Scheme in the Densely Deployed Small Cell Networks With Mobile Edge Computing [J].
Guo, Fengxian ;
Zhang, Heli ;
Ji, Hong ;
Li, Xi ;
Leung, Victor C. M. .
IEEE-ACM TRANSACTIONS ON NETWORKING, 2018, 26 (06) :2651-2664
[6]   UAV-Enhanced Intelligent Offloading for Internet of Things at the Edge [J].
Guo, Hongzhi ;
Liu, Jiajia .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (04) :2737-2746
[7]   Integrated Networking, Caching, and Computing for Connected Vehicles: A Deep Reinforcement Learning Approach [J].
He, Ying ;
Zhao, Nan ;
Yin, Hongxi .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2018, 67 (01) :44-55
[8]  
Hou XW, 2020, IEEE CONF COMPUT, P150, DOI [10.1109/INFOCOMWKSHPS50562.2020.9163048, 10.1109/infocomwkshps50562.2020.9163048]
[9]   Joint Offloading and Trajectory Design for UAV-Enabled Mobile Edge Computing Systems [J].
Hu, Qiyu ;
Cai, Yunlong ;
Yu, Guanding ;
Qin, Zhijin ;
Zhao, Minjian ;
Li, Geoffrey Ye .
IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (02) :1879-1892
[10]   Deep Reinforcement Learning for Online Computation Offloading in Wireless Powered Mobile-Edge Computing Networks [J].
Huang, Liang ;
Bi, Suzhi ;
Zhang, Ying-Jun Angela .
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2020, 19 (11) :2581-2593