Multi-Agent Deep Reinforcement Learning for Task Offloading in UAV-Assisted Mobile Edge Computing

被引:223
作者
Zhao, Nan [1 ]
Ye, Zhiyang [1 ]
Pei, Yiyang [2 ]
Liang, Ying-Chang [3 ,4 ]
Niyato, Dusit [5 ]
机构
[1] Hubei Univ Technol, Hubei Collaborat Innovat Ctr High Efficiency Util, Wuhan 430068, Peoples R China
[2] Singapore Inst Technol, Singapore 138683, Singapore
[3] Univ Elect Sci & Technol China UESTC, Ctr Intelligent Networking & Commun CINC, Chengdu 610056, Peoples R China
[4] Peng Cheng Lab, Shenzhen 518066, Guangdong, Peoples R China
[5] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore
基金
新加坡国家研究基金会; 中国国家自然科学基金;
关键词
Task analysis; Resource management; Trajectory; Wireless communication; Servers; Optimization; Manganese; Mobile edge computing; UAV networks; task offloading; cooperative offloading; deep reinforcement learning; RESOURCE-ALLOCATION; TRAJECTORY DESIGN; MEC;
D O I
10.1109/TWC.2022.3153316
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Mobile edge computing can effectively reduce service latency and improve service quality by offloading computation-intensive tasks to the edges of wireless networks. Due to the characteristic of flexible deployment, wide coverage and reliable wireless communication, unmanned aerial vehicles (UAVs) have been employed as assisted edge clouds (ECs) for large-scale sparely-distributed user equipment. Considering the limited computation and energy capacities of UAVs, a collaborative mobile edge computing system with multiple UAVs and multiple ECs is investigated in this paper. The task offloading issue is addressed to minimize the sum of execution delays and energy consumptions by jointly designing the trajectories, computation task allocation, and communication resource management of UAVs. Moreover, to solve the above non-convex optimization problem, a Markov decision process is formulated for the multi-UAV assisted mobile edge computing system. To obtain the joint strategy of trajectory design, task allocation, and power management, a cooperative multi-agent deep reinforcement learning framework is investigated. Considering the high-dimensional continuous action space, the twin delayed deep deterministic policy gradient algorithm is exploited. The evaluation results demonstrate that our multi-UAV multi-EC task offloading method can achieve better performance compared with the other optimization approaches.
引用
收藏
页码:6949 / 6960
页数:12
相关论文
共 36 条
[1]   3-D Placement of an Unmanned Aerial Vehicle Base Station (UAV-BS) for Energy-Efficient Maximal Coverage [J].
Alzenad, Mohamed ;
El-Keyi, Amr ;
Lagum, Faraj ;
Yanikomeroglu, Halim .
IEEE WIRELESS COMMUNICATIONS LETTERS, 2017, 6 (04) :434-437
[2]   Data Offloading in UAV-Assisted Multi-Access Edge Computing Systems Under Resource Uncertainty [J].
Apostolopoulos, Pavlos Athanasios ;
Fragkos, Georgios ;
Tsiropoulou, Eirini Eleni ;
Papavassiliou, Symeon .
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2023, 22 (01) :175-190
[3]   Hierarchical Game-Theoretic and Reinforcement Learning Framework for Computational Offloading in UAV-Enabled Mobile Edge Computing Networks With Multiple Service Providers [J].
Asheralieva, Alia ;
Niyato, Dusit .
IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (05) :8753-8769
[4]   Edge Computing Assisted Autonomous Flight for UAV: Synergies between Vision and Communications [J].
Chen, Quan ;
Zhu, Hai ;
Yang, Lei ;
Chen, Xiaoqian ;
Pollin, Sofie ;
Vinogradov, Evgenii .
IEEE COMMUNICATIONS MAGAZINE, 2021, 59 (01) :28-33
[5]   Gradient estimation algorithms for the parameter identification of bilinear systems using the auxiliary model [J].
Ding, Feng ;
Xu, Ling ;
Meng, Dandan ;
Jin, Xue-Bo ;
Alsaedi, Ahmed ;
Hayat, Tasawar .
JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2020, 369
[6]   The innovation algorithms for multivariable state-space models [J].
Ding, Feng ;
Zhang, Xiao ;
Xu, Ling .
INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2019, 33 (11) :1601-1618
[7]  
Fujimoto S, 2018, Arxiv, DOI [arXiv:1802.09477, 10.48550/arXiv.1802.09477]
[8]   Energy Consumption Minimization in UAV-Assisted Mobile-Edge Computing Systems: Joint Resource Allocation and Trajectory Design [J].
Ji, Jiequ ;
Zhu, Kun ;
Yi, Changyan ;
Niyato, Dusit .
IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (10) :8570-8584
[9]   Riding on the Primary: A New Spectrum Sharing Paradigm for Wireless-Powered IoT Devices [J].
Kang, Xin ;
Liang, Ying-Chang ;
Yang, Jing .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2018, 17 (09) :6335-6347
[10]   Energy-Efficient UAV-Assisted Mobile Edge Computing: Resource Allocation and Trajectory Optimization [J].
Li, Mushu ;
Cheng, Nan ;
Gao, Jie ;
Wang, Yinlu ;
Zhao, Lian ;
Shen, Xuemin .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (03) :3424-3438