Multi-agent deep reinforcement learning for trajectory planning in UAVs-assisted mobile edge computing with heterogeneous requirements

被引:1
作者
Fan, Chenchen [1 ,2 ]
Xu, Hongyu [1 ]
Wang, Qingling [1 ,2 ]
机构
[1] Southeast Univ, Xinjiekou St, Nanjing 210096, Jiangsu, Peoples R China
[2] China Acad Elect & Informat Technol, Beijing 100041, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep reinforcement learning; Mobile edge computing; Unmanned aerial vehicle; Counterfactual inference; Whale optimization algorithm; RESOURCE-ALLOCATION; DESIGN; MEC;
D O I
10.1016/j.comnet.2024.110469
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In heterogeneous wireless networks, massive user equipments (UEs) generate computing tasks with timevarying heterogeneous requirements. To improve the service quality, this paper formulates a unmanned aerial vehicles (UAVs)-assisted mobile edge computing (MEC) framework for time -varying heterogeneous task requirements. In the framework, the task delay and the number of successfully executed tasks are optimized by jointly controlling the trajectories of multiple UAVs. To address the considered trajectory planning optimization problem, a collaborative multi -agent deep reinforcement learning (MADRL) algorithm is proposed, where each UAV is regarded as a learning agent. First, a counterfactual inference based personalized policy update mechanism is proposed to evaluate the independent policy of agents by comparing the policy with a designed counterfactual policy. Based on this idea, each agent updates a personalized policy from both group and individual interests to improve its cooperation ability in dynamic and complex environments. Then, a diversified experience sampling mechanism is proposed to enhance the efficiency of policy evaluation and update with rich experiences provided by the environment interaction and the modified whale optimization algorithm. Finally, evaluation results demonstrate the superiority and effectiveness of the proposed MADRL algorithm.
引用
收藏
页数:15
相关论文
共 33 条
[1]   Optimal LAP Altitude for Maximum Coverage [J].
Al-Hourani, Akram ;
Kandeepan, Sithamparanathan ;
Lardner, Simon .
IEEE WIRELESS COMMUNICATIONS LETTERS, 2014, 3 (06) :569-572
[2]   Reinforcement learning based tasks offloading in vehicular edge computing networks [J].
Cao, Shaohua ;
Liu, Di ;
Dai, Congcong ;
Wang, Chengqi ;
Yang, Yansheng ;
Zhang, Weishan ;
Zheng, Danyang .
COMPUTER NETWORKS, 2023, 234
[3]   A balanced whale optimization algorithm for constrained engineering design problems [J].
Chen, Huiling ;
Xu, Yueting ;
Wang, Mingjing ;
Zhao, Xuehua .
APPLIED MATHEMATICAL MODELLING, 2019, 71 :45-59
[4]   Towards Energy-Efficient Scheduling of UAV and Base Station Hybrid Enabled Mobile Edge Computing [J].
Dai, Bin ;
Niu, Jianwei ;
Ren, Tao ;
Hu, Zheyuan ;
Atiquzzaman, Mohammed .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (01) :915-930
[5]   Game Combined Multi-Agent Reinforcement Learning Approach for UAV Assisted Offloading [J].
Gao, Ang ;
Wang, Qi ;
Liang, Wei ;
Ding, Zhiguo .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (12) :12888-12901
[6]  
He Y., 2023, IEEE Internet of Things Journal
[7]   Adaptive Computation Offloading Policy for Multi-Access Edge Computing in Heterogeneous Wireless Networks [J].
Ke, Hongchang ;
Wang, Hui ;
Sun, Weijia ;
Sun, Hongbin .
IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2022, 19 (01) :289-305
[8]   Intelligent Offloading and Resource Allocation in Heterogeneous Aerial Access IoT Networks [J].
Lakew, Demeke Shumeye ;
Tran, Anh-Tien ;
Dao, Nhu-Ngoc ;
Cho, Sungrae .
IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (07) :5704-5718
[9]   Multiagent Reinforcement Learning in Controlling Offloading Ratio and Trajectory for Multi-UAV Mobile-Edge Computing [J].
Lee, Wonseok ;
Kim, Taejoon .
IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (02) :3417-3429
[10]   Task offloading optimization of cruising UAV with fixed trajectory [J].
Liu, Peng ;
He, Han ;
Fu, Tingting ;
Lu, Huijuan ;
Alelaiwi, Abdulhameed ;
Wasi, Md Wasif Islam .
COMPUTER NETWORKS, 2021, 199