Evolutionary Multi-Objective Reinforcement Learning Based Trajectory Control and Task Offloading in UAV-Assisted Mobile Edge Computing

被引:79
作者
Song, Fuhong [1 ]
Xing, Huanlai [1 ]
Wang, Xinhan [1 ]
Luo, Shouxi [1 ]
Dai, Penglin [1 ]
Xiao, Zhiwen [1 ]
Zhao, Bowen [1 ]
机构
[1] Southwest Jiaotong Univ, Sch Comp & Artificial Intelligence, Chengdu 611756, Peoples R China
关键词
Mobile edge computing; multi-objective reinforcement learning; task offloading; trajectory control; unmanned aerial vehicle; ALGORITHM; OPTIMIZATION; ALLOCATION;
D O I
10.1109/TMC.2022.3208457
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article studies the trajectory control and task offloading (TCTO) problem in an unmanned aerial vehicle (UAV)-assisted mobile edge computing system, where a UAV flies along a planned trajectory to collect computation tasks from smart devices (SDs). We consider a scenario that SDs are not directly connected by the base station (BS) and the UAV has two roles to play: MEC server or wireless relay. The UAV makes task offloading decisions online, in which the collected tasks can be executed locally on the UAV or offloaded to the BS for remote processing. The TCTO problem involves multi-objective optimization as its objectives are to minimize the task delay and the UAV's energy consumption, and maximize the number of tasks collected by the UAV, simultaneously. This problem is challenging because the three objectives conflict with each other. The existing reinforcement learning (RL) algorithms, either single-objective RLs or single-policy multi-objective RLs, cannot well address the problem since they cannot output multiple policies for various preferences (i.e., weights) across objectives in a single run. An evolutionary multi-objective RL (EMORL) algorithm is applied to address the TCTO problem. We improve the multi-task multi-objective proximal policy optimization of the original EMORL by retaining all new learning tasks in the offspring population, which can preserve promissing learning tasks. The simulation results demonstrate that the proposed algorithm can obtain more excellent non-dominated policies by striking a balance between the three objectives regarding policy quality, compared with two evolutionary algorithms, two multi-policy RL algorithms, and the original EMORL.
引用
收藏
页码:7387 / 7405
页数:19
相关论文
共 50 条
[1]  
Abels A, 2019, PR MACH LEARN RES, V97
[2]   Data Offloading in UAV-Assisted Multi-Access Edge Computing Systems Under Resource Uncertainty [J].
Apostolopoulos, Pavlos Athanasios ;
Fragkos, Georgios ;
Tsiropoulou, Eirini Eleni ;
Papavassiliou, Symeon .
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2023, 22 (01) :175-190
[3]  
Chen X., 2020, P IEEE 91 VEH TECHN, P1
[4]   Age of Information-Aware Resource Management in UAV-Assisted Mobile-Edge Computing Systems [J].
Chen, Xianfu ;
Wu, Celimuge ;
Chen, Tao ;
Liu, Zhi ;
Bennis, Mehdi ;
Ji, Yusheng .
2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
[5]   Intelligent Task Offloading and Energy Allocation in the UAV-Aided Mobile Edge-Cloud Continuum [J].
Cheng, Zhipeng ;
Gao, Zhibin ;
Liwang, Minghui ;
Huang, Lianfen ;
Du, Xiaojiang ;
Guizani, Mohsen .
IEEE NETWORK, 2021, 35 (05) :42-49
[6]   Joint Optimization of Energy Consumption and Latency in Mobile Edge Computing for Internet of Things [J].
Cui, Laizhong ;
Xu, Chong ;
Yang, Shu ;
Huang, Joshua Zhexue ;
Li, Jianqiang ;
Wang, Xizhao ;
Ming, Zhong ;
Lu, Nan .
IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (03) :4791-4803
[7]   Towards Energy-Efficient Scheduling of UAV and Base Station Hybrid Enabled Mobile Edge Computing [J].
Dai, Bin ;
Niu, Jianwei ;
Ren, Tao ;
Hu, Zheyuan ;
Atiquzzaman, Mohammed .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (01) :915-930
[8]   An Evolutionary Many-Objective Optimization Algorithm Using Reference-Point-Based Nondominated Sorting Approach, Part I: Solving Problems With Box Constraints [J].
Deb, Kalyanmoy ;
Jain, Himanshu .
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2014, 18 (04) :577-601
[9]   Energy-Efficient Resource Allocation in Multi-UAV-Assisted Two-Stage Edge Computing for Beyond 5G Networks [J].
Ei, Nway Nway ;
Alsenwi, Madyan ;
Tun, Yan Kyaw ;
Han, Zhu ;
Hong, Choong Seon .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (09) :16421-16432
[10]  
Fujimoto S, 2018, PR MACH LEARN RES, V80