Multiobjective Deep Reinforcement Learning for Computation Offloading and Trajectory Control in UAV-Base-Station-Assisted MEC

Cited by: 2
Authors
Huang, Hao [1]
Chai, Zheng-Yi [1]
Sun, Bao-Shan [1]
Kang, Hong-Shen [1]
Zhao, Ying-Jie [1]
Affiliations
[1] Tiangong Univ, Sch Comp Sci, Tianjin Key Lab Autonomous Intelligence Technol &, Tianjin 300387, Peoples R China
Source
IEEE INTERNET OF THINGS JOURNAL | 2024 / Vol. 11 / Issue 19
Funding
National Natural Science Foundation of China;
Keywords
Autonomous aerial vehicles; Task analysis; Delays; Energy consumption; Real-time systems; Trajectory; Servers; Computation offloading; multiaccess edge computing (MEC); multiobjective reinforcement learning; trajectory control; unmanned aerial vehicle (UAV); RESOURCE-ALLOCATION;
DOI
10.1109/JIOT.2024.3420884
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Multiaccess edge computing jointly assisted by an unmanned aerial vehicle (UAV) and a base station (UB-MEC) is a promising way to provide flexible computing services for resource-limited devices. Because device loads in UB-MEC are not observed in real time and demand is dynamic, it is highly challenging to make the UAV respond in real time to users' dynamic preferences. To this end, we propose a multiobjective deep reinforcement learning (MODRL) method for computation offloading and trajectory control (COTC) of the UAV. First, the problem is formulated as a multiobjective Markov decision process (MOMDP), in which the traditional scalar reward is extended to a vector whose components correspond to the amount of task data collected, the completion delay, and the UAV's energy consumption, and the weights are dynamically adjusted to meet different user preferences. Then, because the device load information stored in the UAV is not real-time, an attentional long short-term memory (ALSTM) network is designed to predict real-time states by automatically focusing on important historical information. In addition, near on-policy experience replay (NOER) replays experiences that are close to on-policy, which better promotes learning of the current policy. The simulation results show that the proposed algorithm can obtain an action policy that meets the user's time-varying preferences and achieves a good balance between the objectives under different preferences.
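The vector reward with preference weights described in the abstract can be illustrated with a small sketch. This is not the authors' method: it assumes plain linear scalarization (a normalized weighted sum), and the function name `scalarize`, the sign convention, and all numeric values are hypothetical choices made only for illustration.

```python
from typing import Sequence


def scalarize(reward: Sequence[float], weights: Sequence[float]) -> float:
    """Collapse a reward vector into a scalar by a normalized weighted sum.

    Assumption: linear scalarization. The abstract only says that weights
    are adjusted to match user preferences, not how they are combined.
    """
    if len(reward) != len(weights):
        raise ValueError("reward and weights must have the same length")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    # Normalize so that only the relative preference matters.
    return sum((w / total) * r for w, r in zip(weights, reward))


# Hypothetical reward vector: (data collected, -delay, -energy).
# Delay and energy are negated so that larger is always better.
reward = (4.0, -2.0, -1.0)

# Two hypothetical user preferences over the same outcome.
data_first = scalarize(reward, (2.0, 1.0, 1.0))
energy_first = scalarize(reward, (1.0, 1.0, 2.0))

print(data_first, energy_first)
```

The same outcome scores higher for the user who weights collected data more heavily than for the user who weights energy more heavily, which is the sense in which a single reward vector can serve several preferences.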
Pages: 31805-31821
Number of pages: 17