Multiobjective Deep Reinforcement Learning for Computation Offloading and Trajectory Control in UAV-Base-Station-Assisted MEC

Cited by: 0

Authors:

Huang, Hao [1]

Chai, Zheng-Yi [1]

Sun, Bao-Shan [1]

Kang, Hong-Shen [1]

Zhao, Ying-Jie [1]

Affiliation:

[1] Tiangong Univ, Sch Comp Sci, Tianjin Key Lab Autonomous Intelligence Technol &, Tianjin 300387, Peoples R China

Source:

IEEE INTERNET OF THINGS JOURNAL | 2024 / Vol. 11 / No. 19

Funding:

National Natural Science Foundation of China;

Keywords:

Autonomous aerial vehicles; Task analysis; Delays; Energy consumption; Real-time systems; Trajectory; Servers; Computation offloading; multiaccess edge computing (MEC); multiobjective reinforcement learning; trajectory control; unmanned aerial vehicle (UAV); RESOURCE-ALLOCATION;

DOI:

10.1109/JIOT.2024.3420884

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812 ;

Abstract:

Unmanned aerial vehicle (UAV) and base station jointly assisted multiaccess edge computing (UB-MEC) is a promising approach to providing flexible computing services for resource-limited devices. Because device loads cannot be observed in real time and demand is dynamic in UB-MEC, it is highly challenging to make the UAV respond in real time to users' dynamic preferences. To this end, we propose a multiobjective deep reinforcement learning (MODRL) method for computation offloading and trajectory control (COTC) of the UAV. First, the problem is formulated as a multiobjective Markov decision process (MOMDP), in which the traditional scalar reward is extended to a vector whose components correspond to the amount of task data collected, the completion delay, and the UAV's energy consumption, and the weights are dynamically adjusted to meet different user preferences. Then, because the device load information stored in the UAV is not real-time, an attentional long short-term memory (ALSTM) network is designed to predict real-time states by automatically focusing on important historical information. In addition, near on-policy experience replay (NOER) replays experiences close to the current policy, which better promotes learning of the current policy. Simulation results show that the proposed algorithm obtains an action policy that meets the user's time-varying preferences and achieves a good balance among the objectives under different preferences.
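The abstract describes a vector reward whose components are weighted by a user preference. One common way to combine such a vector with a preference is a normalized weighted sum; the sketch below illustrates that idea only. Whether the paper uses a weighted sum, and how it signs or scales each objective, is not stated in this record, so the function name `scalarize`, the sign convention (delay and energy entered as negative values), and the example numbers are all assumptions made for illustration.

```python
from typing import Sequence


def scalarize(reward: Sequence[float], preference: Sequence[float]) -> float:
    """Combine a vector reward into one scalar with a normalized weighted sum.

    Hypothetical helper, not taken from the paper. `reward` holds one value
    per objective; `preference` holds one non-negative weight per objective.
    """
    if len(reward) != len(preference):
        raise ValueError("reward and preference must have the same length")
    if any(w < 0 for w in preference):
        raise ValueError("preference weights must be non-negative")
    total = sum(preference)
    if total <= 0:
        raise ValueError("at least one preference weight must be positive")
    # Normalize the weights so they sum to 1, then take the weighted sum.
    return sum((w / total) * r for w, r in zip(preference, reward))


# Assumed ordering: [task data collected, -completion delay, -energy consumption].
# Costs are negated so that a larger scalar is always better (an assumption).
step_reward = [0.8, -0.5, -0.3]

# A user who cares mostly about delay versus one who cares mostly about energy.
delay_focused = scalarize(step_reward, [0.2, 0.6, 0.2])
energy_focused = scalarize(step_reward, [0.2, 0.2, 0.6])
print(delay_focused, energy_focused)
```

Changing the preference vector between steps changes which objective dominates the scalar signal, which is the sense in which a single policy can be asked to track time-varying preferences.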


Pages: 31805-31821

Number of pages: 17

50 items in total

[21] Joint Optimization of Trajectory, Offloading, Caching, and Migration for UAV-Assisted MEC
Zhao, Mingxiong
Zhang, Rongqian
He, Zhenli
Li, Keqin
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2025, 24 (03) : 1981 - 1998
[22] Cooperative Data Sensing and Computation Offloading in UAV-Assisted Crowdsensing With Multi-Agent Deep Reinforcement Learning
Cai, Ting
Yang, Zhihua
Chen, Yufei
Chen, Wuhui
Zheng, Zibin
Yu, Yang
Dai, Hong-Ning
IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2022, 9 (05) : 3197 - 3211
[23] Dynamic Trajectory Design and Bandwidth Adjustment for Energy-Efficient UAV-Assisted Relaying With Deep Reinforcement Learning in MEC IoT System
Du, Tianjiao
Gui, Xiaolin
Teng, Xiaoyu
Zhang, Kaiyuan
Ren, Dewang
IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (23) : 37463 - 37479
[24] A Spiking Reinforcement Trajectory Planning for UAV-Assisted MEC Systems
Xia, Zeyang
Dong, Li
Jiang, Feibo
IEEE ACCESS, 2024, 12 : 54435 - 54448
[25] A Joint Trajectory and Computation Offloading Scheme for UAV-MEC Networks via Multi-Agent Deep Reinforcement Learning
Du, Xinyang
Li, Xuanheng
Zhao, Nan
Wang, Xianbin
ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023 : 5438 - 5443
[26] Multiagent Deep Reinforcement Learning for Vehicular Computation Offloading in IoT
Zhu, Xiaoyu
Luo, Yueyi
Liu, Anfeng
Bhuiyan, Md Zakirul Alam
Zhang, Shaobo
IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (12) : 9763 - 9773
[27] Computing Over the Sky: Joint UAV Trajectory and Task Offloading Scheme Based on Optimization-Embedding Multi-Agent Deep Reinforcement Learning
Li, Xuanheng
Du, Xinyang
Zhao, Nan
Wang, Xianbin
IEEE TRANSACTIONS ON COMMUNICATIONS, 2024, 72 (03) : 1355 - 1369
[28] Deep Reinforcement Learning Based 3D-Trajectory Design and Task Offloading in UAV-Enabled MEC System
Liu, Chuanjie
Zhong, Yalin
Wu, Ruolin
Ren, Siyu
Du, Shuang
Guo, Bing
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2025, 74 (02) : 3185 - 3195
[29] Multiobjective Deep Reinforcement Learning Assisted Resource Allocation for MEC-Caching-Coexist System
Li, Zan
Zhao, Zhongling
Shi, Jia
Si, Jiangbo
Xiao, Pei
Tafazolli, Rahim
Hu, Hang
IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (04) : 6158 - 6170
[30] Deep Reinforcement Learning for Task Offloading and Power Allocation in UAV-Assisted MEC System
Zhao, Nan
Ren, Fan
Du, Wei
Ye, Zhiyang
INTERNATIONAL JOURNAL OF MOBILE COMPUTING AND MULTIMEDIA COMMUNICATIONS, 2021, 12 (04) : 32 - 51
