Edge Computing Task Offloading Optimization for a UAV-Assisted Internet of Vehicles via Deep Reinforcement Learning

被引：42

作者：

Yan, Ming ^{[1
,2
]}

Xiong, Rui ^{[1
,2
]}

Wang, Yan ^{[3
]}

Li, Chunguo ^{[4
]}

机构：

[1] Commun Univ China, Sch Informat & Commun Engn, Beijing 100024, Peoples R China

[2] Commun Univ China, Key Lab Acoust Visual Technol & Intelligent Contro, Beijing 100024, Peoples R China

[3] Commun Univ China, Sch Data Sci & Intelligent Media, Beijing 100024, Peoples R China

[4] Southeast Univ, Sch Informat Sci & Engn, Nanjing 210096, Peoples R China

来源：

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY | 2024年 / 73卷 / 04期

基金：

中国国家自然科学基金;

关键词：

Unmanned aerial vehicle (UAV); Internet of Vehicles (IoV); task offloading; deep deterministic policy gradient (DDPG); MODEL;

D O I：

10.1109/TVT.2023.3331363

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In the context of the unmanned aerial vehicle (UAV)-assisted vehicular networking system, more network factors need to be considered to ensure the safe operation of connected vehicles. A large volume of delay-sensitive and computationally demanding tasks necessitate offloading to UAVs or roadside units for processing. And the efficient allocation of various network resources of vehicles, UAVs, and roadside units under constrained conditions determines the efficiency of task offloading. Deep reinforcement learning (DRL) has demonstrated its efficacy as an experienced approach for solving such problems. In this article, we delve into the utilization of deep reinforcement learning to design an efficient UAV-assisted vehicular edge computing task offloading strategy. Under the constraints of limited network bandwidth and limited UAV power, the trajectory and the task offloading strategy of the UAV are jointly optimized. The primary objective of our proposed strategy is to achieve a notable reduction in the system delay of the edge computing network. Given the dynamic variability of tasks arrival, we employ a long short-term memory (LSTM) network with the attention mechanism and a deep deterministic policy gradient (DDPG) algorithm to effectively model the optimization problem as a Markov decision process. This approach can obtain the optimal policy through interactive learning from the UAV and the vehicle environment. The experiment results illustrate that this strategy outperforms other baseline strategies in terms of convergence speed, network delay, and task offloading ratio.

引用

页码：5647 / 5658

页数：12

共 40 条

[1]

Al-Hilo D., 2021, IEEETrans.Emerg.Top.Comput., early access, DOI [10.1109/TETC.2021.3068014.[12]Y., DOI 10.1109/TETC.2021.3068014.[12]Y]

[2] D3PG: Dirichlet DDPG for Task Partitioning and Offloading With Constrained Hybrid Action Space in Mobile-Edge Computing [J].

Ale, Laha ;

King, Scott A. ;

Zhang, Ning ;

Sattar, Abdul Rahman ;

Skandaraniyam, Janahan .

IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (19) :19260-19272

[3]

[Anonymous], 2021, IEEE Internet Things J., V8, P7938

[4]

[Anonymous], 2022, Int. J. Aerosp. Eng., V2022, P1

[5]

[Anonymous], 2016, IEEE J.Sel. Areas Commun., V34, P3168

[6] Intelligent Pricing Model for Task Offloading in Unmanned Aerial Vehicle Mounted Mobile Edge Computing for Vehicular Network [J].

Baktayan, Asrar Ahmed ;

Al-Baltah, Ibrahim Ahmed ;

Abd Ghani, Abdul Azim .

JOURNAL OF COMMUNICATIONS SOFTWARE AND SYSTEMS, 2022, 18 (02) :111-123

[7]

Chao Wen, 2021, 2021 2nd International Conference on Artificial Intelligence and Computer Engineering (ICAICE)., P261, DOI 10.1109/ICAICE54393.2021.00059

[8]

Chen ZhiJiang, 2022, 2022 IEEE 22nd International Conference on Communication Technology (ICCT), P608, DOI 10.1109/ICCT56141.2022.10073166

[9] UAV-Enabled Mobile-Edge Computing for AI Applications: Joint Model Decision, Resource Allocation, and Trajectory Optimization [J].

Deng, Cailian ;

Fang, Xuming ;

Wang, Xianbin .

IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (07) :5662-5675

[10]

Fahlstrom T. J., 2022, Introduction to UAVSystems

← 1 2 3 4 →