Task offloading strategy and scheduling optimization for internet of vehicles based on deep reinforcement learning

Cited by: 10

Authors:

Zhao, Xu [1]

Liu, Mingzhen [2]

Li, Maozhen [3]

Affiliations:

[1] Xian Polytech Univ, Sch Elect & Informat, Xian 710048, Peoples R China

[2] Xian Polytech Univ, Sch Comp Sci, Xian 710048, Peoples R China

[3] Brunel Univ London, Dept Elect & Elect Engn, Uxbridge UB8 3PH, England

Source:

AD HOC NETWORKS | 2023 / Vol. 147

Funding:

National Natural Science Foundation of China;

Keywords:

Deep reinforcement learning; Internet of vehicles; Mobile edge computing; Scheduling optimization;

DOI:

10.1016/j.adhoc.2023.103193

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

Driven by the construction of smart cities, network and communication technologies are gradually permeating Internet of Things (IoT) applications in urban infrastructure, such as autonomous driving. In the Internet of Vehicles (IoV) environment, intelligent vehicles generate large volumes of data, but the limited computing power of in-vehicle terminals cannot meet the resulting demand. To solve this problem, we first simulate a task offloading model for vehicle terminals in a Mobile Edge Computing (MEC) environment. Second, based on this model, we design and implement a MEC server collaboration scheme that considers both delay and energy consumption. Third, drawing on optimization theory, we formulate the system optimization problem with the goal of minimizing system cost. Because this is a mixed binary nonlinear programming problem, we model it as a Markov Decision Process (MDP), which turns the original resource allocation decision into a Reinforcement Learning (RL) problem. To seek the optimal solution, we use a Deep Reinforcement Learning (DRL) method. Finally, we propose a Deep Deterministic Policy Gradient (DDPG) algorithm to handle task offloading and scheduling optimization in a high-dimensional continuous action space, with an experience replay mechanism to accelerate convergence and enhance the stability of the network. Simulation results show that our scheme performs well in terms of convergence, system delay, average task energy consumption, and system cost. For example, compared with the baseline algorithm, system cost performance improves by 9.12% under different task sizes, which indicates that our scheme is better suited to highly dynamic IoV environments.
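
The abstract names two mechanisms that a short sketch can make concrete: a system cost that trades off delay against energy, and the experience replay used with DDPG. The Python below is only an illustration under stated assumptions, not the authors' implementation. The weighted-sum form of `system_cost`, the `delay_weight` parameter, the `Transition` layout, and the `soft_update` helper (the target-network update commonly paired with DDPG) are all hypothetical names and choices that the abstract does not specify.

```python
import random
from collections import deque
from typing import Deque, List, Tuple

# (state, action, reward, next_state, done); this layout is an assumption.
Transition = Tuple[List[float], List[float], float, List[float], bool]


def system_cost(delay_s: float, energy_j: float, delay_weight: float = 0.5) -> float:
    """Hypothetical cost: weighted sum of delay and energy (weighting is assumed)."""
    if not 0.0 <= delay_weight <= 1.0:
        raise ValueError("delay_weight must lie in [0, 1]")
    return delay_weight * delay_s + (1.0 - delay_weight) * energy_j


class ReplayBuffer:
    """Fixed-capacity FIFO store of transitions, sampled uniformly at random."""

    def __init__(self, capacity: int, seed: int = 0) -> None:
        # deque(maxlen=...) silently evicts the oldest transition when full.
        self._buffer: Deque[Transition] = deque(maxlen=capacity)
        self._rng = random.Random(seed)

    def push(self, transition: Transition) -> None:
        self._buffer.append(transition)

    def sample(self, batch_size: int) -> List[Transition]:
        # Uniform sampling without replacement breaks the temporal correlation
        # between consecutive transitions, which is what stabilizes training.
        return self._rng.sample(list(self._buffer), batch_size)

    def __len__(self) -> int:
        return len(self._buffer)


def soft_update(target: List[float], online: List[float], tau: float = 0.005) -> List[float]:
    """Move target parameters a fraction tau toward the online parameters."""
    return [tau * o + (1.0 - tau) * t for t, o in zip(target, online)]
```

In a training loop built on this sketch, the agent would push one transition per offloading decision, with the reward taken as the negative of `system_cost`, and would start sampling mini-batches once the buffer holds at least one batch of transitions.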


Pages: 13

References: 40 in total

