Task offloading strategy and scheduling optimization for internet of vehicles based on deep reinforcement learning

被引:10
作者
Zhao, Xu [1 ]
Liu, Mingzhen [2 ]
Li, Maozhen [3 ]
机构
[1] Xian Polytech Univ, Sch Elect & Informat, Xian 710048, Peoples R China
[2] Xian Polytech Univ, Sch Comp Sci, Xian 710048, Peoples R China
[3] Brunel Univ London, Dept Elect & Elect Engn, Uxbridge UB8 3PH, England
基金
中国国家自然科学基金;
关键词
Deep reinforcement learning; Internet of vehicles; Mobile edge computing; Scheduling optimization;
D O I
10.1016/j.adhoc.2023.103193
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Driven by the construction of smart cities, networks and communication technologies are gradually infiltrating into the Internet of Things (IoT) applications in urban infrastructure, such as automatic driving. In the Internet of Vehicles (IoV) environment, intelligent vehicles will generate a lot of data. However, the limited computing power of in-vehicle terminals cannot meet the demand. To solve this problem, we first simulate the task offloading model of vehicle terminal in Mobile Edge Computing (MEC) environment. Secondly, according to the model, we design and implement a MEC server collaboration scheme considering both delay and energy consumption. Thirdly, based on the optimization theory, the system optimization solution is formulated with the goal of minimizing system cost. Because the problem to be resolved is a mixed binary nonlinear programming problem, we model the problem as a Markov Decision Process (MDP). The original resource allocation decision is turned into a Reinforcement Learning (RL) problem. In order to achieve the optimal solution, the Deep Reinforcement Learning (DRL) method is used. Finally, we propose a Deep Deterministic Policy Gradient (DDPG) algorithm to deal with task offloading and scheduling optimization in high-dimensional continuous action space, and the experience replay mechanism is used to accelerate the convergence and enhance the stability of the network. The simulation results show that our scheme has good performance optimization in terms of convergence, system delay, average task energy consumption and system cost. For example, compared with the comparison algorithm, the system cost performance has improved by 9.12% under different task sizes, which indicates that our scheme is more suitable for highly dynamic Internet of Vehicles environment.
引用
收藏
页数:13
相关论文
共 40 条
  • [1] Natural actor-critic algorithms
    Bhatnagar, Shalabh
    Sutton, Richard S.
    Ghavamzadeh, Mohammad
    Lee, Mark
    [J]. AUTOMATICA, 2009, 45 (11) : 2471 - 2482
  • [2] Revisiting Computation Partitioning in Future 5G-Based Edge Computing Environments
    Cao, Jin
    Yang, Lei
    Cao, Jiannong
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (02) : 2427 - 2438
  • [3] Optimized Computation Offloading Performance in Virtual Edge Computing Systems via Deep Reinforcement Learning
    Chen, Xianfu
    Zhang, Honggang
    Wu, Celimuge
    Mao, Shiwen
    Ji, Yusheng
    Bennis, Mehdi
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (03): : 4005 - 4018
  • [4] Efficient Multi-User Computation Offloading for Mobile-Edge Cloud Computing
    Chen, Xu
    Jiao, Lei
    Li, Wenzhong
    Fu, Xiaoming
    [J]. IEEE-ACM TRANSACTIONS ON NETWORKING, 2016, 24 (05) : 2827 - 2840
  • [5] Space/Aerial-Assisted Computing Offloading for IoT Applications: A Learning-Based Approach
    Cheng, Nan
    Lyu, Feng
    Quan, Wei
    Zhou, Conghao
    He, Hongli
    Shi, Weisen
    Shen, Xuemin
    [J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2019, 37 (05) : 1117 - 1129
  • [6] Ertam F, 2017, 2017 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), P755, DOI 10.1109/UBMK.2017.8093521
  • [7] Service Characteristics-Oriented Joint Optimization of Radio and Computing Resource Allocation in Mobile-Edge Computing
    Feng, Jie
    Liu, Lei
    Pei, Qingqi
    Hou, Fen
    Yang, Tingting
    Wu, Jinsong
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (11): : 9407 - 9421
  • [8] Gao J., 2018, IEEE T VEH TECHNOL, V67, p12 288
  • [9] A Survey of Actor-Critic Reinforcement Learning: Standard and Natural Policy Gradients
    Grondman, Ivo
    Busoniu, Lucian
    Lopes, Gabriel A. D.
    Babuska, Robert
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2012, 42 (06): : 1291 - 1307
  • [10] Incentive-Driven Task Allocation for Collaborative Edge Computing in Industrial Internet of Things
    Hou, Wenjing
    Wen, Hong
    Zhang, Ning
    Wu, Jinsong
    Lei, Wenxin
    Zhao, Runhui
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (01) : 706 - 718