Research on deep reinforcement learning in Internet of vehicles edge computing based on Quasi-Newton method

Cited by: 0
Authors
Zhang, Jianwu [1]
Lu, Zetao [1]
Zhang, Qianhua [2, 3]
Zhan, Ming [4]
Affiliations
[1] School of Communication Engineering, Hangzhou Dianzi University, Hangzhou
[2] Research Center for Space Computing System, Zhejiang Lab, Hangzhou
[3] College of Information Science and Electronic Engineering, Zhejiang University, Hangzhou
[4] College of Electronic and Information Engineering, Taizhou University, Taizhou
Source
Tongxin Xuebao/Journal on Communications | 2024, Vol. 45, No. 5
Keywords
deep reinforcement learning; Internet of vehicles; quasi-Newton method; task offloading
DOI
10.11959/j.issn.1000-436x.2024101
Abstract
To address ineffective task offloading decisions caused by multitasking and resource constraints in vehicular networks, a quasi-Newton deep reinforcement learning two-stage online offloading (QNRLO) algorithm is proposed. The algorithm first applies batch normalization to improve the training of the deep neural network, then uses the quasi-Newton method to approximate the optimal solution effectively. This two-stage optimization significantly improves performance and computational efficiency under multitasking and dynamic wireless channels. By introducing Lagrange multipliers and a reconstructed dual function, the non-convex optimization problem is transformed into a convex optimization problem over the dual function, ensuring the global optimality of the algorithm. The vehicular network model also accounts for system transmission time allocation, which makes the algorithm more practical. Compared with existing algorithms, the proposed algorithm significantly improves the convergence and stability of task offloading, handles task offloading in vehicular networks effectively, and offers high practicality and reliability. © 2024 Editorial Board of Journal on Communications. All rights reserved.
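The abstract does not give the update rule, so the following is only a minimal sketch of what a quasi-Newton step on a convex objective can look like, using the standard BFGS inverse-Hessian update with a backtracking line search. The function names (bfgs_minimize, toy_dual, toy_dual_grad) and the quadratic objective are hypothetical stand-ins chosen for illustration; they are not the QNRLO algorithm or the paper's dual function.

```python
import math

def bfgs_minimize(f, grad, x0, tol=1e-8, max_iter=200):
    """Minimize a smooth convex function with BFGS (illustrative sketch)."""
    n = len(x0)
    x = list(x0)
    # H approximates the inverse Hessian; start from the identity matrix.
    H = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    g = grad(x)
    for _ in range(max_iter):
        if math.sqrt(sum(gi * gi for gi in g)) < tol:
            break
        # Quasi-Newton search direction p = -H g.
        p = [-sum(H[i][j] * g[j] for j in range(n)) for i in range(n)]
        # Backtracking line search with the Armijo sufficient-decrease test.
        slope = sum(gi * pi for gi, pi in zip(g, p))
        step, fx = 1.0, f(x)
        while (step > 1e-12 and
               f([xi + step * pi for xi, pi in zip(x, p)]) > fx + 1e-4 * step * slope):
            step *= 0.5
        x_new = [xi + step * pi for xi, pi in zip(x, p)]
        g_new = grad(x_new)
        s = [a - b for a, b in zip(x_new, x)]   # change in the iterate
        y = [a - b for a, b in zip(g_new, g)]   # change in the gradient
        sy = sum(si * yi for si, yi in zip(s, y))
        if sy > 1e-12:  # curvature condition keeps H positive definite
            rho = 1.0 / sy
            Hy = [sum(H[i][j] * y[j] for j in range(n)) for i in range(n)]
            yHy = sum(yi * hyi for yi, hyi in zip(y, Hy))
            # BFGS inverse update:
            # H += rho * (1 + rho * y^T H y) * s s^T - rho * (Hy s^T + s Hy^T)
            for i in range(n):
                for j in range(n):
                    H[i][j] += (rho * (1.0 + rho * yHy) * s[i] * s[j]
                                - rho * (Hy[i] * s[j] + s[i] * Hy[j]))
        x, g = x_new, g_new
    return x

# Hypothetical convex stand-in for a dual function (not taken from the paper).
def toy_dual(x):
    return (x[0] - 1.0) ** 2 + 2.0 * (x[1] + 0.5) ** 2 + x[0] * x[1]

def toy_dual_grad(x):
    return [2.0 * (x[0] - 1.0) + x[1], 4.0 * (x[1] + 0.5) + x[0]]

x_star = bfgs_minimize(toy_dual, toy_dual_grad, [0.0, 0.0])
```

Setting the gradient of this toy objective to zero gives the minimizer (10/7, -6/7), which x_star should approach. The point of the sketch is that only gradients are needed: the curvature information is built up from successive gradient differences rather than from an explicit Hessian.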
Pages: 90-100
Page count: 10