Computation Offloading in Heterogeneous Vehicular Edge Networks: On-Line and Off-Policy Bandit Solutions

被引:58
作者
Bozorgchenani, Arash [1 ,2 ]
Maghsudi, Setareh [3 ,4 ]
Tarchi, Daniele [1 ]
Hossain, Ekram [5 ]
机构
[1] Univ Bologna, Dept Elect Elect & Informat Engn, I-40126 Bologna, Italy
[2] Univ Lancaster, Dept Comp & Commun, Lancaster LA1 4YW, England
[3] Univ Tiibingen, Dept Comp Sci, D-72074 Tubingen, Germany
[4] Fraunhofer Heinrich Hertz Inst, D-10587 Berlin, Germany
[5] Univ Manitoba, Dept Elect & Comp Engn, Winnipeg, MB R3T 2N2, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Task analysis; Edge computing; Servers; Base stations; Heuristic algorithms; Delays; Vehicle dynamics; Vehicular edge computing (VEC); computation offloading; heterogeneous networks; off-policy learning; on-line learning; bandit theory; MULTIARMED BANDIT; RESOURCE-ALLOCATION; CLOUD; SYSTEMS;
D O I
10.1109/TMC.2021.3082927
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the rapid advancement of intelligent transportation systems (ITS) and vehicular communications, vehicular edge computing (VEC) is emerging as a promising technology to support low-latency ITS applications and services. In this paper, we consider the computation offloading problem from mobile vehicles/users in a heterogeneous VEC scenario, and focus on the network- and base station selection problems, where different networks have different traffic loads. In a fast-varying vehicular environment, computation offloading experience of users is strongly affected by the latency due to the congestion at the edge computing servers co-located with the base stations. However, as a result of the non-stationary property of such an environment and also information shortage, predicting this congestion is an involved task. To address this challenge, we propose an on-line learning algorithm and an off-policy learning algorithm based on multi-armed bandit theory. To dynamically select the least congested network in a piece-wise stationary environment, these algorithms predict the latency that the offloaded tasks experience using the offloading history. In addition, to minimize the task loss due to the mobility of the vehicles, we develop a method for base station selection. Moreover, we propose a relaying mechanism for the selected network, which operates based on the sojourn time of the vehicles. Through intensive numerical analysis, we demonstrate that the proposed learning-based solutions adapt to the traffic changes of the network by selecting the least congested network, thereby reducing the latency of offloaded tasks. Moreover, we demonstrate that the proposed joint base station selection and the relaying mechanism minimize the task loss in a vehicular environment.
引用
收藏
页码:4233 / 4248
页数:16
相关论文
共 56 条
[1]  
3GPP, 2010, 36814 3GPP STAND I
[2]   Vehicle as a Resource (VaaR) [J].
Abdelhamid, Sherin ;
Hassanein, Hossam S. ;
Takahara, Glen .
IEEE NETWORK, 2015, 29 (01) :12-17
[4]   Finite-time analysis of the multiarmed bandit problem [J].
Auer, P ;
Cesa-Bianchi, N ;
Fischer, P .
MACHINE LEARNING, 2002, 47 (2-3) :235-256
[5]  
Bin Gao, 2019, IEEE INFOCOM 2019 - IEEE Conference on Computer Communications, P1459, DOI 10.1109/INFOCOM.2019.8737543
[6]   VANET-CLOUD: A GENERIC CLOUD COMPUTING MODEL FOR VEHICULAR AD HOC NETWORKS [J].
Bitam, Salim ;
Mellouk, Abdelhamid ;
Zeadally, Sherali .
IEEE WIRELESS COMMUNICATIONS, 2015, 22 (01) :96-102
[7]  
Boxma O., 2007, Performance Evaluation Review, V34, P13, DOI 10.1145/1243401.1243406
[8]  
Bozorgchenani A, 2018, IEEE WCNC
[9]   Multi-Objective Computation Sharing in Energy and Delay Constrained Mobile Edge Computing Environments [J].
Bozorgchenani, Arash ;
Mashhadi, Farshad ;
Tarchi, Daniele ;
Monroy, Sergio A. Salinas .
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2021, 20 (10) :2992-3005
[10]   A Likelihood Ratio Approach to Sequential Change Point Detection for a General Class of Parameters [J].
Dette, Holger ;
Goesnnann, Josua .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2020, 115 (531) :1361-1377