Adaptive Learning-Based Task Offloading for Vehicular Edge Computing Systems

Cited by: 296

Authors:

Sun, Yuxuan [1]

Guo, Xueying [2]

Song, Jinhui [1]

Zhou, Sheng [1]

Jiang, Zhiyuan [3]

Liu, Xin [2]

Niu, Zhisheng [1]

Affiliations:

[1] Tsinghua Univ, Dept Elect Engn, Beijing Natl Res Ctr Informat Sci & Technol, Beijing 100084, Peoples R China

[2] Univ Calif Davis, Dept Comp Sci, Davis, CA 95616 USA

[3] Shanghai Univ, Shanghai Inst Adv Commun & Data Sci, Shanghai 200444, Peoples R China

Source:

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY | 2019 / Vol. 68 / No. 04

Funding:

National Key Research and Development Program of China;

Keywords:

Vehicular edge computing; task offloading; online learning; multi-armed bandit; CLOUD; VEHICLE;

DOI:

10.1109/TVT.2019.2895593

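Chinese Library Classification (CLC) number:

TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];

Discipline classification code:

0808 ; 0809 ;

Abstract: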

The vehicular edge computing system integrates the computing resources of vehicles and provides computing services for other vehicles and pedestrians through task offloading. However, the vehicular task offloading environment is dynamic and uncertain, with fast-varying network topologies, wireless channel states, and computing workloads. These uncertainties bring extra challenges to task offloading. In this paper, we consider task offloading among vehicles and propose a solution that enables vehicles to learn the offloading delay performance of their neighboring vehicles while offloading computation tasks. We design an adaptive learning-based task offloading (ALTO) algorithm based on multi-armed bandit theory to minimize the average offloading delay. ALTO works in a distributed manner without requiring frequent state exchange, and is augmented with input-awareness and occurrence-awareness to adapt to the dynamic environment. The proposed algorithm is proved to have a sublinear learning regret. Extensive simulations are carried out under both a synthetic scenario and a realistic highway scenario, and the results show that the proposed algorithm achieves low delay and decreases the average delay by up to 30% compared with the existing upper confidence bound based learning algorithm.
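To make the bandit framing in the abstract concrete, the following is a minimal sketch of a generic UCB1-style selection rule (in the sense of Auer et al., 2002) applied to picking a neighboring vehicle by observed delay. It is not the ALTO algorithm and includes neither input-awareness nor occurrence-awareness. The function name `ucb_offload`, the clipped Gaussian delay model, and all parameter values are hypothetical and chosen only for illustration.

```python
import math
import random


def ucb_offload(mean_delays, rounds, seed=0):
    """Choose one candidate vehicle per round with a UCB1-style rule on delay.

    mean_delays: hypothetical true mean delays, unknown to the learner.
    Returns (selection counts per candidate, average observed delay).
    """
    rng = random.Random(seed)
    n = len(mean_delays)
    counts = [0] * n      # how often each candidate was selected
    avg = [0.0] * n       # empirical mean delay per candidate
    total = 0.0
    for t in range(1, rounds + 1):
        if t <= n:
            k = t - 1     # initialization: try each candidate once
        else:
            # Delay is a cost, so pick the smallest lower confidence bound.
            k = min(
                range(n),
                key=lambda i: avg[i] - math.sqrt(2 * math.log(t) / counts[i]),
            )
        # Assumed delay model: Gaussian noise around the mean, clipped to [0, 1].
        delay = min(1.0, max(0.0, rng.gauss(mean_delays[k], 0.1)))
        counts[k] += 1
        avg[k] += (delay - avg[k]) / counts[k]   # incremental mean update
        total += delay
    return counts, total / rounds
```

Calling `ucb_offload([0.2, 0.5, 0.8], rounds=3000)` returns the per-candidate selection counts and the average delay over the run. Under these assumptions the learner needs no state exchange with the candidates: it relies only on the delays it observes itself, which is the property the abstract highlights for distributed operation.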


Pages: 3061-3074

Page count: 14
