Meta Reinforcement Learning for Multi-Task Offloading in Vehicular Edge Computing

被引：24

作者：

Dai, Penglin ^{[1
,2
,3
]}

Huang, Yaorong ^{[1
,2
,3
]}

Hu, Kaiwen ^{[1
,2
,3
]}

Wu, Xiao ^{[1
,2
,3
]}

Xing, Huanlai ^{[1
,2
,3
]}

Yu, Zhaofei ^{[4
]}

机构：

[1] Southwest Jiaotong Univ, Sch Comp & Artificial Intelligence, Chengdu 611756, Peoples R China

[2] Minist Educ, Engn Res Ctr Sustainable Urban Intelligent Transpo, Chengdu 611756, Peoples R China

[3] Southwest Jiaotong Univ, Tangshan Inst, Tangshan 063000, Peoples R China

[4] Peking Univ, Inst Artificial Intelligence, Beijing 100871, Peoples R China

来源：

IEEE TRANSACTIONS ON MOBILE COMPUTING | 2024年 / 23卷 / 03期

基金：

中国国家自然科学基金;

关键词：

Task analysis; Servers; Heuristic algorithms; Computational modeling; Computer architecture; Delays; Vehicle dynamics; Meta reinforcement learning; multi-task offloading; vehicular edge computing; multi-task dependency; Seq2seq model; MEC;

D O I：

10.1109/TMC.2023.3247579

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Mobile edge computing has been a promising solution to enable real-time service in vehicular networks. However, due to high dynamics of mobile environment and heterogeneous features of vehicular services, traditional expert-based or learning-based strategies has to update handcrafted parameters or retrain learning model, which leads to intolerant overhead. Therefore, this paper investigates the problem of multi-task offloading (MTO), where there exist multiple offloading scenarios with varying parameters, such as task topology, resource requirement and transmission/computation capability. The objective is to design a unified solution to minimize task execution time under different MTO scenarios. Accordingly, we develop a Seq2seq-based Meta Reinforcement Learning algorithm for MTO (SMRL-MTO). Specifically, a bidirectional gated recurrent units integrated with attention mechanism is designed to determine offloading action by encoding sequential offloading actions and showing different preferences to different parts of input sequence. Particularly, a meta reinforcement learning framework is designed based on model-agnostic meta learning, which trains a meta policy offline and fast adapts to new MTO scenario within a few training steps. Finally, we conduct performance evaluation based on task generator DAGGEN and realistic vehicular traces, which shows that the SMRL-MTO reduces task execution time by 11.36% on average compared with greedy algorithm.

引用

页码：2123 / 2138

页数：16

共 43 条

[1] Optimal Distribution of Workloads in Cloud-Fog Architecture in Intelligent Vehicular Networks [J].

Abbasi, Mahdi ;

Yaghoobikia, Mina ;

Rafiee, Milad ;

Khosravi, Mohammad R. ;

Menon, Varun G. .

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (07) :4706-4715

[2] Collision-Free Sequential Task Offloading for Mobile Edge Computing [J].

Al-Habob, Ahmed A. ;

Ibrahim, Ahmed ;

Dobre, Octavia A. ;

Garcia Armada, Ana .

IEEE COMMUNICATIONS LETTERS, 2020, 24 (01) :71-75

[3]

[Anonymous], 2023, Data source: Vehicle trajectory

[4] Vehicular Cloud Computing through Dynamic Computation Offloading [J].

Ashok, Ashwin ;

Steenkiste, Peter ;

Bai, Fan .

COMPUTER COMMUNICATIONS, 2018, 120 :125-137

[5]

Bahdanau D, 2016, Arxiv, DOI [arXiv:1409.0473, 10.48550/arXiv.1409.0473, DOI 10.48550/ARXIV.1409.0473]

[6] Edge Intelligence for Adaptive Multimedia Streaming in Heterogeneous Internet of Vehicles [J].

Dai, Penglin ;

Song, Feng ;

Liu, Kai ;

Dai, Yueyue ;

Zhou, Pan ;

Guo, Songtao .

IEEE TRANSACTIONS ON MOBILE COMPUTING, 2023, 22 (03) :1464-1478

[7] Asynchronous Deep Reinforcement Learning for Data-Driven Task Offloading in MEC-Empowered Vehicular Networks [J].

Dai, Penglin ;

Hu, Kaiwen ;

Wu, Xiao ;

Xing, Huanlai ;

Yu, Zhaofei .

IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (IEEE INFOCOM 2021), 2021,

[8] A Probabilistic Approach for Cooperative Computation Offloading in MEC-Assisted Vehicular Networks [J].

Dai, Penglin ;

Hu, Kaiwen ;

Wu, Xiao ;

Xing, Huanlai ;

Teng, Fei ;

Yu, Zhaofei .

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (02) :899-911

[9]

Duan Yan, 2017, RL^2: Fast reinforcement learning via slow reinforcement learning

[10] Joint Task Offloading and Resource Allocation for Vehicular Edge Computing Based on V2I and V2V Modes [J].

Fan, Wenhao ;

Su, Yi ;

Liu, Jie ;

Li, Shenmeng ;

Huang, Wei ;

Wu, Fan ;

Liu, Yuan'an .

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (04) :4277-4292

← 1 2 3 4 5 →