Meta Reinforcement Learning for Multi-Task Offloading in Vehicular Edge Computing

被引:24
作者
Dai, Penglin [1 ,2 ,3 ]
Huang, Yaorong [1 ,2 ,3 ]
Hu, Kaiwen [1 ,2 ,3 ]
Wu, Xiao [1 ,2 ,3 ]
Xing, Huanlai [1 ,2 ,3 ]
Yu, Zhaofei [4 ]
机构
[1] Southwest Jiaotong Univ, Sch Comp & Artificial Intelligence, Chengdu 611756, Peoples R China
[2] Minist Educ, Engn Res Ctr Sustainable Urban Intelligent Transpo, Chengdu 611756, Peoples R China
[3] Southwest Jiaotong Univ, Tangshan Inst, Tangshan 063000, Peoples R China
[4] Peking Univ, Inst Artificial Intelligence, Beijing 100871, Peoples R China
基金
中国国家自然科学基金;
关键词
Task analysis; Servers; Heuristic algorithms; Computational modeling; Computer architecture; Delays; Vehicle dynamics; Meta reinforcement learning; multi-task offloading; vehicular edge computing; multi-task dependency; Seq2seq model; MEC;
D O I
10.1109/TMC.2023.3247579
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Mobile edge computing has been a promising solution to enable real-time service in vehicular networks. However, due to high dynamics of mobile environment and heterogeneous features of vehicular services, traditional expert-based or learning-based strategies has to update handcrafted parameters or retrain learning model, which leads to intolerant overhead. Therefore, this paper investigates the problem of multi-task offloading (MTO), where there exist multiple offloading scenarios with varying parameters, such as task topology, resource requirement and transmission/computation capability. The objective is to design a unified solution to minimize task execution time under different MTO scenarios. Accordingly, we develop a Seq2seq-based Meta Reinforcement Learning algorithm for MTO (SMRL-MTO). Specifically, a bidirectional gated recurrent units integrated with attention mechanism is designed to determine offloading action by encoding sequential offloading actions and showing different preferences to different parts of input sequence. Particularly, a meta reinforcement learning framework is designed based on model-agnostic meta learning, which trains a meta policy offline and fast adapts to new MTO scenario within a few training steps. Finally, we conduct performance evaluation based on task generator DAGGEN and realistic vehicular traces, which shows that the SMRL-MTO reduces task execution time by 11.36% on average compared with greedy algorithm.
引用
收藏
页码:2123 / 2138
页数:16
相关论文
共 43 条
[1]   Optimal Distribution of Workloads in Cloud-Fog Architecture in Intelligent Vehicular Networks [J].
Abbasi, Mahdi ;
Yaghoobikia, Mina ;
Rafiee, Milad ;
Khosravi, Mohammad R. ;
Menon, Varun G. .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (07) :4706-4715
[2]   Collision-Free Sequential Task Offloading for Mobile Edge Computing [J].
Al-Habob, Ahmed A. ;
Ibrahim, Ahmed ;
Dobre, Octavia A. ;
Garcia Armada, Ana .
IEEE COMMUNICATIONS LETTERS, 2020, 24 (01) :71-75
[3]  
[Anonymous], 2023, Data source: Vehicle trajectory
[4]   Vehicular Cloud Computing through Dynamic Computation Offloading [J].
Ashok, Ashwin ;
Steenkiste, Peter ;
Bai, Fan .
COMPUTER COMMUNICATIONS, 2018, 120 :125-137
[5]  
Bahdanau D, 2016, Arxiv, DOI [arXiv:1409.0473, 10.48550/arXiv.1409.0473, DOI 10.48550/ARXIV.1409.0473]
[6]   Edge Intelligence for Adaptive Multimedia Streaming in Heterogeneous Internet of Vehicles [J].
Dai, Penglin ;
Song, Feng ;
Liu, Kai ;
Dai, Yueyue ;
Zhou, Pan ;
Guo, Songtao .
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2023, 22 (03) :1464-1478
[7]   Asynchronous Deep Reinforcement Learning for Data-Driven Task Offloading in MEC-Empowered Vehicular Networks [J].
Dai, Penglin ;
Hu, Kaiwen ;
Wu, Xiao ;
Xing, Huanlai ;
Yu, Zhaofei .
IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (IEEE INFOCOM 2021), 2021,
[8]   A Probabilistic Approach for Cooperative Computation Offloading in MEC-Assisted Vehicular Networks [J].
Dai, Penglin ;
Hu, Kaiwen ;
Wu, Xiao ;
Xing, Huanlai ;
Teng, Fei ;
Yu, Zhaofei .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (02) :899-911
[9]  
Duan Yan, 2017, RL^2: Fast reinforcement learning via slow reinforcement learning
[10]   Joint Task Offloading and Resource Allocation for Vehicular Edge Computing Based on V2I and V2V Modes [J].
Fan, Wenhao ;
Su, Yi ;
Liu, Jie ;
Li, Shenmeng ;
Huang, Wei ;
Wu, Fan ;
Liu, Yuan'an .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (04) :4277-4292