共 55 条
Fast Adaptive Task Offloading and Resource Allocation in Large-Scale MEC Systems via Multiagent Graph Reinforcement Learning
被引:12
作者:
Gao, Zhen
[1
]
Yang, Lei
[1
]
Dai, Yu
[2
]
机构:
[1] Northeastern Univ, Sch Comp Sci & Engn, Shenyang 110167, Peoples R China
[2] Northeastern Univ, Sch Coll Software, Shenyang 110167, Peoples R China
关键词:
Task analysis;
Resource management;
Heuristic algorithms;
Internet of Things;
Complexity theory;
Training;
Observability;
Multiaccess edge computing (MEC);
multiagent reinforcement learning (MARL);
recurrent graph neural networks (R-GNNs);
resources allocation;
task offloading;
EDGE;
MANAGEMENT;
D O I:
10.1109/JIOT.2023.3285950
中图分类号:
TP [自动化技术、计算机技术];
学科分类号:
0812 ;
摘要:
In multiaccess edge computing (MEC), when many mobile devices (MDs) offload their tasks to an edge server (ES), its resources might become constrained. These tasks may take a long time to complete or even be thrown away. Since the unknown information of both the ESs and other MDs, it is difficult for each MD to determine its offloading policy independently. Furthermore, most offloading methods have poor generalization to new environment since they focus on model architecture with a fixed quantity of MDs and ESs, preventing trained models from transferring to other environments. In this article, we provide a full decentralized offloading scheme based on the curriculum attention-weighted graph recurrent network-based multiagent actor-critic (CAGR-MAAC). First, we build MEC as a shared MD agents-ESs graph and an AGR-based message network is designed to enable each MD aggregate the information of ESs and other MDs and solve the partial observability of MD agents for MEC system. Second, a learnable differentiable encoder network is introduced to construct MD agent's local information encoding. Subsequently, the MD agent converts overall the information regarding the MEC system into a fixed-size embedding via an AGR Network to handle different quantity of MDs and ESs. Finally, we introduce curriculum learning to address the huge complexity of the MEC system and the training difficulties induced by the large amounts of MDs and ESs. Experiments demonstrate that compared with existing algorithms, CAGR-MAAC boosts task completion rates and decreases system costs by 13.01%-15.03% and 16.45%-18.56%, and can quickly adapt to the new environment.
引用
收藏
页码:758 / 776
页数:19
相关论文