Multi-Agent Deep Reinforcement Learning Based Dynamic Task Offloading in a Device-to-Device Mobile-Edge Computing Network to Minimize Average Task Delay with Deadline Constraints

被引:2
|
作者
He, Huaiwen [1 ]
Yang, Xiangdong [1 ,2 ]
Mi, Xin [1 ,2 ]
Shen, Hong [3 ]
Liao, Xuefeng [4 ]
机构
[1] Univ Elect Sci & Technol China, Sch Comp, Zhongshan Inst, Zhongshan 528400, Peoples R China
[2] Univ Elect Sci & Technol China, Comp Sci & Engn Sch, Chengdu 611731, Peoples R China
[3] Cent Queensland Univ, Engn & Technol, Rockhampton 4701, Australia
[4] Wenzhou Univ Technol, Sch Data Sci & Artificial Intelligence, Wenzhou 325027, Peoples R China
关键词
mobile edge computing; dynamic matching; D2D; delay constraint; multi-agent reinforcement learning; RESOURCE-ALLOCATION; MEC; D2D;
D O I
10.3390/s24165141
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Device-to-device (D2D) is a pivotal technology in the next generation of communication, allowing for direct task offloading between mobile devices (MDs) to improve the efficient utilization of idle resources. This paper proposes a novel algorithm for dynamic task offloading between the active MDs and the idle MDs in a D2D-MEC (mobile edge computing) system by deploying multi-agent deep reinforcement learning (DRL) to minimize the long-term average delay of delay-sensitive tasks under deadline constraints. Our core innovation is a dynamic partitioning scheme for idle and active devices in the D2D-MEC system, accounting for stochastic task arrivals and multi-time-slot task execution, which has been insufficiently explored in the existing literature. We adopt a queue-based system to formulate a dynamic task offloading optimization problem. To address the challenges of large action space and the coupling of actions across time slots, we model the problem as a Markov decision process (MDP) and perform multi-agent DRL through multi-agent proximal policy optimization (MAPPO). We employ a centralized training with decentralized execution (CTDE) framework to enable each MD to make offloading decisions solely based on its local system state. Extensive simulations demonstrate the efficiency and fast convergence of our algorithm. In comparison to the existing sub-optimal results deploying single-agent DRL, our algorithm reduces the average task completion delay by 11.0% and the ratio of dropped tasks by 17.0%. Our proposed algorithm is particularly pertinent to sensor networks, where mobile devices equipped with sensors generate a substantial volume of data that requires timely processing to ensure quality of experience (QoE) and meet the service-level agreements (SLAs) of delay-sensitive applications.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] A Deep Reinforcement Learning based Mobile Device Task Offloading Algorithm in MEC
    Li, Yang
    Shi, Bing
    2020 IEEE INTL SYMP ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, INTL CONF ON BIG DATA & CLOUD COMPUTING, INTL SYMP SOCIAL COMPUTING & NETWORKING, INTL CONF ON SUSTAINABLE COMPUTING & COMMUNICATIONS (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2020), 2020, : 200 - 207
  • [22] Hierarchical Task Offloading for Vehicular Fog Computing Based on Multi-Agent Deep Reinforcement Learning
    Hou, Yukai
    Wei, Zhiwei
    Zhang, Rongqing
    Cheng, Xiang
    Yang, Liuqing
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (04) : 3074 - 3085
  • [23] A Multi-Agent Reinforcement Learning-Based Task-Offloading Strategy in a Blockchain-Enabled Edge Computing Network
    Liu, Chenlei
    Sun, Zhixin
    MATHEMATICS, 2024, 12 (14)
  • [24] Dynamic Offloading Strategy for Delay-Sensitive Task in Mobile-Edge Computing Networks
    Ai, Lihua
    Tan, Bin
    Zhang, Jiadi
    Wang, Rui
    Wu, Jun
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (01) : 526 - 538
  • [25] Optimization of lightweight task offloading strategy for mobile edge computing based on deep reinforcement learning
    Lu, Haifeng
    Gu, Chunhua
    Luo, Fei
    Ding, Weichao
    Liu, Xinping
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2020, 102 : 847 - 861
  • [26] Task offloading and resource allocation for multi-UAV asset edge computing with multi-agent deep reinforcement learning
    Samah A. Zakaryia
    Mohamed Meaad
    Tamer Nabil
    Mohamed K. Hussein
    Computing, 2025, 107 (5)
  • [27] Multi-Agent Deep Reinforcement Learning for Efficient Computation Offloading in Mobile Edge Computing
    Jiao, Tianzhe
    Feng, Xiaoyue
    Guo, Chaopeng
    Wang, Dongqi
    Song, Jie
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 76 (03): : 3585 - 3603
  • [28] Dynamic task offloading for Internet of Things in mobile edge computing via deep reinforcement learning
    Chen, Ying
    Gu, Wei
    Li, Kaixin
    INTERNATIONAL JOURNAL OF COMMUNICATION SYSTEMS, 2022,
  • [29] Task offloading mechanism based on federated reinforcement learning in mobile edge computing
    Li, Jie
    Yang, Zhiping
    Wang, Xingwei
    Xia, Yichao
    Ni, Shijian
    DIGITAL COMMUNICATIONS AND NETWORKS, 2023, 9 (02) : 492 - 504
  • [30] Vehicle Speed Aware Computing Task Offloading and Resource Allocation Based on Multi-Agent Reinforcement Learning in a Vehicular Edge Computing Network
    Huang, Xinyu
    He, Lijun
    Zhang, Wanyue
    2020 IEEE INTERNATIONAL CONFERENCE ON EDGE COMPUTING (EDGE 2020), 2020, : 1 - 8