Multi-Agent Deep Reinforcement Learning Based Dynamic Task Offloading in a Device-to-Device Mobile-Edge Computing Network to Minimize Average Task Delay with Deadline Constraints

被引:2
|
作者
He, Huaiwen [1 ]
Yang, Xiangdong [1 ,2 ]
Mi, Xin [1 ,2 ]
Shen, Hong [3 ]
Liao, Xuefeng [4 ]
机构
[1] Univ Elect Sci & Technol China, Sch Comp, Zhongshan Inst, Zhongshan 528400, Peoples R China
[2] Univ Elect Sci & Technol China, Comp Sci & Engn Sch, Chengdu 611731, Peoples R China
[3] Cent Queensland Univ, Engn & Technol, Rockhampton 4701, Australia
[4] Wenzhou Univ Technol, Sch Data Sci & Artificial Intelligence, Wenzhou 325027, Peoples R China
关键词
mobile edge computing; dynamic matching; D2D; delay constraint; multi-agent reinforcement learning; RESOURCE-ALLOCATION; MEC; D2D;
D O I
10.3390/s24165141
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Device-to-device (D2D) is a pivotal technology in the next generation of communication, allowing for direct task offloading between mobile devices (MDs) to improve the efficient utilization of idle resources. This paper proposes a novel algorithm for dynamic task offloading between the active MDs and the idle MDs in a D2D-MEC (mobile edge computing) system by deploying multi-agent deep reinforcement learning (DRL) to minimize the long-term average delay of delay-sensitive tasks under deadline constraints. Our core innovation is a dynamic partitioning scheme for idle and active devices in the D2D-MEC system, accounting for stochastic task arrivals and multi-time-slot task execution, which has been insufficiently explored in the existing literature. We adopt a queue-based system to formulate a dynamic task offloading optimization problem. To address the challenges of large action space and the coupling of actions across time slots, we model the problem as a Markov decision process (MDP) and perform multi-agent DRL through multi-agent proximal policy optimization (MAPPO). We employ a centralized training with decentralized execution (CTDE) framework to enable each MD to make offloading decisions solely based on its local system state. Extensive simulations demonstrate the efficiency and fast convergence of our algorithm. In comparison to the existing sub-optimal results deploying single-agent DRL, our algorithm reduces the average task completion delay by 11.0% and the ratio of dropped tasks by 17.0%. Our proposed algorithm is particularly pertinent to sensor networks, where mobile devices equipped with sensors generate a substantial volume of data that requires timely processing to ensure quality of experience (QoE) and meet the service-level agreements (SLAs) of delay-sensitive applications.
引用
收藏
页数:19
相关论文
共 50 条
  • [41] Secure Task Offloading in Blockchain-Enabled Mobile Edge Computing With Deep Reinforcement Learning
    Samy, Ahmed
    Elgendy, Ibrahim A.
    Yu, Haining
    Zhang, Weizhe
    Zhang, Hongli
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2022, 19 (04): : 4872 - 4887
  • [42] A Delay-Optimal Task Scheduling Strategy for Vehicle Edge Computing Based on the Multi-Agent Deep Reinforcement Learning Approach
    Nie, Xuefang
    Yan, Yunhui
    Zhou, Tianqing
    Chen, Xingbang
    Zhang, Dingding
    ELECTRONICS, 2023, 12 (07)
  • [43] Graph Convolutional Network Augmented Deep Reinforcement Learning for Dependent Task Offloading in Mobile Edge Computing
    Mo, Chu-To
    Chen, Jia-Hong
    Liao, Wanjiun
    2023 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC, 2023,
  • [44] Optimizing Mobile Edge Computing Multi-Level Task Offloading via Deep Reinforcement Learning
    Yan, Peizhi
    Choudhury, Salimur
    ICC 2020 - 2020 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2020,
  • [45] Deep Reinforcement Learning Based Task Offloading Strategy Under Dynamic Pricing in Edge Computing
    Shi, Bing
    Chen, Feiyang
    Tang, Xing
    SERVICE-ORIENTED COMPUTING (ICSOC 2021), 2021, 13121 : 578 - 594
  • [46] A Multi-Agent Deep Reinforcement Learning Approach for Computation Offloading in 5G Mobile Edge Computing
    Gan, Zhaoyu
    Lin, Rongheng
    Zou, Hua
    2022 22ND IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2022), 2022, : 645 - 654
  • [47] Deep Reinforcement Learning and Markov Decision Problem for Task Offloading in Mobile Edge Computing
    Gao, Xiaohu
    Ang, Mei Choo
    Althubiti, Sara A.
    JOURNAL OF GRID COMPUTING, 2023, 21 (04)
  • [48] Deep Reinforcement Learning and Markov Decision Problem for Task Offloading in Mobile Edge Computing
    Xiaohu Gao
    Mei Choo Ang
    Sara A. Althubiti
    Journal of Grid Computing, 2023, 21
  • [49] Enhancing task offloading in vehicular networks: A multi-agent cloud-edge-device framework
    Zhang, Peiying
    Wang, Enqi
    Tan, Lizhuang
    Kumar, Neeraj
    Wang, Jian
    Liu, Kai
    VEHICULAR COMMUNICATIONS, 2025, 53
  • [50] Correlation-Based Device Energy-Efficient Dynamic Multi-Task Offloading for Mobile Edge Computing
    Zhang, Siqi
    Yi, Na
    Ma, Yi
    2021 IEEE 93RD VEHICULAR TECHNOLOGY CONFERENCE (VTC2021-SPRING), 2021,