A pipelining task offloading strategy via delay-aware multi-agent reinforcement learning in Cybertwin-enabled 6G network

被引:0
|
作者
Haiwen Niu [1 ,2 ,3 ]
Luhan Wang [1 ,2 ,3 ]
Keliang Du [1 ,2 ,3 ]
Zhaoming Lu [1 ,2 ,3 ]
Xiangming Wen [1 ,2 ,3 ]
Yu Liu [1 ,2 ,3 ]
机构
[1] Beijing Laboratory of Advanced Information Networks,Beijing Univof Posts&Telecom
[2] Beijing Key Lab of Network System Architecture and Convergence,Beijing Univof Posts&Telecom
[3] School of Information and Communication Engineering,Beijing Univof
关键词
D O I
暂无
中图分类号
TN929.5 [移动通信]; TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cybertwin-enabled 6th Generation(6G) network is envisioned to support artificial intelligence-native management to meet changing demands of 6G applications. Multi-Agent Deep Reinforcement Learning(MADRL) technologies driven by Cybertwins have been proposed for adaptive task offloading strategies. However, the existence of random transmission delay between Cybertwin-driven agents and underlying networks is not considered in related works, which destroys the standard Markov property and increases the decision reaction time to reduce the task offloading strategy performance. In order to address this problem, we propose a pipelining task offloading method to lower the decision reaction time and model it as a delay-aware Markov Decision Process(MDP). Then, we design a delay-aware MADRL algorithm to minimize the weighted sum of task execution latency and energy consumption. Firstly, the state space is augmented using the lastly-received state and historical actions to rebuild the Markov property. Secondly, Gate Transformer-XL is introduced to capture historical actions' importance and maintain the consistent input dimension dynamically changed due to random transmission delays. Thirdly, a sampling method and a new loss function with the difference between the current and target state value and the difference between real state-action value and augmented state-action value are designed to obtain state transition trajectories close to the real ones. Numerical results demonstrate that the proposed methods are effective in reducing reaction time and improving the task offloading performance in the random-delay Cybertwin-enabled 6G networks.
引用
收藏
页码:92 / 105
页数:14
相关论文
共 31 条
  • [1] Knowledge-Driven Multi-Agent Reinforcement Learning for Computation Offloading in Cybertwin-Enabled Internet of Vehicles
    Sun, Ruijin
    Yang, Xiao
    Cheng, Nan
    Wang, Xiucheng
    Li, Changle
    2023 IEEE 98TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2023-FALL, 2023,
  • [2] DACOM: Learning Delay-Aware Communication for Multi-Agent Reinforcement Learning
    Yuan, Tingting
    Chung, Hwei-Ming
    Yuan, Jie
    Fu, Xiaoming
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 10, 2023, : 11763 - 11771
  • [3] A Multi-Agent Reinforcement Learning-Based Task-Offloading Strategy in a Blockchain-Enabled Edge Computing Network
    Liu, Chenlei
    Sun, Zhixin
    MATHEMATICS, 2024, 12 (14)
  • [4] Sustainable Task Offloading in UAV Networks via Multi-Agent Reinforcement Learning
    Sacco, Alessio
    Esposito, Flavio
    Marchetto, Guido
    Montuschi, Paolo
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (05) : 5003 - 5015
  • [5] Vehicle Edge Computing Task Offloading Strategy Based on Multi-Agent Deep Reinforcement Learning
    Bo, Jianxiong
    Zhao, Xu
    JOURNAL OF GRID COMPUTING, 2025, 23 (02)
  • [6] QoS-Aware Task Offloading in Fog Environment Using Multi-agent Deep Reinforcement Learning
    Jain, Vibha
    Kumar, Bijendra
    JOURNAL OF NETWORK AND SYSTEMS MANAGEMENT, 2023, 31 (01)
  • [7] QoS-Aware Task Offloading in Fog Environment Using Multi-agent Deep Reinforcement Learning
    Vibha Jain
    Bijendra Kumar
    Journal of Network and Systems Management, 2023, 31
  • [8] Multi-UAV-enabled AoI-aware WPCN: A Multi-agent Reinforcement Learning Strategy
    Oubbati, Omar Sami
    Atiquzzaman, Mohammed
    Lakas, Abderrahmane
    Baz, Abdullah
    Alhakami, Hosam
    Alhakami, Wajdi
    IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (IEEE INFOCOM WKSHPS 2021), 2021,
  • [9] A Task Offloading and Resource Allocation Strategy Based on Multi-Agent Reinforcement Learning in Mobile Edge Computing
    Jiang, Guiwen
    Huang, Rongxi
    Bao, Zhiming
    Wang, Gaocai
    FUTURE INTERNET, 2024, 16 (09)
  • [10] Optimization of Task Offloading Strategy for Mobile Edge Computing Based on Multi-Agent Deep Reinforcement Learning
    Lu, Haifeng
    Gu, Chunhua
    Luo, Fei
    Ding, Weichao
    Zheng, Shuai
    Shen, Yifan
    IEEE ACCESS, 2020, 8 : 202573 - 202584