Toward Heterogeneous Environment: Lyapunov-Orientated ImpHetero Reinforcement Learning for Task Offloading

被引：12

作者：

Sun, Feng ^{[1
]}

Zhang, Zhenjiang ^{[1
]}

Chang, Xiaolin ^{[1
]}

Zhu, Kaige ^{[1
]}

机构：

[1] Beijing Jiaotong Univ, Sch Elect & Informat Engn, Beijing 100044, Peoples R China

来源：

IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT | 2023年 / 20卷 / 02期

基金：

中国国家自然科学基金;

关键词：

Task offloading; Lyapunov optimization; reinforcement learning; federated learning; RESOURCE-ALLOCATION; EDGE;

D O I：

10.1109/TNSM.2023.3266779

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Task offloading combined with reinforcement learning (RL) is a promising research direction in edge computing. However, the intractability in the training of RL and the heterogeneity of network devices have hindered the application of RL in large-scale networks. Moreover, traditional RL algorithms lack mechanisms to share information effectively in a heterogeneous environment, which makes it more difficult for RL algorithms to converge due to the lack of global information. This article focuses on the task offloading problem in a heterogeneous environment. First, we give a formalized representation of the Lyapunov function to normalize both data and virtual energy queue operations. Subsequently, we jointly consider the computing rate and energy consumption in task offloading and then derive the optimization target leveraging Lyapunov optimization. A Deep Deterministic Policy Gradient (DDPG)-based multiple continuous variable decision model is proposed to make the optimal offloading decision in edge computing. Considering the heterogeneous environment, we improve Hetero Federated Learning (HFL) by introducing Kullback-Leibler (KL) divergence to accelerate the convergence of our DDPG based model. Experiments demonstrate that our algorithm accelerates the search for the optimal task offloading decision in heterogeneous environment.

引用

页码：1572 / 1586

页数：15

共 43 条

[1] Adaptive Upgrade of Client Resources for Improving the Quality of Federated Learning Model [J].

AbdulRahman, Sawsan ;

Ould-Slimane, Hakima ;

Chowdhury, Rasel ;

Mourad, Azzam ;

Talhi, Chamseddine ;

Guizani, Mohsen .

IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (05) :4677-4687

[2] Delay-Aware and Energy-Efficient Computation Offloading in Mobile-Edge Computing Using Deep Reinforcement Learning [J].

Ale, Laha ;

Zhang, Ning ;

Fang, Xiaojie ;

Chen, Xianfu ;

Wu, Shaohua ;

Li, Longzhuang .

IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2021, 7 (03) :881-892

[3] ModularFed: Leveraging modularity in federated learning frameworks [J].

Arafeh, Mohamad ;

Otrok, Hadi ;

Ould-Slimane, Hakima ;

Mourad, Azzam ;

Talhi, Chamseddine ;

Damiani, Ernesto .

INTERNET OF THINGS, 2023, 22

[4] Data independent warmup scheme for non-IID federated learning [J].

Arafeh, Mohamad ;

Ould-Slimane, Hakima ;

Otrok, Hadi ;

Mourad, Azzam ;

Talhi, Chamseddine ;

Damiani, Ernesto .

INFORMATION SCIENCES, 2023, 623 :342-360

[5] A Survey on IoT Intrusion Detection: Federated Learning, Game Theory, Social Psychology, and Explainable AI as Future Directions [J].

Arisdakessian, Sarhad ;

Wahab, Omar Abdel ;

Mourad, Azzam ;

Otrok, Hadi ;

Guizani, Mohsen .

IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (05) :4059-4092

[6] FoGMatch: An Intelligent Multi-Criteria IoT-Fog Scheduling Approach Using Game Theory [J].

Arisdakessian, Sarhad ;

Wahab, Omar Abdel ;

Mourad, Azzam ;

Otrok, Hadi ;

Kara, Nadjia .

IEEE-ACM TRANSACTIONS ON NETWORKING, 2020, 28 (04) :1779-1789

[7] Lyapunov-Guided Deep Reinforcement Learning for Stable Online Computation Offloading in Mobile-Edge Computing Networks [J].

Bi, Suzhi ;

Huang, Liang ;

Wang, Hui ;

Zhang, Ying-Jun Angela .

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2021, 20 (11) :7519-7537

[8] Computation Rate Maximization for Wireless Powered Mobile-Edge Computing With Binary Computation Offloading [J].

Bi, Suzhi ;

Zhang, Ying Jun .

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2018, 17 (06) :4177-4190

[9] On the feasibility of Federated Learning towards on-demand client deployment at the edge [J].

Chahoud, Mario ;

Otoum, Safa ;

Mourad, Azzam .

INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (01)

[10]

Chang Wang, 2018, 2018 IEEE Symposium on Computers and Communications (ISCC), P00366, DOI 10.1109/ISCC.2018.8538612

← 1 2 3 4 5 →