Federated Ensemble Model-Based Reinforcement Learning in Edge Computing

被引：13

作者：

Wang, Jin ^{[1
]}

Hu, Jia ^{[1
]}

Mills, Jed ^{[1
]}

Min, Geyong ^{[1
]}

Xia, Ming ^{[2
]}

Georgalas, Nektarios ^{[3
]}

机构：

[1] Univ Exeter, Dept Comp Sci, Exeter EX4 4PY, England

[2] Google, Mountain View, CA 94043 USA

[3] British Telecommun PLC, Appl Res Dept, London EC1A 7AJ, England

来源：

IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS | 2023年 / 34卷 / 06期

基金：

欧盟地平线“2020”; 英国工程与自然科学研究理事会;

关键词：

Computational modeling; Data models; Heuristic algorithms; Training; Edge computing; Reinforcement learning; Analytical models; Deep reinforcement learning; distributed machine learning; edge computing; federated learning;

D O I：

10.1109/TPDS.2023.3264480

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Federated learning (FL) is a privacy-preserving distributed machine learning paradigm that enables collaborative training among geographically distributed and heterogeneous devices without gathering their data. Extending FL beyond the supervised learning models, federated reinforcement learning (FRL) was proposed to handle sequential decision-making problems in edge computing systems. However, the existing FRL algorithms directly combine model-free RL with FL, thus often leading to high sample complexity and lacking theoretical guarantees. To address the challenges, we propose a novel FRL algorithm that effectively incorporates model-based RL and ensemble knowledge distillation into FL for the first time. Specifically, we utilise FL and knowledge distillation to create an ensemble of dynamics models for clients, and then train the policy by solely using the ensemble model without interacting with the environment. Furthermore, we theoretically prove that the monotonic improvement of the proposed algorithm is guaranteed. The extensive experimental results demonstrate that our algorithm obtains much higher sample efficiency compared to classic model-free FRL algorithms in the challenging continuous control benchmark environments under edge computing settings. The results also highlight the significant impact of heterogeneous client data and local model update steps on the performance of FRL, validating the insights obtained from our theoretical analysis.

引用

页码：1848 / 1859

页数：12

共 50 条

[41] Mobility-Aware Cooperative Caching in Vehicular Edge Computing Based on Asynchronous Federated and Deep Reinforcement Learning
Wu, Qiong
Zhao, Yu
Fan, Qiang
Fan, Pingyi
Wang, Jiangzhou
Zhang, Cui
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2023, 17 (01) : 66 - 81
[42] Toward Communication-Efficient Federated Learning in the Internet of Things With Edge Computing
Sun, Haifeng
Li, Shiqi
Yu, F. Richard
Qi, Qi
Wang, Jingyu
Liao, Jianxin
IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (11) : 11053 - 11067
[43] FEARLESS: A Federated Reinforcement Learning Orchestrator for Serverless Edge Swarms
Sad, Christos
Masouros, Dimosthenis
Siozios, Kostas
IEEE EMBEDDED SYSTEMS LETTERS, 2025, 17 (01) : 34 - 37
[44] Differentially Private Asynchronous Federated Learning for Mobile Edge Computing in Urban Informatics
Lu, Yunlong
Huang, Xiaohong
Dai, Yueyue
Maharjan, Sabita
Zhang, Yan
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (03) : 2134 - 2143
[45] Privacy-Preserving Asynchronous Federated Learning Mechanism for Edge Network Computing
Lu, Xiaofeng
Liao, Yuying
Lio, Pietro
Hui, Pan
IEEE ACCESS, 2020, 8 : 48970 - 48981
[46] Resource-Aware Personalized Federated Learning Based on Reinforcement Learning
Wu, Tingting
Li, Xiao
Gao, Pengpei
Yu, Wei
Xin, Lun
Guo, Manxue
IEEE COMMUNICATIONS LETTERS, 2025, 29 (01) : 175 - 179
[47] PrVFL: Pruning-Aware Verifiable Federated Learning for Heterogeneous Edge Computing
Wang, Xigui
Yu, Haiyang
Chen, Yuwen
Sinnott, Richard O.
Yang, Zhen
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (12) : 15062 - 15079
[48] FedPKR: Federated Learning With Non-IID Data via Periodic Knowledge Review in Edge Computing
Wang, Jinbo
Wang, Ruijin
Xu, Guangquan
He, Donglin
Pei, Xikai
Zhang, Fengli
Gan, Jie
IEEE TRANSACTIONS ON SUSTAINABLE COMPUTING, 2024, 9 (06): : 902 - 912
[49] Adaptive layer splitting for wireless large language model inference in edge computing: a model-based reinforcement learning approach
Chen, Yuxuan
Li, Rongpeng
Yu, Xiaoxue
Zhao, Zhifeng
Zhang, Honggang
FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2025, 26 (02) : 278 - 292
[50] Hybrid Blockchain-Based Resource Trading System for Federated Learning in Edge Computing
Fan, Sizheng
Zhang, Hongbo
Zeng, Yuchen
Cai, Wei
IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (04) : 2252 - 2264

← 1 2 3 4 5 →