A Novel Resource Management Framework for Blockchain-Based Federated Learning in IoT Networks

被引：2

作者：

Mishra, Aman ^{[1
]}

Garg, Yash ^{[1
]}

Pandey, Om Jee ^{[1
]}

Shukla, Mahendra K. ^{[2
]}

Vasilakos, Athanasios V. ^{[3
]}

Hegde, Rajesh M. ^{[4
]}

机构：

[1] IIT BHU, Dept Elect Engn, Varanasi 221005, Uttar Pradesh, India

[2] ABV Indian Inst Informat Technol & Management ABV, Dept Informat Technol, Gwalior 474015, India

[3] Univ Agder, Ctr AI Res, N-4879 Grimstad, Norway

[4] Indian Inst Technol Dharwad, Dept Elect Engn, Dharwad 580011, Karnataka, India

来源：

IEEE TRANSACTIONS ON SUSTAINABLE COMPUTING | 2024年 / 9卷 / 04期

关键词：

Internet of Things (IoT); actor-critic reinforcement learning; federated learning; blockchain; resource managemnet; queuing theory; exploration-exploitation; INTELLIGENCE; INTERNET;

D O I：

10.1109/TSUSC.2024.3358915

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

At present, the centralized learning models, used for IoT applications generating large amount of data, face several challenges such as bandwidth scarcity, more energy consumption, increased uses of computing resources, poor connectivity, high computational complexity, reduced privacy, and large latency towards data transfer. In order to address the aforementioned challenges, Blockchain-Enabled Federated Learning Networks (BFLNs) emerged recently, which deal with trained model parameters only, rather than raw data. BFLNs provide enhanced security along with improved energy-efficiency and Quality-of-Service (QoS). However, BFLNs suffer with the challenges of exponential increased action space in deciding various parameter levels towards training and block generation. Motivated by aforementioned challenges of BFLNs, in this work, we are proposing an actor-critic Reinforcement Learning (RL) method to model the Machine Learning Model Owner (MLMO) in selecting the optimal set of parameter levels, addressing the challenges of exponential grow of action space in BFLNs. Further, due to the implicit entropy exploration, actor-critic RL method balances the exploration-exploitation trade-off and shows better performance than most off-policy methods, on large discrete action spaces. Therefore, in this work, considering the mobile scenario of the devices, MLMO decides the data and energy levels that the mobile devices use for the training and determine the block generation rate. This leads to minimized system latency and reduced overall cost, while achieving the target accuracy. Specifically, we have used Proximal Policy Optimization (PPO) as an on-policy actor-critic method with it's two variants, one based on Monte Carlo (MC) returns and another based on Generalized Advantage Estimate (GAE). We analyzed that PPO has better exploration and sample efficiency, lesser training time, and consistently higher cumulative rewards, when compared to off-policy Deep Q-Network (DQN).

引用

页码：648 / 660

页数：13

共 39 条

[1] Optimizing the Energy Consumption of Blockchain-Based Systems Using Evolutionary Algorithms: A New Problem Formulation [J].

Alofi, Akram ;

Bokhari, Mahmoud A. ;

Bahsoon, Rami ;

Hendley, Robert .

IEEE TRANSACTIONS ON SUSTAINABLE COMPUTING, 2022, 7 (04) :910-922

[2] Drones' Edge Intelligence Over Smart Environments in B5G: Blockchain and Federated Learning Synergy [J].

Alsamhi, Saeed Hamood ;

Almalki, Faris A. ;

Afghah, Fatemeh ;

Hawbani, Ammar ;

Shvetsov, Alexey, V ;

Lee, Brian ;

Song, Houbing .

IEEE TRANSACTIONS ON GREEN COMMUNICATIONS AND NETWORKING, 2022, 6 (01) :295-312

[3] FLoadNet: Load Balancing in Fog Networks With Cooperative Multiagent Using Actor-Critic Method [J].

Baek, Jungyeon ;

Kaddoum, Georges .

IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2023, 20 (01) :400-414

[4] Blockchain for Internet of Things: A Survey [J].

Dai, Hong-Ning ;

Zheng, Zibin ;

Zhang, Yan .

IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (05) :8076-8094

[5] Hybrid Blockchain-Based Resource Trading System for Federated Learning in Edge Computing [J].

Fan, Sizheng ;

Zhang, Hongbo ;

Zeng, Yuchen ;

Cai, Wei .

IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (04) :2252-2264

[6] Adaptive Resource Allocation in Future Wireless Networks With Blockchain and Mobile Edge Computing [J].

Guo, Fengxian ;

Yu, F. Richard ;

Zhang, Heli ;

Ji, Hong ;

Liu, Mengting ;

Leung, Victor C. M. .

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (03) :1689-1703

[7] Reinforcement learning: A survey [J].

Kaelbling, LP ;

Littman, ML ;

Moore, AW .

JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1996, 4 :237-285

[8] Blockchain for Secure and Efficient Data Sharing in Vehicular Edge Computing and Networks [J].

Kang, Jiawen ;

Yu, Rong ;

Huang, Xumin ;

Wu, Maoqiang ;

Maharjan, Sabita ;

Xie, Shengli ;

Zhang, Yan .

IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (03) :4660-4670

[9]

Kingma Diederik P, 2014, ARXIV PREPRINT ARXIV

[10] Blockchain Assisted Decentralized Federated Learning (BLADE-FL): Performance Analysis and Resource Allocation [J].

Li, Jun ;

Shao, Yumeng ;

Wei, Kang ;

Ding, Ming ;

Ma, Chuan ;

Shi, Long ;

Han, Zhu ;

Poor, H. Vincent .

IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 33 (10) :2401-2415

← 1 2 3 4 →