Deep Reinforcement Learning Assisted Federated Learning Algorithm for Data Management of IIoT

被引:158
作者
Zhang, Peiying [1 ]
Wang, Chao [1 ]
Jiang, Chunxiao [2 ,3 ]
Han, Zhu [4 ,5 ]
机构
[1] China Univ Petr East China, Coll Comp Sci & Technol, Qingdao 266580, Peoples R China
[2] Tsinghua Univ, Natl Res Ctr Informat Sci & Technol, Beijing 100084, Peoples R China
[3] Tsinghua Space Ctr, Beijing 100084, Peoples R China
[4] Univ Houston, Dept Elect & Comp Engn, Houston, TX 77440 USA
[5] Kyung Hee Univ, Dept Comp Sci & Engn, Seoul 446701, South Korea
关键词
Informatics; Data training; deep reinforcement learning (DRL); federated learning (FL); industrial Internet of Things (IIoT); IIoT equipment; INDUSTRIAL INTERNET; IOT;
D O I
10.1109/TII.2021.3064351
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The continuous expanded scale of the industrial Internet of Things (IIoT) leads to IIoT equipments generating massive amounts of user data every moment. According to the different requirement of end users, these data usually have high heterogeneity and privacy, while most of users are reluctant to expose them to the public view. How to manage these time series data in an efficient and safe way in the field of IIoT is still an open issue, such that it has attracted extensive attention from academia and industry. As a new machine learning paradigm, federated learning (FL) has great advantages in training heterogeneous and private data. This article studies the FL technology applications to manage IIoT equipment data in wireless network environments. In order to increase the model aggregation rate and reduce communication costs, we apply deep reinforcement learning (DRL) to IIoT equipment selection process, specifically to select those IIoT equipment nodes with accurate models. Therefore, we propose a FL algorithm assisted by DRL, which can take into account the privacy and efficiency of data training of IIoT equipment. By analyzing the data characteristics of IIoT equipments, we use MNIST, fashion MNIST, and CIFAR-10 datasets to represent the data generated by IIoT. During the experiment, we employ the deep neural network model to train the data, and experimental results show that the accuracy can reach more than 97%, which corroborates the effectiveness of the proposed algorithm.
引用
收藏
页码:8475 / 8484
页数:10
相关论文
共 35 条
[21]   Robust and Communication-Efficient Federated Learning From Non-i.i.d. Data [J].
Sattler, Felix ;
Wiedemann, Simon ;
Mueller, Klaus-Robert ;
Samek, Wojciech .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (09) :3400-3413
[22]   Deep-Reinforcement-Learning-Based Spectrum Resource Management for Industrial Internet of Things [J].
Shi, Zhaoyuan ;
Xie, Xianzhong ;
Lu, Huabing ;
Yang, Helin ;
Kadoch, Michel ;
Cheriet, Mohamed .
IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (05) :3476-3489
[23]   Efficient Training Management for Mobile Crowd-Machine Learning: A Deep Reinforcement Learning Approach [J].
Tran The Anh ;
Nguyen Cong Luong ;
Niyato, Dusit ;
Kim, Dong In ;
Wang, Li-Chun .
IEEE WIRELESS COMMUNICATIONS LETTERS, 2019, 8 (05) :1345-1348
[24]   Thirty Years of Machine Learning: The Road to Pareto-Optimal Wireless Networks [J].
Wang, Jingjing ;
Jiang, Chunxiao ;
Zhang, Haijun ;
Ren, Yong ;
Chen, Kwang-Cheng ;
Hanzo, Lajos .
IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2020, 22 (03) :1472-1514
[25]   Distributed Q-Learning Aided Heterogeneous Network Association for Energy-Efficient IIoT [J].
Wang, Jingjing ;
Jiang, Chunxiao ;
Zhang, Kai ;
Hou, Xiangwang ;
Ren, Yong ;
Qian, Yi .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (04) :2756-2764
[26]   MTES: An Intelligent Trust Evaluation Scheme in Sensor-Cloud-Enabled Industrial Internet of Things [J].
Wang, Tian ;
Luo, Hao ;
Jia, Weijia ;
Liu, Anfeng ;
Xie, Mande .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (03) :2054-2062
[27]   In-Edge AI: Intelligentizing Mobile Edge Computing, Caching and Communication by Federated Learning [J].
Wang, Xiaofei ;
Han, Yiwen ;
Wang, Chenyang ;
Zhao, Qiyang ;
Chen, Xu ;
Chen, Min .
IEEE NETWORK, 2019, 33 (05) :156-165
[28]   Personalized Federated Learning for Intelligent IoT Applications: A Cloud-Edge Based Framework [J].
Wu, Qiong ;
He, Kaiwen ;
Chen, Xu .
IEEE OPEN JOURNAL OF THE COMPUTER SOCIETY, 2020, 1 (01) :35-44
[29]   Scheduling Policies for Federated Learning in Wireless Networks [J].
Yang, Howard H. ;
Liu, Zuozhu ;
Quek, Tony Q. S. ;
Poor, H. Vincent .
IEEE TRANSACTIONS ON COMMUNICATIONS, 2020, 68 (01) :317-333
[30]   RMAF: Relu-Memristor-Like Activation Function for Deep Learning [J].
Yu, Yongbin ;
Adu, Kwabena ;
Tashi, Nyima ;
Anokye, Patrick ;
Wang, Xiangxiang ;
Ayidzoe, Mighty Abra .
IEEE ACCESS, 2020, 8 :72727-72741