Collective Deep Reinforcement Learning for Intelligence Sharing in the Internet of Intelligence-Empowered Edge Computing

被引:41
作者
Tang, Qinqin [1 ]
Xie, Renchao [2 ,3 ]
Yu, Fei Richard [4 ,5 ]
Chen, Tianjiao [6 ]
Zhang, Ran [2 ,3 ]
Huang, Tao [2 ,3 ]
Liu, Yunjie [2 ,3 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing 100876, Peoples R China
[2] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 100876, Peoples R China
[3] Purple Mt Labs, Nanjing 211111, Peoples R China
[4] Shenzhen Univ, Guangdong Lab Artificial Intelligence & Digital Ec, Shenzhen 518107, Guangdong, Peoples R China
[5] Carleton Univ, Sch Informat Technol, Ottawa, ON K1S 5B6, Canada
[6] China Mobile Res Inst, Beijing 100053, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Distributed intelligence sharing; Internet of intelligence; edge computing; collective deep reinforcement learning; NETWORKS; MODEL; COMMUNICATION; MANAGEMENT; REPUTATION; FRAMEWORK; DESIGN; SYSTEM; SECURE; 5G;
D O I
10.1109/TMC.2022.3199812
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Edge intelligence is emerging as a new interdiscipline to push learning intelligence from remote centers to the edge of the network. However, with its widespread deployment, new challenges arise in terms of training efficiency and service of quality (QoS). Massive repetitive model training is ubiquitous due to the inevitable needs of users for the same types of data and training results. Additionally, a smaller volume of data samples will cause the over-fitting of models. To address these issues, driven by the Internet of intelligence, this article proposes a distributed edge intelligence sharing scheme, which allows distributed edge nodes to quickly and economically improve learning performance by sharing their learned intelligence. Considering the time-varying edge network states including data collection states, computing and communication states, and node reputation states, the distributed intelligence sharing is formulated as a multi-agent Markov decision process (MDP). Then, a novel collective deep reinforcement learning (CDRL) algorithm is designed to obtain the optimal intelligence sharing policy, which consists of local soft actor-critic (SAC) learning at each edge node and collective learning between different edge nodes. Simulation results indicate our proposal outperforms the benchmark schemes in terms of learning efficiency and intelligence sharing efficiency.
引用
收藏
页码:6327 / 6342
页数:16
相关论文
共 50 条
[1]  
Achiam J, 2017, Arxiv, DOI arXiv:1703.01732
[2]   Edge Intelligence (EI)-Enabled HTTP Anomaly Detection Framework for the Internet of Things (IoT) [J].
An, Yufei ;
Yu, F. Richard ;
Li, Jianqiang ;
Chen, Jianyong ;
Leung, Victor C. M. .
IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (05) :3554-3566
[3]   AUCTION: Automated and Quality-Aware Client Selection Framework for Efficient Federated Learning [J].
Deng, Yongheng ;
Lyu, Feng ;
Ren, Ju ;
Wu, Huaqing ;
Zhou, Yuezhi ;
Zhang, Yaoxue ;
Shen, Xuemin .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 33 (08) :1996-2009
[4]   Vehicular Blockchain-Based Collective Learning for Connected and Autonomous Vehicles [J].
Fu, Yuchuan ;
Yu, Fei Richard ;
Li, Changle ;
Luan, Tom H. ;
Zhang, Yao .
IEEE WIRELESS COMMUNICATIONS, 2020, 27 (02) :197-203
[5]  
Haarnoja T, 2018, PR MACH LEARN RES, V80
[6]   Edge-Aided Computing and Transmission Scheduling for LTE-U-Enabled IoT [J].
He, Hongli ;
Shan, Hangguan ;
Huang, Aiping ;
Ye, Qiang ;
Zhuang, Weihua .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (12) :7881-7896
[7]   Software Defined Networking for Energy Harvesting Internet of Things [J].
Huang, Xumin ;
Yu, Rong ;
Kang, Jiawen ;
Xia, Zhuoquan ;
Zhang, Yan .
IEEE INTERNET OF THINGS JOURNAL, 2018, 5 (03) :1389-1399
[8]   Incentive Mechanism for Reliable Federated Learning: A Joint Optimization Approach to Combining Reputation and Contract Theory [J].
Kang, Jiawen ;
Xiong, Zehui ;
Niyato, Dusit ;
Xie, Shengli ;
Zhang, Junshan .
IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (06) :10700-10714
[9]   Toward Secure Blockchain-Enabled Internet of Vehicles: Optimizing Consensus Management Using Reputation and Contract Theory [J].
Kang, Jiawen ;
Xiong, Zehui ;
Niyato, Dusit ;
Ye, Dongdong ;
Kim, Dong In ;
Zhao, Jun .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (03) :2906-2920
[10]   A Hybrid Computing Solution and Resource Scheduling Strategy for Edge Computing in Smart Manufacturing [J].
Li, Xiaomin ;
Wan, Jiafu ;
Dai, Hong-Ning ;
Imran, Muhammad ;
Xia, Min ;
Celesti, Antonio .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2019, 15 (07) :4225-4234