A Double-Timescale Reinforcement Learning Based Cloud-Edge Collaborative Framework for Decomposable Intelligent Services in Industrial Internet of Things

被引:0
作者
Zhang Qiuyang [1 ,2 ]
Wang Ying [1 ,2 ]
Wang Xue [1 ,2 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Informat & Commun Engn, Beijing 100876, Peoples R China
[2] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 100876, Peoples R China
基金
中国国家自然科学基金;
关键词
computing service; edge intelligence; industrial internet of things (IIoT); reinforcement learning (RL); RESOURCE-ALLOCATION; NETWORK;
D O I
暂无
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
With the proportion of intelligent services in the industrial internet of things (IIoT) rising rapidly, its data dependency and decomposability increase the difficulty of scheduling computing resources. In this paper, we propose an intelligent service computing framework. In the framework, we take the long-term rewards of its important participants, edge service providers, as the optimization goal, which is related to service delay and computing cost. Considering the different update frequencies of data deployment and service offloading, double-timescale reinforcement learning is utilized in the framework. In the small-scale strategy, the frequent concurrency of services and the difference in service time lead to the fuzzy relationship between reward and action. To solve the fuzzy reward problem, a reward mapping-based reinforcement learning (RMRL) algorithm is proposed, which enables the agent to learn the relationship between reward and action more clearly. The large time scale strategy adopts the improved Monte Carlo tree search (MCTS) algorithm to improve the learning speed. The simulation results show that the strategy is superior to popular reinforcement learning algorithms such as double Q-learning (DDQN) and dueling Q-learning (dueling-DQN) in learning speed, and the reward is also increased by 14%.
引用
收藏
页码:181 / 199
页数:19
相关论文
共 32 条
[1]   Fog Computing for 5G Tactile Industrial Internet of Things: QoE-Aware Resource Allocation Model [J].
Aazam, Mohammad ;
Harras, Khaled A. ;
Zeadally, Sherali .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2019, 15 (05) :3085-3092
[2]   Deploying Fog Computing in Industrial Internet of Things and Industry 4.0 [J].
Aazam, Mohammad ;
Zeadally, Sherali ;
Harras, Khaled A. .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2018, 14 (10) :4674-4682
[3]  
Baier H., 2012, 2012 IEEE Conference on Computational Intelligence and Games (CIG 2012), P227, DOI 10.1109/CIG.2012.6374160
[4]   Deep Reinforcement Learning-Based Dynamic Resource Management for Mobile Edge Computing in Industrial Internet of Things [J].
Chen, Ying ;
Liu, Zhiyong ;
Zhang, Yongchao ;
Wu, Yuan ;
Chen, Xin ;
Zhao, Lian .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (07) :4925-4934
[5]  
Crankshaw Daniel, 2020, SoCC '20: Proceedings of the 11th ACM Symposium on Cloud Computing, P477, DOI 10.1145/3419111.3421285
[6]   MEC enabled cooperative sensing and resource allocation for industrial IoT systems [J].
Dai, Yanpeng ;
Zhao, Lihong ;
Lyu, Ling .
CHINA COMMUNICATIONS, 2022, 19 (07) :214-225
[7]   Energy-aware task assignment for mobile cyber-enabled applications in heterogeneous cloud computing [J].
Gai, Keke ;
Qiu, Meikang ;
Zhao, Hui .
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2018, 111 :126-135
[8]   Resource Allocation for Latency-Aware Federated Learning in Industrial Internet of Things [J].
Gao, Weifeng ;
Zhao, Zhiwei ;
Min, Geyong ;
Ni, Qiang ;
Jiang, Yuhong .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (12) :8505-8513
[9]   Edge-Cloud Computing for Internet of Things Data Analytics: Embedding Intelligence in the Edge With Deep Learning [J].
Ghosh, Ananda Mohon ;
Grolinger, Katarina .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (03) :2191-2200
[10]   Incentive-Driven Task Allocation for Collaborative Edge Computing in Industrial Internet of Things [J].
Hou, Wenjing ;
Wen, Hong ;
Zhang, Ning ;
Wu, Jinsong ;
Lei, Wenxin ;
Zhao, Runhui .
IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (01) :706-718