Deep reinforcement learning-based task scheduling and resource allocation for NOMA-MEC in Industrial Internet of Things

被引:19
作者
Lin, Lixia [1 ]
Zhou, Wen'an [1 ]
Yang, Zhicheng [1 ]
Liu, Jianlong [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Xitucheng Rd 10th, Beijing 100876, Peoples R China
基金
中国国家自然科学基金;
关键词
Mobile edge computing; Non-Orthogonal Multiple Access; Delay-sensitive; Industrial internet of things; Prediction-based deep reinforcement learning; NONORTHOGONAL MULTIPLE-ACCESS; ENERGY-CONSUMPTION; EDGE; NETWORKS; MINIMIZATION; SYSTEMS;
D O I
10.1007/s12083-022-01348-x
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Mobile Edge Computing (MEC) and Non-Orthogonal Multiple Access (NOMA) have been treated as promising technologies to process the delay-sensitive tasks in the Industrial Internet of Things (IIoT) network. The cooperation among multiple MEC servers is essential to improve the processing capacity of MEC systems. However, the dynamic IIoT environment with unknown changing models, including time-varying wireless channels, diversified task requests, and dynamic load on wireless resources and multiple MEC servers, may continuously affect the task offloading decision and NOMA user pairing, which brings great challenges to the resource management in the NOMA-MEC-based IIoT network. In order to solve this problem, we design a distributed deep reinforcement learning (DRL) based solution to improve the task satisfaction ratio by jointly optimizing the task offloading decision and the sub-channel assignment to support the binary computing offloading policy. For each IIoT device agent, to deal with the problem of partial state observability, the Recurrent Neural Network (RNN) is employed to predict the load states of sub-channels and MEC servers, which is further used for the decision of the RL agent. Simulation results show that the proposed prediction-based-DRL (P-DRL) method can achieve higher task satisfaction ratio than exiting schemes.
引用
收藏
页码:170 / 188
页数:19
相关论文
共 48 条
[21]   Deep Reinforcement Learning for Collaborative Edge Computing in Vehicular Networks [J].
Li, Mushu ;
Gao, Jie ;
Zhao, Lian ;
Shen, Xuemin .
IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2020, 6 (04) :1122-1135
[22]   Toward Computing Resource Reservation Scheduling in Industrial Internet of Things [J].
Liang, Fan ;
Yu, Wei ;
Liu, Xing ;
Griffith, David ;
Golmie, Nada .
IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (10) :8210-8222
[23]   Energy-Efficient Resource Allocation and Subchannel Assignment for NOMA-Enabled Multiaccess Edge Computing [J].
Liu, Lina ;
Sun, Bo ;
Tan, Xiaoqi ;
Tsang, Danny H. K. .
IEEE SYSTEMS JOURNAL, 2022, 16 (01) :1558-1569
[24]   Selective Offloading in Mobile Edge Computing for the Green Internet of Things [J].
Lyu, Xinchen ;
Tian, Hui ;
Jiang, Li ;
Vinel, Alexey ;
Maharjan, Sabita ;
Gjessing, Stein ;
Zhang, Yan .
IEEE NETWORK, 2018, 32 (01) :54-60
[25]   A Survey of Rate-Optimal Power Domain NOMA With Enabling Technologies of Future Wireless Networks [J].
Maraqa, Omar ;
Rajasekaran, Aditya S. ;
Al-Ahmadi, Saad ;
Yanikomeroglu, Halim ;
Sait, Sadiq M. .
IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2020, 22 (04) :2192-2235
[26]   Human-level control through deep reinforcement learning [J].
Mnih, Volodymyr ;
Kavukcuoglu, Koray ;
Silver, David ;
Rusu, Andrei A. ;
Veness, Joel ;
Bellemare, Marc G. ;
Graves, Alex ;
Riedmiller, Martin ;
Fidjeland, Andreas K. ;
Ostrovski, Georg ;
Petersen, Stig ;
Beattie, Charles ;
Sadik, Amir ;
Antonoglou, Ioannis ;
King, Helen ;
Kumaran, Dharshan ;
Wierstra, Daan ;
Legg, Shane ;
Hassabis, Demis .
NATURE, 2015, 518 (7540) :529-533
[27]   Latency and energy aware rate maximization in MC-NOMA-based multi-access edge computing: A two-stage deep reinforcement learning approach [J].
Nduwayezu, Maurice ;
Yun, Ji-Hoon .
COMPUTER NETWORKS, 2022, 207
[28]   Mobile Edge Computing-Enabled Internet of Vehicles: Toward Energy-Efficient Scheduling [J].
Ning, Zhaolong ;
Huang, Jun ;
Wang, Xiaojie ;
Rodrigues, Joel J. P. C. ;
Guo, Lei .
IEEE NETWORK, 2019, 33 (05) :198-205
[29]   Deep Reinforcement Learning Based Resource Management for Multi-Access Edge Computing in Vehicular Networks [J].
Peng, Haixia ;
Shen, Xuemin .
IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2020, 7 (04) :2416-2428
[30]   Learning Driven NOMA Assisted Vehicular Edge Computing via Underlay Spectrum Sharing [J].
Qian, Liping ;
Wu, Yuan ;
Yu, Ningning ;
Jiang, Fuli ;
Zhou, Haibo ;
Quek, Tony Q. S. .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (01) :977-992