Edge-Enabled Two-Stage Scheduling Based on Deep Reinforcement Learning for Internet of Everything

Cited by: 68
Authors
Zhou, Xiaokang [1 ]
Liang, Wei [2 ]
Yan, Ke [3 ]
Li, Weimin [4 ]
Wang, Kevin I-Kai [5 ]
Ma, Jianhua [6 ]
Jin, Qun [7 ]
Affiliations
[1] Shiga Univ, Fac Data Sci, Hikone 5228522, Japan
[2] Hunan Univ Technol & Business, Base Int Sci & Technol Innovat & Cooperat Big Data, Changsha 410205, Peoples R China
[3] Natl Univ Singapore, Coll Design & Engn, Singapore 117566, Singapore
[4] Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China
[5] Univ Auckland, Dept Elect Comp & Software Engn, Auckland 1010, New Zealand
[6] Hosei Univ, Fac Comp & Informat Sci, Tokyo 1028160, Japan
[7] Waseda Univ, Fac Human Sci, Tokorozawa 3591192, Japan
Funding
National Natural Science Foundation of China;
Keywords
Task analysis; Internet of Things; Processor scheduling; Reinforcement learning; Scheduling; Job shop scheduling; Optimal scheduling; Deep reinforcement learning; edge computing; Internet of Everything (IoE); makespan; two-stage scheduling; IOT;
DOI
10.1109/JIOT.2022.3179231
Chinese Library Classification (CLC) Number
TP [Automation Technology, Computer Technology];
Discipline Classification Code
0812;
Abstract
Nowadays, the concept of Internet of Everything (IoE) is a hotly discussed topic and plays an increasingly indispensable role in modern intelligent applications. These applications have strict real-time requirements under limited network and computing resources, so transferring and processing the tremendous amount of raw data in a cloud center becomes highly demanding. The edge-cloud computing infrastructure allows a large amount of data to be processed on nearby edge nodes, so that only the extracted and encrypted key features are transmitted to the data center. This offers the potential to achieve end-edge-cloud-based big data intelligence for IoE through a typical two-stage data processing scheme while satisfying data security constraints. In this study, a deep-reinforcement-learning-enhanced two-stage scheduling (DRL-TSS) model is proposed to address the NP-hard scheduling problem, in terms of operation complexity, in end-edge-cloud Internet of Things systems; the model allocates computing resources within an edge-enabled infrastructure to ensure that computing tasks are completed at minimum cost. A presorting scheme based on Johnson's rule is developed to preprocess the two-stage tasks on multiple executors, and a DRL mechanism is designed to minimize the overall makespan using a newly designed instant reward that accounts for the maximal utilization of each executor in edge-enabled two-stage scheduling. The performance of the method is evaluated against three existing scheduling techniques, and experimental results demonstrate that the proposed algorithm achieves better learning efficiency and scheduling performance, with a 1.1-approximation to the targeted optimum, for IoE applications.
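As a rough illustration of the presorting idea mentioned in the abstract (not the authors' exact implementation), the sketch below applies Johnson's rule to order two-stage tasks and computes the resulting makespan on a single two-stage pipeline. The function names, the example task data, and the single-executor simplification are assumptions for clarity; the DRL-TSS model itself dispatches the presorted tasks across multiple edge/cloud executors.

```python
# A minimal sketch of Johnson's-rule presorting for two-stage tasks.
# All names and data here are illustrative, not taken from the paper.

def johnson_order(tasks):
    """Order task indices by Johnson's rule for a two-stage flow shop.

    tasks: list of (stage1_time, stage2_time) pairs, e.g. end/edge processing
    time followed by edge/cloud processing time.
    """
    first = [i for i, (a, b) in enumerate(tasks) if a <= b]
    second = [i for i, (a, b) in enumerate(tasks) if a > b]
    first.sort(key=lambda i: tasks[i][0])    # ascending stage-1 time
    second.sort(key=lambda i: -tasks[i][1])  # descending stage-2 time
    return first + second


def two_stage_makespan(tasks, order):
    """Makespan when one executor per stage processes tasks in the given order."""
    end1 = end2 = 0.0
    for i in order:
        a, b = tasks[i]
        end1 += a                    # stage 1 runs tasks back to back
        end2 = max(end2, end1) + b   # stage 2 starts once its input is ready
    return end2


if __name__ == "__main__":
    tasks = [(3.0, 2.0), (1.0, 4.0), (2.0, 5.0)]   # (stage1, stage2) times
    order = johnson_order(tasks)
    print("Johnson order:", order)                 # -> [1, 2, 0]
    print("Makespan:", two_stage_makespan(tasks, order))
```

Johnson's rule is optimal only for the classical two-machine flow-shop case; in the paper it serves as a preprocessing step, after which the DRL agent assigns the ordered tasks to multiple executors guided by the utilization-aware instant reward.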
Pages: 3295-3304
Number of pages: 10
Related Papers
31 records in total
  • [1] Parallel Reinforcement Learning With Minimal Communication Overhead for IoT Environments
    Camelo, Miguel
    Claeys, Maxim
    Latre, Steven
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (02) : 1387 - 1400
  • [2] Chen K., 2018, J. Hardw. Syst. Secur., V2, P97, DOI 10.1007/s41635-017-0029-7
  • [3] Edge-Aided Computing and Transmission Scheduling for LTE-U-Enabled IoT
    He, Hongli
    Shan, Hangguan
    Huang, Aiping
    Ye, Qiang
    Zhuang, Weihua
    [J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (12) : 7881 - 7896
  • [4] Green Resource Allocation Based on Deep Reinforcement Learning in Content-Centric IoT
    He, Xiaoming
    Wang, Kun
    Huang, Huawei
    Miyazaki, Toshiaki
    Wang, Yixuan
    Guo, Song
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2020, 8 (03) : 781 - 796
  • [5] Hoogeveen JA, 1996, EUR J OPER RES, V89, P172, DOI 10.1016/S0377-2217(96)90070-3
  • [6] Adaptive Wireless Network Management with Multi-Agent Reinforcement Learning
    Ivoghlian, Ameer
    Salcic, Zoran
    Wang, Kevin I-Kai
    [J]. SENSORS, 2022, 22 (03)
  • [7] Reinforcement Learning for Real-Time Optimization in NB-IoT Networks
    Jiang, Nan
    Deng, Yansha
    Nallanathan, Arumugam
    Chambers, Jonathon A.
    [J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2019, 37 (06) : 1424 - 1440
  • [8] Johnson SM, 1954, NAV RES LOG, V1, P61, DOI 10.1002/nav.3800010110
  • [9] Kim Y., 2020, IEEE ACCESS
  • [10] Intelligent IoT Connectivity: Deep Reinforcement Learning Approach
    Kwon, Minhae
    Lee, Juhyeon
    Park, Hyunggon
    [J]. IEEE SENSORS JOURNAL, 2020, 20 (05) : 2782 - 2791