Robustness challenges in Reinforcement Learning based time-critical cloud resource scheduling: A Meta-Learning based solution

被引：8

作者：

Liu, Hongyun ^{[1
,2
]}

Chen, Peng ^{[3
]}

Ouyang, Xue ^{[4
]}

Gao, Hui ^{[5
]}

Yan, Bing ^{[6
]}

Grosso, Paola ^{[1
]}

Zhao, Zhiming ^{[1
]}

机构：

[1] Univ Amsterdam, Informat Inst, NL-1098 XH Amsterdam, Netherlands

[2] Univ Amsterdam, Grad Sch Informat, NL-1098 XH Amsterdam, Netherlands

[3] Xihua Univ, Sch Comp & Software Engn, Chengdu 610039, Peoples R China

[4] Natl Univ Def Technol, Sch Comp Sci, Changsha 410073, Peoples R China

[5] Shaanxi Univ Sci & Technol, Coll Elect & Control Engn, Xian 710021, Peoples R China

[6] Univ Adelaide, Sch Elect & Elect Engn, Adelaide, SA 5005, Australia

来源：

FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE | 2023年 / 146卷

基金：

中国国家自然科学基金;

关键词：

Robustness; Reinforcement Learning; Meta Learning; Resource management; Task scheduling; Cloud computing; MANAGEMENT;

D O I：

10.1016/j.future.2023.03.029

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Cloud computing attracts increasing attention in processing dynamic computing tasks and automating the software development and operation pipeline. In many cases, the computing tasks have strict deadlines. The cloud resource manager (e.g., orchestrator) effectively manages the resources and provides tasks Quality of Service (QoS). Cloud task scheduling is tricky due to the dynamic nature of task workload and resource availability. Reinforcement Learning (RL) has attracted lots of research attention in scheduling. However, those RL-based approaches suffer from low scheduling performance robustness when the task workload and resource availability change, particularly when handling timecritical tasks. This paper focuses on both challenges of robustness and deadline guarantee among such RL, specifically Deep RL (DRL)-based scheduling approaches. We quantify the robustness measurements as the retraining time and investigate how to improve both robustness and deadline guarantee of DRL-based scheduling. We propose MLR-TC-DRLS, a practical, robust Meta Deep Reinforcement Learning-based scheduling solution to provide time-critical tasks deadline guarantee and fast adaptation under highly dynamic situations. We comprehensively evaluate MLR-TC-DRLS performance against RL-based and RL advanced variants-based scheduling approaches using real-world and synthetic data. The evaluations validate that our proposed approach improves the scheduling performance robustness of typical DRL variants scheduling approaches with 97%-98.5% deadline guarantees and 200%-500% faster adaptation.

引用

页码：18 / 33

页数：16

共 50 条

[41] PBRL-TChain: A performance-enhanced permissioned blockchain for time-critical applications based on reinforcement learning [J].

Zhang, Yiguang ;

Lin, Junxiong ;

Lu, Zhihui ;

Duan, Qiang ;

Huang, Shih-Chia .

FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 154 :301-313

[42] A Reinforcement Learning-Based Resource Allocation Scheme for Cloud Robotics [J].

Liu, Hang ;

Liu, Shiwen ;

Zheng, Kan .

IEEE ACCESS, 2018, 6 :17215-17222

[43] Joint scheduling and resource allocation based on reinforcement learning in integrated access and backhaul networks [J].

Kim, Joeun ;

Jeon, Youngil ;

Lee, Junhwan ;

Lee, Moon-Sik ;

Kwon, Taesoo .

ICT EXPRESS, 2025, 11 (03) :536-541

[44] A Meta-learning Method Based on Temporal Difference Error [J].

Kobayashi, Kunikazu ;

Mizoue, Hiroyuki ;

Kuremoto, Takashi ;

Obayashi, Masanao .

NEURAL INFORMATION PROCESSING, PT 1, PROCEEDINGS, 2009, 5863 :530-537

[45] Resource Allocation Reinforcement Learning for Quality of Service Maintenance in Cloud-Based Services [J].

Hong, Dupyo ;

Kim, DongWan ;

Min, Oh Jung ;

Shin, Yongtae .

2023 INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING, ICOIN, 2023, :517-521

[46] Scheduling Pattern of Time Triggered Ethernet Based on Reinforcement Learning [J].

He Feng ;

Xiong Li ;

Zhou Xuan ;

Li Haoruo ;

Xiong Huagang .

CHINESE JOURNAL OF ELECTRONICS, 2023, 32 (05) :1022-1035

[47] Real-Time Microgrid Energy Scheduling Using Meta-Reinforcement Learning [J].

Shen, Huan ;

Shen, Xingfa ;

Chen, Yiming .

ENERGIES, 2024, 17 (10)

[48] Hierarchical Reinforcement Learning Based Resource Allocation for RAN Slicing [J].

Anil Akyildiz, Hasan ;

Faruk Gemici, Omer ;

Hokelek, Ibrahim ;

Ali Cirpan, Hakan .

IEEE ACCESS, 2024, 12 :75818-75831

[49] Cross-domain Resemblance Detection based on Meta-learning for Cloud Storage [J].

Li, Baisong ;

Tian, Wenlong ;

Li, Ruixuan ;

Xiao, Weijun ;

Fu, Zhongming ;

Ye, Xuming ;

Duan, Renjiao ;

Li, Yusheng ;

Xu, Zhiyong .

2022 IEEE INTERNATIONAL PERFORMANCE, COMPUTING, AND COMMUNICATIONS CONFERENCE, IPCCC, 2022,

[50] Online scheduling of dependent tasks of cloud’s workflows to enhance resource utilization and reduce the makespan using multiple reinforcement learning-based agents [J].

Ali Asghari ;

Mohammad Karim Sohrabi ;

Farzin Yaghmaee .

Soft Computing, 2020, 24 :16177-16199

← 1 2 3 4 5 →