Cost-aware job scheduling for cloud inutances using deep reinforcement learning

被引:50
作者
Cheng, Feng [1 ]
Huang, Yifeng [2 ]
Tanpure, Bhavana [3 ]
Sawalani, Pawan [3 ]
Cheng, Long [2 ,4 ]
Liu, Cong [5 ]
机构
[1] Southwest Jiaotong Univ, Sch Math, Chengdu, Peoples R China
[2] North China Elect Power Univ Beijing, Sch Control & Comp Engn, Beijing, Peoples R China
[3] Dublin City Univ, Sch Comp, Dublin, Ireland
[4] Dublin City Univ, Insight SFI Res Ctr Data Analyt, Dublin, Ireland
[5] Shandong Univ Technol, Sch Comp Sci & Technol, Zibo, Peoples R China
来源
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS | 2022年 / 25卷 / 01期
基金
美国国家科学基金会;
关键词
Cloud computing; Deep reinforcement learning; Deep Q-learning; QoS; Job scheduling; Cost optimization;
D O I
10.1007/s10586-021-03436-8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As the services provided by cloud vendors are providing better performance, achieving auto-scaling, load-balancing, and optimized performance along with low infrastructure maintenance, more and more companies migrate their services to the cloud. Since the cloud workload is dynamic and complex, scheduling the jobs submitted by users in an effective way is proving to be a challenging task. Although a lot of advanced job scheduling approaches have been proposed in the past years, almost all of them are designed to handle batch jobs rather than real-time workloads, such as that user requests are submitted at any time with any amount of numbers. In this work, we have proposed a Deep Reinforcement Learning (DRL) based job scheduler that dispatches the jobs in real time to tackle this problem. Specifically, we focus on scheduling user requests in such a way as to provide the quality of service (QoS) to the end-user along with a significant reduction of the cost spent on the execution of jobs on the virtual instances. We have implemented our method by Deep Q-learning Network (DQN) model, and our experimental results demonstrate that our approach can significantly outperform the commonly used real-time scheduling algorithms.
引用
收藏
页码:619 / 631
页数:13
相关论文
共 45 条
  • [1] DVFS-Aware Consolidation for Energy-Efficient Clouds
    Arroba, Patricia
    Moya, Jose M.
    Ayala, Jose L.
    Buyya, Rajkumar
    [J]. 2015 INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURE AND COMPILATION (PACT), 2015, : 494 - 495
  • [2] Deep Reinforcement Learning A brief survey
    Arulkumaran, Kai
    Deisenroth, Marc Peter
    Brundage, Miles
    Bharath, Anil Anthony
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (06) : 26 - 38
  • [3] Banicescu I., 2012, SCALABLE COMPUTING T, P437
  • [4] An Autonomic Approach for the Selection of Robust Dynamic Loop Scheduling Techniques
    Boulmier, Anthony
    Banicescu, Ioana
    Ciorba, Florina M.
    Abdennadher, Nabil
    [J]. 2017 16TH INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED COMPUTING (ISPDC-2017), 2017, : 9 - 17
  • [5] A WOA-Based Optimization Approach for Task Scheduling in Cloud Computing Systems
    Chen, Xuan
    Cheng, Long
    Liu, Cong
    Liu, Qingzhi
    Liu, Jinwei
    Mao, Ying
    Murphy, John
    [J]. IEEE SYSTEMS JOURNAL, 2020, 14 (03): : 3117 - 3128
  • [6] Scalable Discovery of Hybrid Process Models in a Cloud Computing Environment
    Cheng, Long
    van Dongen, Boudewijn F.
    van der Aalst, Wil M. P.
    [J]. IEEE TRANSACTIONS ON SERVICES COMPUTING, 2020, 13 (02) : 368 - 380
  • [7] Cheng MX, 2018, ASIA S PACIF DES AUT, P129, DOI 10.1109/ASPDAC.2018.8297294
  • [8] Improving Robustness of Heterogeneous Serverless Computing Systems Via Probabilistic Task Pruning
    Denninnart, Chavit
    Gentry, James
    Salehi, Mohsen Amini
    [J]. 2019 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2019, : 6 - 15
  • [9] Nitrogen removal from groundwater using scoria: Kinetics, equilibria and microstructure
    Dong, Tianzi
    [J]. JOURNAL OF ENVIRONMENTAL SCIENCE AND HEALTH PART A-TOXIC/HAZARDOUS SUBSTANCES & ENVIRONMENTAL ENGINEERING, 2021, 56 (04): : 386 - 393
  • [10] Optimizing Cloud Data Center Energy Efficiency via Dynamic Prediction of CPU Idle Intervals
    Duan, Lide
    Zhan, Dongyuan
    Hohnerlein, Justin
    [J]. 2015 IEEE 8TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, 2015, : 985 - 988