Resource Allocation With Workload-Time Windows for Cloud-Based Software Services: A Deep Reinforcement Learning Approach

被引:36
作者
Chen, Xing [1 ,2 ]
Yang, Lijian [1 ,2 ]
Chen, Zheyi [1 ,2 ]
Min, Geyong [3 ]
Zheng, Xianghan [1 ,2 ]
Rong, Chunming [4 ]
机构
[1] Fuzhou Univ, Coll Comp & Sci, Fuzhou 350116, Peoples R China
[2] Fujian Key Lab Network Comp & Intelligent Informa, Fuzhou 350116, Peoples R China
[3] Univ Exeter, Dept Comp Sci, Coll Engn Math & Phys Sci, Exeter EX4 4QF, Devon, England
[4] Univ Stavanger, Dept Elect Engn & Comp Sci, N-4036 Stavanger, Norway
基金
中国国家自然科学基金;
关键词
Cloud-based software services; resource allocation; workload-time windows; deep reinforcement learning; feedback control;
D O I
10.1109/TCC.2022.3169157
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As the workloads and service requests in cloud computing environments change constantly, cloud-based software services need to adaptively allocate resources for ensuring the Quality-of-Service (QoS) while reducing resource costs. However, it is very challenging to achieve adaptive resource allocation for cloud-based software services with complex and variable system states. Most of the existing methods only consider the current condition of workloads, and thus cannot well adapt to real-world cloud environments subject to fluctuating workloads. To address this challenge, we propose a novel Deep Reinforcement learning based resource Allocation method with workload-time Windows (DRAW) for cloud-based software services that considers both the current and future workloads in the resource allocation process. Specifically, an original Deep Q-Network (DQN) based prediction model of management operations is trained based on workload-time windows, which can be used to predict appropriate management operations under different system states. Next, a new feedback-control mechanism is designed to construct the objective resource allocation plan under the current system state through iterative execution of management operations. Extensive simulation results demonstrate that the prediction accuracy of management operations generated by the proposed DRAW method can reach 90.69%. Moreover, the DRAW can achieve the optimal/near-optimal performance and outperform other classic methods by 3 similar to 13% under different scenarios.
引用
收藏
页码:1871 / 1885
页数:15
相关论文
共 40 条
[1]  
Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[2]   Predictive Autoscaling of Microservices Hosted in Fog Microdata Center [J].
Abdullah, Muhammad ;
Iqbal, Waheed ;
Mahmood, Arif ;
Bukhari, Faisal ;
Erradi, Abdelkarim .
IEEE SYSTEMS JOURNAL, 2021, 15 (01) :1275-1286
[3]   Adaptive Resource Allocation and Provisioning in Multi-Service Cloud Environments [J].
Alsarhan, Ayoub ;
Itradat, Awni ;
Al-Dubai, Ahmed Y. ;
Zomaya, Albert Y. ;
Min, Geyong .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2018, 29 (01) :31-42
[4]   A Hierarchical Receding Horizon Algorithm for QoS-Driven Control of Multi-laaS Applications [J].
Ardagna, Danilo ;
Ciavotta, Michele ;
Lancellotti, Riccardo ;
Guerriero, Michele .
IEEE TRANSACTIONS ON CLOUD COMPUTING, 2021, 9 (02) :418-434
[5]   A View of Cloud Computing [J].
Armbrust, Michael ;
Fox, Armando ;
Griffith, Rean ;
Joseph, Anthony D. ;
Katz, Randy ;
Konwinski, Andy ;
Lee, Gunho ;
Patterson, David ;
Rabkin, Ariel ;
Stoica, Ion ;
Zaharia, Matei .
COMMUNICATIONS OF THE ACM, 2010, 53 (04) :50-58
[6]   Proactive Container Auto-scaling for Cloud Native Machine Learning Services [J].
Buchaca, David ;
Berral, Josep LLuis ;
Wang, Chen ;
Youssef, Alaa .
2020 IEEE 13TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD 2020), 2020, :475-479
[7]   Workload Prediction Using ARIMA Model and Its Impact on Cloud Applications' QoS [J].
Calheiros, Rodrigo N. ;
Masoumi, Enayat ;
Ranjan, Rajiv ;
Buyya, Rajkumar .
IEEE TRANSACTIONS ON CLOUD COMPUTING, 2015, 3 (04) :449-458
[8]   Self-Adaptive and Online QoS Modeling for Cloud-Based Software Services [J].
Chen, Tao ;
Bahsoon, Rami .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2017, 43 (05) :453-475
[9]   Resource Allocation for Cloud-Based Software Services Using Prediction-Enabled Feedback Control With Reinforcement Learning [J].
Chen, Xing ;
Zhu, Fangning ;
Chen, Zheyi ;
Min, Geyong ;
Zheng, Xianghan ;
Rong, Chunming .
IEEE TRANSACTIONS ON CLOUD COMPUTING, 2022, 10 (02) :1117-1129
[10]   Self-adaptive resource allocation for cloud-based software services based on iterative QoS prediction model [J].
Chen, Xing ;
Wang, Haijiang ;
Ma, Yun ;
Zheng, Xianghan ;
Guo, Longkun .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2020, 105 :287-296