Scalable Virtual Machine Migration using Reinforcement Learning

Authors
Abdul Rahman Hummaida
Norman W. Paton
Rizos Sakellariou
Affiliation
[1] University of Manchester,Department of Computer Science
Source
Journal of Grid Computing | 2022 / Vol. 20
Keywords
Reinforcement learning; Data centre scalability; Virtual machine migration; Hierarchical architecture; Distributed architecture
Abstract
Heuristic approaches require fixed knowledge of how resource allocation should be carried out, and this can be limiting when managing variable cloud workloads. Solutions based on Reinforcement Learning (RL) have been presented to manage cloud infrastructure; however, these tend to be centralized and struggle to maintain Quality of Service (QoS) for data centres with thousands of nodes. To address this, we propose a reinforcement learning management policy that can run decentralized and achieves fast convergence towards efficient resource allocation, resulting in fewer SLA violations than centralized architectures. To address some of the common challenges in applying RL to cloud resource management, such as slow learning and state/action management, we use parallel learning and reduction of the state/action space. We apply a decision-making approach to optimize the migration of a VM and choose a target node to host the VM in a way that brings response time within the SLA level. We also demonstrate a unique, multi-level reinforcement learning cooperation that further reduces SLA violations. We use simulation to evaluate and demonstrate our proposal in practice, and compare the results with an established heuristic, demonstrating a significant reduction in SLA violations and higher scalability.
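The abstract does not give the paper's actual state encoding, reward, or learning rule, so the following is only a minimal sketch of the general idea of learning a target node for a migrating VM over a reduced state/action space. Everything named here is an assumption made for illustration: the bucket count `LEVELS`, the threshold `SLA_UTIL_LIMIT`, the reward in `sla_reward`, and the `MigrationAgent` class. It also uses a one-step tabular value update (a contextual-bandit simplification of Q-learning), not the authors' multi-level scheme.

```python
import random
from collections import defaultdict

LEVELS = 4            # assumed number of utilisation buckets (state/action reduction)
SLA_UTIL_LIMIT = 0.8  # assumed: utilisation above this breaches the response-time SLA


def bucket(utilisation):
    """Map a utilisation in [0, 1] to one of LEVELS coarse buckets."""
    return min(int(utilisation * LEVELS), LEVELS - 1)


class MigrationAgent:
    """Tabular values over (VM load bucket, target node bucket) pairs."""

    def __init__(self, alpha=0.5, epsilon=0.2, seed=0):
        self.q = defaultdict(float)   # unseen pairs start at 0.0
        self.alpha = alpha            # learning rate
        self.epsilon = epsilon        # exploration probability
        self.rng = random.Random(seed)

    def choose_target(self, vm_load, node_utils):
        """Return the index of the node chosen to host the VM (epsilon-greedy)."""
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(len(node_utils))
        state = bucket(vm_load)
        return max(range(len(node_utils)),
                   key=lambda i: self.q[(state, bucket(node_utils[i]))])

    def update(self, vm_load, target_util, reward):
        """One-step update: each migration is treated as a single-step episode."""
        key = (bucket(vm_load), bucket(target_util))
        self.q[key] += self.alpha * (reward - self.q[key])


def sla_reward(vm_load, target_util):
    """Hypothetical reward: +1 if the target stays within the limit, else -1."""
    return 1.0 if target_util + vm_load <= SLA_UTIL_LIMIT else -1.0


def train(agent, vm_load, node_utils, episodes=200):
    """Repeatedly pick a target for the same VM and learn from the reward."""
    for _ in range(episodes):
        i = agent.choose_target(vm_load, node_utils)
        agent.update(vm_load, node_utils[i], sla_reward(vm_load, node_utils[i]))
    return agent
```

Because the table is keyed on utilisation buckets rather than on node identities, its size depends on `LEVELS` and not on the number of nodes, which is one simple way to keep the state/action space small. Independent agents of this kind could in principle be trained per node group in parallel, but how the paper coordinates its agents is not specified in the abstract.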