HUNTER: AI based holistic resource management for sustainable cloud computing

被引:46
作者
Tuli, Shreshth [1 ]
Gill, Sukhpal Singh [2 ]
Xu, Minxian [3 ]
Garraghan, Peter [4 ]
Bahsoon, Rami [5 ]
Dustdar, Schahram [6 ]
Sakellariou, Rizos [7 ]
Rana, Omer [8 ]
Buyya, Rajkumar [9 ]
Casale, Giuliano [1 ]
Jennings, Nicholas R. [1 ,10 ]
机构
[1] Imperial Coll London, Dept Comp, London, England
[2] Queen Mary Univ London, Sch Elect Engn & Comp Sci, London E1 4NS, England
[3] Chinese Acad Sci, Shenzhen Inst Adv Technol, Beijing, Peoples R China
[4] Univ Lancaster, Sch Comp & Commun, Lancaster, England
[5] Univ Birmingham, Sch Comp Sci, Birmingham, W Midlands, England
[6] Vienna Univ Technol, Distributed Syst Grp, Vienna, Austria
[7] Univ Manchester, Dept Comp Sci, Manchester, Lancs, England
[8] Cardiff Univ, Sch Comp Sci & Informat, Cardiff, Wales
[9] Univ Melbourne, Sch Comp & Informat Syst, Cloud Comp & Distributed Syst Clouds Lab, Melbourne, FL USA
[10] Loughborough Univ, Loughborough, Leics, England
基金
中国国家自然科学基金; 英国工程与自然科学研究理事会;
关键词
Holistic resource management; Energy-efficiency; Cloud computing; Artificial intelligence; Thermal management; DATA CENTERS; WORKLOAD MANAGEMENT; ENERGY; NETWORK; CONSOLIDATION; SIMULATION; ALGORITHM;
D O I
10.1016/j.jss.2021.111124
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The worldwide adoption of cloud data centers (CDCs) has given rise to the ubiquitous demand for hosting application services on the cloud. Further, contemporary data-intensive industries have seen a sharp upsurge in the resource requirements of modern applications. This has led to the provisioning of an increased number of cloud servers, giving rise to higher energy consumption and, consequently, sustainability concerns. Traditional heuristics and reinforcement learning based algorithms for energy-efficient cloud resource management address the scalability and adaptability related challenges to a limited extent. Existing work often fails to capture dependencies across thermal characteristics of hosts, resource consumption of tasks and the corresponding scheduling decisions. This leads to poor scalability and an increase in the compute resource requirements, particularly in environments with non-stationary resource demands. To address these limitations, we propose an artificial intelligence (AI) based holistic resource management technique for sustainable cloud computing called HUNTER. The proposed model formulates the goal of optimizing energy efficiency in data centers as a multi-objective scheduling problem, considering three important models: energy, thermal and cooling. HUNTER utilizes a Gated Graph Convolution Network as a surrogate model for approximating the Quality of Service (QoS) for a system state and generating optimal scheduling decisions. Experiments on simulated and physical cloud environments using the CloudSim toolkit and the COSCO framework show that HUNTER outperforms state-of-the-art baselines in terms of energy consumption, SLA violation, scheduling time, cost and temperature by up to 12, 35, 43, 54 and 3 percent respectively. (c) 2021 Elsevier Inc. All rights reserved.
引用
收藏
页数:15
相关论文
共 37 条
[1]   Thermal-Aware Virtual Machine Allocation for Heterogeneous Cloud Data Centers [J].
Akbari, Abbas ;
Khonsari, Ahmad ;
Ghoreyshi, Seyed Mohammad .
ENERGIES, 2020, 13 (11)
[2]   IoTSim-SDWAN: A simulation framework for interconnecting distributed datacenters over Software-Defined Wide Area Network (SD-WAN) [J].
Alwasel, Khaled ;
Jha, Devki Nandan ;
Hernandez, Eduardo ;
Puthal, Deepak ;
Barika, Mutaz ;
Varghese, Blesson ;
Garg, Saurabh Kumar ;
James, Philip ;
Zomaya, Albert ;
Morgan, Graham ;
Ranjan, Rajiv .
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2020, 143 :17-35
[3]   Novel energy and SLA efficient resource management heuristics for consolidation of virtual machines in cloud data centers [J].
Arianyan, Ehsan ;
Taheri, Hassan ;
Sharifian, Saeed .
COMPUTERS & ELECTRICAL ENGINEERING, 2015, 47 :222-240
[4]  
Bahdanau D, 2016, Arxiv, DOI [arXiv:1409.0473, DOI 10.48550/ARXIV.1409.0473]
[5]   FECBench: A Holistic Interference-aware Approach for Application Performance Modeling [J].
Barve, Yogesh D. ;
Shekhar, Shashank ;
Chhokra, Ajay ;
Khare, Shweta ;
Bhattacharjee, Anirban ;
Kang, Zhuangwei ;
Sun, Hongyang ;
Gokhale, Aniruddha .
2019 IEEE INTERNATIONAL CONFERENCE ON CLOUD ENGINEERING (IC2E), 2019, :211-221
[6]   Learn-as-you-go with Megh: Efficient Live Migration of Virtual Machines [J].
Basu, Debabrota ;
Wang, Xiayang ;
Hong, Yang ;
Chen, Haibo ;
Bressan, Stephane .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2019, 30 (08) :1786-1801
[7]   Energy-Efficient Cloud Computing [J].
Berl, Andreas ;
Gelenbe, Erol ;
Di Girolamo, Marco ;
Giuliani, Giovanni ;
De Meer, Hermann ;
Dang, Minh Quan ;
Pentikousis, Kostas .
COMPUTER JOURNAL, 2010, 53 (07) :1045-1051
[8]  
Butt AA, 2019, INT WIREL COMMUN, P1588
[9]   CloudSim: a toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms [J].
Calheiros, Rodrigo N. ;
Ranjan, Rajiv ;
Beloglazov, Anton ;
De Rose, Cesar A. F. ;
Buyya, Rajkumar .
SOFTWARE-PRACTICE & EXPERIENCE, 2011, 41 (01) :23-50
[10]   Thermal-Aware Scheduling in Green Data [J].
Chaudhry, Muhammad Tayyab ;
Ling, Teck Chaw ;
Manzoor, Atif ;
Hussain, Syed Asad ;
Kim, Jongwon .
ACM COMPUTING SURVEYS, 2015, 47 (03)