Adaptive parallel application resource remapping through the live migration of virtual machines

被引:14
作者
Atif, Muhammad [1 ]
Strazdins, Peter [2 ]
机构
[1] Australian Natl Univ, Natl Computat Infrastruct, Canberra, ACT 0200, Australia
[2] Australian Natl Univ, Coll Engn & Comp Sci, Dept Comp Sci, Canberra, ACT 0200, Australia
来源
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE | 2014年 / 37卷
关键词
Performance prediction; Virtualization; Heterogeneous clusters; Live migration; Cluster scheduling; Resource management;
D O I
10.1016/j.future.2013.06.028
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper we present ARRIVE-F, a novel open source framework which addresses the issue of heterogeneity in virtualized compute farms, such as those hosted by a cloud infrastructure provider. Unlike the previous attempts, our framework is not based on linear frequency models and does not require source code modifications or off-line profiling. The heterogeneous compute farm is first divided into a number of homogeneous sub-clusters. The framework then carries out a lightweight 'online' profiling of the CPU, communication and memory subsystems of all the active jobs in the compute farm. From this, it constructs a performance model to predict the execution times of each job on all the distinct sub-clusters in the compute farm. Based upon the predicted execution times, the framework is able to relocate the compute jobs to the currently best-suited hardware platforms such that the overall throughput of the compute farm is increased. We utilize the live migration feature of virtual machine monitors to migrate the job from one sub-cluster to another. The prediction accuracy of our performance estimation model is over 80%. The implementation of ARRIVE-F is lightweight, with an overhead of 3%. Experiments on a synthetic workload of scientific benchmarks show that we are able to improve the throughput of a moderately heterogeneous compute farm by up to 25%, with a time saving of up to 33%. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:148 / 161
页数:14
相关论文
共 26 条
[1]  
Atif M., 2011, TRCS1101 AUSTR NAT U
[2]   Optimizing Live Migration of Virtual Machines in SMP Clusters for HPC Applications [J].
Atif, Muhammad ;
Strazdins, Peter .
2009 6TH IFIP INTERNATIONAL CONFERENCE ON NETWORK AND PARALLEL COMPUTING, 2009, :51-58
[3]  
Atif Muhammad., 2009, Proceedings of the 3rd ACM Workshop on System-level Virtualization for High Performance Computing, HPCVirt '09, P9
[4]  
Barak Amnon, 1998, FUTURE GENER COMPUT
[5]  
Berman F., 1996, ACM IEEE C SUP, P39
[6]   EasyGrid: towards a framework for the automatic Grid enabling of legacy MPI applications [J].
Boeres, C ;
Rebello, VEF .
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2004, 16 (05) :425-432
[7]   A comparison of eleven static heuristics for mapping a class of independent tasks onto heterogeneous distributed computing systems [J].
Braun, TD ;
Siegel, HJ ;
Beck, N ;
Bölöni, LL ;
Maheswaran, M ;
Reuther, AI ;
Robertson, JP ;
Theys, MD ;
Yao, B ;
Hensgen, D ;
Freund, RF .
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2001, 61 (06) :810-837
[8]  
Ching-Hsien Hsu, 2011, Data Management in Grid and Peer-to-Peer Systems. Proceedings 4th International Conference (GLOBE 2011), P120, DOI 10.1007/978-3-642-22947-3_11
[9]  
Clark C., 2005, P 2 ACM USENIX S NET
[10]  
Dongarra J., 2006, ENG GRID STATUS PERS