Pareto-Optimal Cloud Bursting

被引:40
作者
Farahabady, Mohammad Reza Hoseiny [1 ,2 ]
Lee, Young Choon [1 ]
Zomaya, Albert Y. [1 ]
机构
[1] Univ Sydney, Sch Informat Technol, Ctr Distr & High Performance Comp, Sydney, NSW 2006, Australia
[2] NICTA, Networks Res Grp, Eveleigh, NSW 1430, Australia
基金
澳大利亚研究理事会;
关键词
Cloud computing; cloud bursting; resource allocation; cost efficiency; pareto-frontier; approximation algorithm;
D O I
10.1109/TPDS.2013.218
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Large-scale Bag-of-Tasks (BoT) applications are characterized by their massively parallel, yet independent operations. The use of resources in public clouds to dynamically expand the capacity of a private computer system might be an appealing alternative to cope with such massive parallelism. To fully realize the benefit of this 'cloud bursting', the performance to cost ratio (or cost efficiency) must be thoroughly studied and incorporated into scheduling and resource allocation strategies. In this paper, we present PANDA, a framework for static scheduling BoT applications across resources in both private and public clouds. The framework at the core incorporates a fully polynomial-time approximation scheme (FPTAS) as a novel scheduling algorithm, which generates schedules with the best trade-off point between cost and performance; hence Pareto-optimality. We have theoretically discussed the complexity and correctness of our algorithms, and experimentally verified their efficacy and practicality using ISOMAP-a widely-used nonlinear manifold method as a real-world BoT application. Our evaluation conducted in a 'multi-cloud' environment of our 40-core private system and Amazon EC2 public cloud demonstrates the scheduling quality of PANDA is guaranteed to be within a measurable distance from the optimal solution. Results obtained from our experiments show such quality is 8 percent or less from the optimum. We also show the sensitivity and robustness of our scheduling solutions against performance errors in both resources and applications.
引用
收藏
页码:2670 / 2682
页数:13
相关论文
共 28 条
  • [1] [Anonymous], 2008, Cost of power in large-scale data centers
  • [2] Armstrong TimothyG., 2010, 2010 3rd Workshop on Many-Task Computing on Grids and Supercomputers, P1
  • [3] Balasubramanian V, 2008, COMM COM INF SC, V21, P177
  • [4] ExPERT: Pareto-Efficient Task Replication on Grids and a Cloud
    Ben-Yehuda, Orna Agmon
    Schuster, Assaf
    Sharov, Artyom
    Silberstein, Mark
    Iosup, Alexandru
    [J]. 2012 IEEE 26TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2012, : 167 - 178
  • [5] Buyya R, 2000, PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS I-V, P517
  • [6] Casanova H, 2010, LECT NOTES COMPUT SC, V6271, P168, DOI 10.1007/978-3-642-15277-1_17
  • [7] Chandrakant A.J.S., 2005, COST MODEL PLANNING
  • [8] A Practical System for Modelling Body Shapes from Single View Measurements
    Chen, Yu
    Robertson, Duncan
    Cipolla, Roberto
    [J]. PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2011, 2011,
  • [9] Cormen T., 2001, Introduction to Algorithms
  • [10] An optimal algorithm for Monte Carlo estimation
    Dagum, P
    Karp, R
    Luby, M
    Ross, S
    [J]. SIAM JOURNAL ON COMPUTING, 2000, 29 (05) : 1484 - 1496