Reliable resource provision policy for cloud computing

被引:13
作者
Tian G.-H. [1 ,2 ]
Meng D. [1 ]
Zhan J.-F. [1 ]
机构
[1] Institute of Computing Technology, Chinese Acad. of Sci.
[2] Graduate University of Chinese Acad. of Sci.
来源
Jisuanji Xuebao/Chinese Journal of Computers | 2010年 / 33卷 / 10期
关键词
Cloud computing; Failure rules; Heterogeneous workload; Reliability; Resource provisioning;
D O I
10.3724/SP.J.1016.2010.01859
中图分类号
学科分类号
摘要
Cloud computing has become a hot topic, researchers proposed various resource sharing technique and resource provision technique. However, very limited literatures pay attentions to the reliability of dynamically provided resources. This paper proposes failure rules aware node resource provision policies for heterogeneous services consolidated in cloud computing infrastructure, and evaluates the proposed policy with simulation approach, i.e., implements a simulator of heterogeneous service consolidation platform, which take characteristics of heterogeneous services (both characteristics of resource utility and failure rules), into consideration, and uses two production traces to synthesize inputs. In order to evaluate wide ranges of failure rules, this paper proposes a multi-dimension failure modeling framework, i.e., adapt various factors about failure distribution involving temporal and spatial factors to study the proposed policy's capability. The results of evaluation indicate that the proposed resource provision policy is effective for providing robust nodes for heterogeneous services, i.e., the policy can mask more potential node reboot failures from services and leave less chances of unplanned failures, e.g., service failure or node reboot, compared with baseline fault re-provided policy. In addition, the policy is able to mask non-uniformly distribution among resource's reliability system wide. Meanwhile, the policy involves no negative impact on service performance and on node's resource utility, compared with baseline policy. Evaluation with failure rules about temporal and spatial factors indicates that the policy is useful for could computing environment.
引用
收藏
页码:1859 / 1872
页数:13
相关论文
共 19 条
  • [1] Rochwerger B., Breitgand D., Levy E., Et al., The Reservoir model and architecture for open federated cloud computing, IBM Journal of Research and Development, 53, 4, pp. 1-17, (2009)
  • [2] Nurmi D., Wolski R., Grzegorczyk C., Et al., The eucalyptus open-source cloud-computing system, Proceedings of the 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid, pp. 124-131, (2009)
  • [3] Armbrust M., Fox A., Griffith R., Et al., Above the clouds: A berkeley view of cloud computing, pp. 1-25, (2009)
  • [4] Vaquero L.M., Rodero-Merino L., Caceres J., Et al., A break in the clouds: Towards a cloud definition, ACM SIGCOMM Computer Communication Review, 39, 1, pp. 50-55, (2009)
  • [5] Irwin D., Chase J.S., Grit L., Et al., Sharing networked resources with brokered leases, Proceedings of the USENIX Technical Conference, pp. 199-212, (2006)
  • [6] Padala P., Shin K.G., Zhu X.-Y., Et al., Adaptive control of virtualized resources in utility computing environments, Proceedings of the 2nd ACM SIGOPS/EuroSys European Conference on Computer Systems 2007, pp. 289-302, (2007)
  • [7] Schroeder B., Gibson G.A., A large-scale study of failures in high-performance computing systems, Proceedings of DSN2006, pp. 249-258, (2006)
  • [8] Heath T., Martin R.P., Nguyen T.D., Improving cluster availability using workstation validation, Proceedings of the ACM SIGMETRICS, pp. 217-227, (2002)
  • [9] Tai A.T., Tso K.S., Sanders W.H., Et al., Chau: A performability-oriented software rejuvenation framework for distributed applications, Proceedings of the 35th DSN 2005, pp. 570-579, (2005)
  • [10] Vaidyanathan K., Harper R.E., Hunter S.W., Et al., Analysis and implementation of software rejuvenation in cluster systems, Proceedings of the ACM SIGMETRICS 2001, pp. 62-71, (2001)