Software Rejuvenation based Fault Tolerance Scheme for Cloud Applications

被引:26
作者
Liu, Jing [1 ]
Zhou, Jiantao [1 ]
Buyya, Rajkumar [2 ]
机构
[1] Inner Mongolia Univ, Coll Comp Sci, Hohhot, Peoples R China
[2] Univ Melbourne, Dept Comp & Informat Syst, CLOUDS Lab, Melbourne, Vic 3010, Australia
来源
2015 IEEE 8TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING | 2015年
关键词
software rejuvenation; failure prediction; live VM migration; checkpoint; cloud computing;
D O I
10.1109/CLOUD.2015.164
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cloud applications are typically composed of multiple cloud service components communicating with each other through web service interfaces, where each component fulfills specified functionalities. Lack of effective fault tolerance scheme is one of major obstacles for enhancing availability and efficiency of complex and aging cloud application systems. In this paper, we propose a holistic software rejuvenation based fault tolerance scheme for cloud applications, which contains three indispensible parts: adaptive failure detection, aging degree evaluation, and checkpoint with trace replay based component rejuvenation. Through a preliminary and qualitative evaluation, it shows that our new fault tolerance scheme brings promising improvement on the availability of cloud applications.
引用
收藏
页码:1115 / 1118
页数:4
相关论文
共 11 条
[1]  
ARAUJO J, 2011, P 2011 IEEE INT C SY, P1411
[2]   Workload-Based Software Rejuvenation in Cloud Systems [J].
Bruneo, Dario ;
Distefano, Salvatore ;
Longo, Francesco ;
Puliafito, Antonio ;
Scarpa, Marco .
IEEE TRANSACTIONS ON COMPUTERS, 2013, 62 (06) :1072-1085
[3]  
Cotroneo D., 2011, Proceedings of the 2011 IEEE Third International Workshop on Software Aging and Rejuvenation (WoSAR 2011), P1, DOI 10.1109/WoSAR.2011.15
[4]  
Di S, 2013, INT C HIGH PERFORM, P69, DOI 10.1109/HiPC.2013.6799101
[5]  
EGWUTUOHA IP, 2013, P IEEE 6 INT C CLOUD, P762, DOI DOI 10.1109/CLOUD.2013.69
[6]  
Jhawar R., 2014, CYBER SECURITY IT IN, P1
[7]  
Langner F, 2013, 2013 IFIP/IEEE INTERNATIONAL SYMPOSIUM ON INTEGRATED NETWORK MANAGEMENT (IM 2013), P896
[8]   Live Virtual Machine Migration via Asynchronous Replication and State Synchronization [J].
Liu, Haikun ;
Jin, Hai ;
Liao, Xiaofei ;
Yu, Chen ;
Xu, Cheng-Zhong .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2011, 22 (12) :1986-1999
[9]   A Survey of Migration Mechanisms of Virtual Machines [J].
Medina, Violeta ;
Manuel Garcia, Juan .
ACM COMPUTING SURVEYS, 2014, 46 (03)
[10]  
MELO M, 2013, P 2013 IEEE INT C SY, P4110, DOI DOI 10.1109/SMC.2013.701