Experience Report: On the Impact of Software Faults in the Privileged Virtual Machine

被引:5
作者
Cerveira, Frederico [1 ]
Barbosa, Raul [1 ]
Madeira, Henrique [1 ]
机构
[1] Univ Coimbra, Dept Informat Engn, CISUC, P-3030290 Coimbra, Portugal
来源
2017 IEEE 28TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING (ISSRE) | 2017年
基金
欧盟地平线“2020”;
关键词
Dependability; Software Faults; Virtualization; Fault Injection; PRACTICAL APPROACH;
D O I
10.1109/ISSRE.2017.39
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cloud computing is revolutionizing how organizations treat computing resources. The privileged virtual machine is a key component in systems that use virtualization, but poses a dependability risk for several reasons. The activation of residual software faults that exist in every software project is a real threat and can impact the correct operation of the entire virtualized system. To study this question, we begin by performing a detailed analysis of the privileged virtual machine and its components, followed by software fault injection campaigns that target two of those important components - toolstack and a device driver. The obstacles faced during this experimental phase and how they were overcome is herein described with practitioners in mind. The results show that software faults in those components can have either no impact or lead to drastic failures, showing that the privileged virtual machine is a single point of failure that must be protected (for 4-9% of the faults). Most of the failures are detectable by monitoring basic functionalities, but some faults caused inconsistent states that manifest later on. No silent data failures (SDF) have been observed, but the number of faults injected so far only allows to conclude that SDF are not very frequent.
引用
收藏
页码:136 / 145
页数:10
相关论文
共 33 条
[1]  
Acree A. T., 1979, DTIC DOCUMENT
[2]  
Albinet A, 2004, 2004 INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS, PROCEEDINGS, P867
[3]  
[Anonymous], 2003, ACM SIGOPS OPERATING
[4]  
Candea G, 2004, USENIX Association Proceedings of the Sixth Symposium on Operating Systems Design and Implementation (OSDE '04), P31
[5]  
Catteddu D., 2010, CLOUD COMPUTING BENE, P17, DOI DOI 10.1007/978-3-642-16120-9_9
[6]   Recovery for Virtualized Environments [J].
Cerveira, Frederico ;
Barbosa, Raul ;
Madeira, Henrique ;
Araujo, Filipe .
2015 ELEVENTH EUROPEAN DEPENDABLE COMPUTING CONFERENCE (EDCC), 2015, :25-36
[7]  
Chisnall David., 2008, DEFINITIVE GUIDE XEN
[8]  
Chou A., 2001, Operating Systems Review, V35, P73, DOI 10.1145/502059.502042
[9]   Practical and representative faultloads for large-scale software systems [J].
Costa, Pedro ;
Silva, Joao Gabriel ;
Madeira, Henrique .
JOURNAL OF SYSTEMS AND SOFTWARE, 2015, 103 :182-197
[10]  
Cully B., 2008, P S NETW SYST DES IM, P161