An Aspect-Oriented Approach to Assessing Fault Tolerance

Cited by: 2
Authors
Cleveland, Jeffrey [1]
Loyall, Joseph [1]
Hanna, James [2]
Affiliations
[1] Raytheon BBN Technologies, Cambridge, MA, USA
[2] US Air Force Research Laboratory, Rome, NY, USA
Source
2014 IEEE MILITARY COMMUNICATIONS CONFERENCE: AFFORDABLE MISSION SUCCESS: MEETING THE CHALLENGE (MILCOM 2014) | 2014
Keywords
fault tolerance; assessment; testing; aspect-oriented programming; survivability
DOI
10.1109/MILCOM.2014.228
Chinese Library Classification
TN [Electronic technology, communication technology]
Discipline classification code
0809
Abstract
Fault tolerance and survivability are important aspects of many business-critical and mission-critical systems, but it is still difficult to assess how well fault tolerance techniques work. Ensuring fault tolerance in military communication systems is particularly important due to the inevitability of hardware failure, data corruption, or service interruption and the risk that cascading failures could jeopardize critical military operations. In this paper, we present a fault tolerance assessment framework designed for distributed systems that provides automated injection of faults without changes to client or server code and automated assessment of whether the injected faults are tolerated. The framework applies aspect-oriented programming, specifically AspectJ, to inject faults and weave in assessment criteria. Like traditional fault injectors, the framework supports assessing the tolerance of direct faults, such as crashes and corruption. It also supports conditional faults, which can be injected probabilistically, randomly, or periodically at runtime. This latter class of faults is not historically supported by fault injectors, but it enables the assessment of tolerance to many important classes of faults threatening modern distributed military communication systems, including timing faults, resource exhaustion (e.g., denial-of-service), and integrity faults that are traditionally difficult to tolerate and assess. Additionally, the framework provides a centralized view that lets users monitor and script coordinated tests comprising performance metrics and injected faults spanning services, applications, and hosts.
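
The idea of conditional fault injection described in the abstract can be illustrated with a small sketch. This is not the authors' framework, which uses AspectJ weaving; it is a minimal Python analogue that wraps a function so that faults are raised probabilistically or periodically without editing the function itself, and then counts how many client calls are tolerated. All names here (InjectedFault, probabilistic, periodic, inject_faults, lookup, call_with_retry, assess) are hypothetical and chosen only for illustration.

```python
import random
from typing import Callable, Optional


class InjectedFault(Exception):
    """Raised in place of a real failure (hypothetical name)."""


def probabilistic(p: float, seed: Optional[int] = None) -> Callable[[], bool]:
    """Policy: inject on each call with probability p."""
    rng = random.Random(seed)
    return lambda: rng.random() < p


def periodic(n: int) -> Callable[[], bool]:
    """Policy: inject on every n-th call."""
    count = 0

    def should_inject() -> bool:
        nonlocal count
        count += 1
        return count % n == 0

    return should_inject


def inject_faults(target: Callable, policy: Callable[[], bool]) -> Callable:
    """Wrap target so a fault is raised whenever policy() is true.

    The target's source is untouched, loosely mirroring how woven
    advice intercepts a call (an analogy, not AspectJ itself).
    """
    def wrapper(*args, **kwargs):
        if policy():
            raise InjectedFault(f"injected into {target.__name__}")
        return target(*args, **kwargs)

    return wrapper


def lookup(key: str) -> str:
    """Stand-in for a remote service call."""
    return key.upper()


def call_with_retry(fn: Callable, arg: str, attempts: int = 2) -> str:
    """Stand-in for the fault tolerance mechanism under assessment."""
    for attempt in range(attempts):
        try:
            return fn(arg)
        except InjectedFault:
            if attempt == attempts - 1:
                raise
    raise ValueError("attempts must be >= 1")


def assess(fn: Callable, calls: int) -> dict:
    """Count how many client calls succeed despite injected faults."""
    tolerated = failed = 0
    for i in range(calls):
        try:
            call_with_retry(fn, f"k{i}")
            tolerated += 1
        except InjectedFault:
            failed += 1
    return {"tolerated": tolerated, "failed": failed}
```

For example, assess(inject_faults(lookup, periodic(3)), 9) exercises a retrying client against a service that fails on every third call. Swapping in probabilistic(0.2) changes the injection condition without touching either the client or the service stand-in.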
Pages: 1374-1381
Page count: 8