Performance assessment and reliability analysis of dependable and distributed computing systems based on BDD and recursive merge

被引:7
作者
Chang, Yung-Ruei [3 ]
Huang, Chin-Yu [1 ]
Kuo, Sy-Yen [2 ]
机构
[1] Natl Tsing Hua Univ, Dept Comp Sci, Hsinchu 30043, Taiwan
[2] Natl Taiwan Univ, Dept Elect Engn, Taipei 10764, Taiwan
[3] Atom Energy Council, Inst Nucl Energy Res, Tao Yuan, Taiwan
关键词
Reliability; OBDD; Fault-coverage; System availability; Distributed system; NETWORK-RELIABILITY; ALGORITHMS; EFFICIENT; TREES; PATH;
D O I
10.1016/j.amc.2010.05.075
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
System reliability evaluation, sensitivity analysis, failure frequency analysis, importance measures, and optimal design are important issues that have become research topics for distributed dependable computing. Finding all of the Minimal File Spanning Trees (MFST) and avoiding repeatedly computing the redundant MFSTs have been key techniques for evaluating the reliability of a distributed computing system (DCS) in previous works. However, identifying all of the disjointed MFSTs is difficult and time consuming for large-scale networks. Although existing algorithms have been demonstrated to work well on medium-scale networks, they have two inherent drawbacks. First, they do not support efficient manipulation of Boolean algebra. The sum-of-disjoint-products method used by these algorithms is inefficient when dealing with large Boolean functions. Second, the tree-based partitioning algorithm does not merge isomorphic sub-problems, and therefore, redundant computations cannot be avoided. In this paper, we propose a new efficient algorithm for the reliability evaluation of a DCS based on the recursive merge and the binary decision diagram (BDD). Using the BDD substitution method, we can easily apply our algorithm to a network with imperfect nodes. The experimental results show a significant improvement in the execution time compared to previous works. (C) 2010 Elsevier Inc. All rights reserved.
引用
收藏
页码:403 / 413
页数:11
相关论文
共 24 条
[1]   SIMPLE METHOD FOR RELIABILITY EVALUATION OF A COMMUNICATION SYSTEM [J].
AGGARWAL, KK ;
GUPTA, JS ;
MISRA, KB .
IEEE TRANSACTIONS ON COMMUNICATIONS, 1975, CO23 (05) :563-566
[2]  
BRYANT RE, 1986, IEEE T COMPUT, V35, P677, DOI 10.1109/TC.1986.1676819
[3]   OBDD-based evaluation of reliability and importance measures for multistate systems subject to imperfect fault coverage [J].
Chang, YR ;
Amari, SAV ;
Kuo, SY .
IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2005, 2 (04) :336-347
[4]   Computing system failure frequencies and reliability importance measures using OBDD [J].
Chang, YR ;
Amari, SV ;
Kuo, SY .
IEEE TRANSACTIONS ON COMPUTERS, 2004, 53 (01) :54-68
[5]   RELIABILITY-ANALYSIS OF DISTRIBUTED SYSTEMS BASED ON A FAST RELIABILITY ALGORITHM [J].
CHEN, DJ ;
HUANG, TH .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1992, 3 (02) :139-154
[6]   A heuristic approach to generating file spanning trees for reliability analysis of distributed computing systems [J].
Chen, DJ ;
Chen, RS ;
Huang, TH .
COMPUTERS & MATHEMATICS WITH APPLICATIONS, 1997, 34 (10) :115-131
[7]  
HARIRI S, 1987, IEEE T COMPUT, V36, P1224, DOI 10.1109/TC.1987.1676862
[8]   Reliability evaluation for distributed computing networks with imperfect nodes [J].
Ke, WJ ;
Wang, SD .
IEEE TRANSACTIONS ON RELIABILITY, 1997, 46 (03) :342-349
[10]   ON COMPUTER-COMMUNICATION NETWORK RELIABILITY UNDER PROGRAM EXECUTION CONSTRAINTS [J].
KUMAR, A ;
RAI, S ;
AGRAWAL, DP .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 1988, 6 (08) :1393-1400