Reliability and Performance Analysis of Architecture-Based Software Implementing Restarts and Retries Subject to Correlated Component Failures

被引:2
|
作者
Li, Xiao-Dan [1 ]
Yin, Yong-Feng [1 ]
Fiondella, Lance [2 ]
机构
[1] Beihang Univ, Sch Reliabil & Syst Engn, Beijing 100191, Peoples R China
[2] Univ Massachusetts, Dept Elect & Comp Engn, N Dartmouth, MA 02747 USA
基金
中国国家自然科学基金;
关键词
Software architecture; software reliability; software performance; software recovery mechanisms; correlated component failure; MODEL;
D O I
10.1142/S0218194015500266
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
High reliability and performance are essential attributes of software systems designed for critical real-time applications. To improve the reliability and performance of software, many systems incorporate some form of fault recovery mechanism. However, contemporary models of software reliability and performance rarely consider these fault recovery mechanisms. Another notable shortcoming of many software models is that they make the simplifying assumption that component failures are statistically independent, which disagrees with several experimental studies that have shown that the failures of software components can exhibit correlation. This paper presents an architecture-based model of software reliability and performance that explicitly considers a two-stage fault recovery mechanism implementing component restarts and application-level retries. The application architecture is characterized by a Discrete Time Markov Chain (DTMC) to represent the dynamic branching behavior of control between the components of the application. Correlations between the component failures are computed with an efficient numerical algorithm for a multivariate Bernoulli (MVB) distribution. We illustrate the utility of the model through a case study of an embedded software application. The results suggest that the model can be used to quantify the impact of software fault recovery and correlated component failures on application reliability and performance.
引用
收藏
页码:1307 / 1334
页数:28
相关论文
共 39 条
  • [1] Architecture-based software reliability analysis with interface failures
    Gokhale, Swapna S.
    Lipton, Michael W.
    ELEVENTH ISSAT INTERNATIONAL CONFERENCE RELIABILITY AND QUALITY IN DESIGN, PROCEEDINGS, 2005, : 6 - 11
  • [2] An analytical approach to architecture-based software performance and reliability prediction
    Gokhale, SS
    Wong, WE
    Horgan, JR
    Trivedi, KS
    PERFORMANCE EVALUATION, 2004, 58 (04) : 391 - 412
  • [3] Efficient Software Reliability Analysis With Correlated Component Failures
    Fiondella, Lance
    Rajasekaran, Sanguthevar
    Gokhale, Swapna S.
    IEEE TRANSACTIONS ON RELIABILITY, 2013, 62 (01) : 244 - 255
  • [4] Architecture-based software reliability analysis: Overview and limitations
    Gokhale, Swapna S.
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2007, 4 (01) : 32 - 40
  • [5] Software reliability analysis considering correlated component failures with coupling measurement framework
    Xiaodan Li
    Yongfeng Yin
    Lance Fiondella
    Yibin Zhou
    JournalofSystemsEngineeringandElectronics, 2015, 26 (05) : 1114 - 1126
  • [6] Software reliability analysis considering correlated component failures with coupling measurement framework
    Li, Xiaodan
    Yin, Yongfeng
    Fiondella, Lance
    Zhou, Yibin
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2015, 26 (05) : 1114 - 1126
  • [7] Architecture-based software reliability modeling
    Wang, WL
    Pan, D
    Chen, MH
    JOURNAL OF SYSTEMS AND SOFTWARE, 2006, 79 (01) : 132 - 146
  • [8] Quantifying software performance, reliability and security: An architecture-based approach
    Sharma, Vibhu Saujanya
    Trivedi, Kishor S.
    JOURNAL OF SYSTEMS AND SOFTWARE, 2007, 80 (04) : 493 - 509
  • [10] Estimation of reliability and cost relationship for architecture-based software
    Guan H.
    Chen W.-R.
    Huang N.
    Yang H.-J.
    International Journal of Automation and Computing, 2010, 7 (4) : 603 - 610