DEPEND: A simulation-based environment for system level dependability analysis

Cited by: 73
Authors
Goswami, KK
Iyer, RK
Young, L
Affiliations
[1] UNIV ILLINOIS, CTR RELIABLE & HIGH PERFORMANCE COMP, URBANA, IL 61801 USA
[2] TANDEM COMP INC, AUSTIN, TX 78728 USA
Funding
National Aeronautics and Space Administration (NASA)
Keywords
simulation; fault injection; dependability analysis; correlated errors; latent errors; intercomponent dependence; object-oriented design; Tandem TMR-based prototype analysis; validation
DOI
10.1109/12.559803
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
The paper presents the rationale for a functional simulation tool, called DEPEND, which provides an integrated design and fault injection environment for system level dependability analysis. The paper discusses the issues and problems of developing such a tool and describes how DEPEND tackles them. It presents techniques developed to simulate realistic fault scenarios, reduce simulation time explosion, and handle the large fault model and component domain associated with system level analysis. Examples are used to motivate and illustrate the benefits of the tool. To further illustrate its capabilities, DEPEND is used to simulate the Unix-based Tandem prototype fault-tolerant system, which is built on triple modular redundancy (TMR), and to evaluate how well it handles near-coincident errors caused by correlated and latent faults. Issues that affect how the system handles near-coincident errors, such as memory scrubbing, re-integration policies, and workload-dependent repair times, are also evaluated. Unlike other simulation-based dependability studies, the accuracy of the simulation model is validated by comparing the simulation results with measurements obtained from fault injection experiments conducted on a production Tandem machine.
Pages: 60-74
Number of pages: 15