Experimental evaluation of error-detection mechanisms

被引:12
|
作者
Constantinescu, C [1 ]
机构
[1] Intel Corp, Enterprise Architecture Lab JF1 231, Hillsboro, OR 97124 USA
关键词
coverage probability; error detection; fault injection; statistical inference;
D O I
10.1109/TR.2002.805785
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Effective, error-detection is paramount for building highly dependable computing systems. A new methodology, based on physical and simulated fault injection, has been developed for assessing the effectiveness of error-detection mechanisms. This approach has 2 steps: 1) Transient faults are physically injected at the IC pin level of a prototype, in order to derive the error-detection coverage. Experiments are carried out in a 3-dimensional space of events. Fault location, time of occurrence, and duration of the injected fault are the dimensions of this space. 2) Simulated fault-injection is performed to assess the effectiveness of new error-detection mechanisms, designed to improve the detection coverage. Complex circuitry, based on checking for protocol violations, is considered. A temporal model of the protocol checker is used, and transient faults are injected in signal traces captured from the prototype system. These traces are used as inputs of the simulation engine. s-Confidence intervals of the error-detection coverage are derived, both for the initial design and-the new detection mechanism. Physical fault-injection, carried out on a prototype server, proved that several signals were sensitive to transient faults and error-detection coverage was unacceptably low. Simulated fault injection shows that an error-detection mechanism, based on checking for protocol violations, can appreciably increase the detection coverage, especially for transient faults longer that 200 nanoseconds. Additional research is required for improving the error-detection of shorter transients. Fault injection experiments also show that error-detection coverage is a function of fault duration: the shorter the transient fault, the lower the coverage. As a consequence, injecting faults that have a unique, predefined duration, as it was frequently done in the past, does not provide accurate information on the effectiveness of the error-detection mechanisms. Injecting only permanent faults leads to unrealistically high estimates of the coverage. These experiments prove that combined physical and simulated fault injection, performed in a 3-dimensional space of events, is a superior approach, which allows the designers to accurately assess the efficacy of various candidate error-detection mechanisms without building expensive test circuits.
引用
收藏
页码:53 / 57
页数:5
相关论文
共 50 条
  • [1] Experimental evaluation of three concurrent error detection mechanisms
    Vahdatpour, Ahreza
    Fazeli, Mahdi
    Miremadi, Seyed Ghassem
    2006 INTERNATIONAL CONFERENCE ON MICROELECTRONICS, 2007, : 67 - +
  • [2] OPTIMAL CODES FOR ERROR-DETECTION
    KLOVE, T
    IEEE TRANSACTIONS ON INFORMATION THEORY, 1992, 38 (02) : 479 - 489
  • [3] Language equations, maximality and error-detection
    Kari, L
    Konstantinidis, S
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2005, 70 (01) : 157 - 178
  • [4] Error-Detection in Enterprise Application Integration Solutions
    Frantz, Rafael Z.
    Corchuelo, Rafael
    Molina-Jimenez, Carlos
    ENTERPRISE INFORMATION SYSTEMS, PT 2, 2011, 220 : 170 - +
  • [5] A COMPARATIVE-STUDY OF ERROR-DETECTION IN DISCOURSE
    JARVELLA, RJ
    LUNDBERG, I
    KALLIOKOSKI, L
    READING AND WRITING, 1992, 4 (01) : 79 - 94
  • [6] GOOD ERROR-DETECTION CODES SATISFYING THE EXPURGATED BOUND
    HASHIMOTO, T
    IEEE TRANSACTIONS ON INFORMATION THEORY, 1995, 41 (05) : 1347 - 1353
  • [7] Collaborative Facilitation through Error-Detection: A Classroom Experiment
    Gadgil, Soniya
    Nokes, Timothy J.
    COGNITION IN FLUX, 2010, : 2583 - 2588
  • [8] EVALUATING PROCESSOR-BEHAVIOR AND 3 ERROR-DETECTION MECHANISMS USING PHYSICAL FAULT-INJECTION
    MIREMADI, G
    TORIN, J
    IEEE TRANSACTIONS ON RELIABILITY, 1995, 44 (03) : 441 - 454
  • [9] CONCURRENT ERROR-DETECTION AND FAULT LOCATION IN AN FFT ARCHITECTURE
    LOMBARDI, F
    MUZIO, JC
    IEEE JOURNAL OF SOLID-STATE CIRCUITS, 1992, 27 (05) : 728 - 736
  • [10] Educational Magic Tricks Based on Error-Detection Schemes
    Greenberg, Ronald, I
    ITICSE'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INNOVATION AND TECHNOLOGY IN COMPUTER SCIENCE EDUCATION, 2017, : 170 - 175