Dual use of superscalar datapath for transient-fault detection and recovery

被引:73
作者
Ray, J [1 ]
Hoe, JC [1 ]
Falsafi, B [1 ]
机构
[1] Carnegie Mellon Univ, Comp Architecture Lab, Pittsburgh, PA 15213 USA
来源
34TH ACM/IEEE INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE, MICRO-34, PROCEEDINGS | 2001年
关键词
D O I
10.1109/MICRO.2001.991120
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Diminutive devices and high clock frequency of future microprocessor generations are causing increased concerns for transient soft failures in hardware, necessitating fault detection and recovery mechanisms even in commodity processors. In this paper we propose a fault-tolerant extension for modern superscalar out-of-order datapath that can be supported by only modest additional hardware. In the proposed extensions, error-detection is achieved by verifying the redundant results of dynamically replicated threads of executions, while the error-recovery scheme employs the instruction-rewind mechanism to restart at a failed instruction. We study the performance impact of augmenting superscalar microarchitectures with this fault tolerance mechanism. An analytical performance model is used in conjunction with a performance simulator The simulation results of 11 SPEC95 and SPEC2000 benchmarks show that in the absence of faults, error detection causes a 2% to 45% reduction in throughput, which is in line with other proposed detection schemes. In the presence of transient faults, the fast error recovery scheme contributes very little additional slowdown.
引用
收藏
页码:214 / 224
页数:11
相关论文
共 22 条
  • [1] ANGHEL L, 2000, P 18 IEEE VLSI TEST
  • [2] Austin T., 1999, P 32 INT S MICR NOV
  • [3] Design challenges of technology scaling
    Borkar, S
    [J]. IEEE MICRO, 1999, 19 (04) : 23 - 29
  • [4] Burger D, 1997, 1342 U WISC MAD COMP
  • [5] *COMP COMP CORP, 1999, DAT INT COMP NONST H
  • [6] Single event effects in static and dynamic registers in a 0.25μm CMOS technology
    Faccio, F
    Kloukinas, K
    Marchioro, A
    Calin, T
    Cosculluela, J
    Nicolaidis, M
    Velazco, R
    [J]. IEEE TRANSACTIONS ON NUCLEAR SCIENCE, 1999, 46 (06) : 1434 - 1439
  • [7] Deep-submicron microprocessor design issues
    Flynn, MJ
    Hung, P
    Rudd, KW
    [J]. IEEE MICRO, 1999, 19 (04) : 11 - 22
  • [8] FRANKLIN M, 1995, P IEEE INT WORKSH DE
  • [9] HASNAIN A, 1992, P 30 INT REL PHYS S
  • [10] May T., 1978, P 16 INT REL PHYS S