HIGH-PERFORMANCE FAULT-TOLERANT VLSI SYSTEMS USING MICRO ROLLBACK

被引:49
作者
TAMIR, Y
TREMBLAY, M
机构
[1] Department of Computer Science, University of California, Los Angeles
关键词
Concurrent error detection; error detection latency; fault-tolerant architectures; high-performance processors; rollback; VLSI implementation; VLSI systems;
D O I
10.1109/12.54848
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
A key to achieving a high degree of fault tolerance is the ability to detect errors as soon as they occur and prevent erroneous information from spreading throughout the system. In highly reliable systems, this is usually accomplished by checkers and isolation circuits in the communication paths from each module to the rest of the system. This additional circuitry reduces performance by requiring either longer clock cycles or additional pipeline stages. We present a technique, called micro rollback, which allows most of the performance penalty for concurrent error detection to be eliminated. Detection is performed in parallel with the transmission of information between modules, thus removing the delay for detection from the critical path. Erroneous information may thus reach its destination module several clock cycles before an error indication. Operations performed on this erroneous information are “undone” using a hardware mechanism for fast rollback of a few cycles. We discuss the implementation of a VLSI processor capable of micro rollback as well as several critical issues related to its use in a complete system. © 1990 IEEE
引用
收藏
页码:548 / 554
页数:7
相关论文
共 9 条
[1]  
CENSIER LM, 1978, IEEE T COMPUT, V27, P1112
[2]  
Hwu W. W., 1987, 14th Annual International Symposium on Computer Architecture. Conference Proceedings (Cat. No.87CH2420-8), P18, DOI 10.1145/30350.30353
[3]  
LAMPORT L, 1979, IEEE T COMPUT, V28, P690, DOI 10.1109/TC.1979.1675439
[4]  
PATEL JH, 1982, IEEE T COMPUT, V31, P589, DOI 10.1109/TC.1982.1676055
[5]  
Randell B., 1978, ACM COMPUT SURV, V10, P123, DOI DOI 10.1145/356725.356729
[6]   A 32-BIT NMOS MICROPROCESSOR WITH A LARGE REGISTER FILE [J].
SHERBURNE, RW ;
KATEVENIS, MGH ;
PATTERSON, DA ;
SEQUIN, CH .
IEEE JOURNAL OF SOLID-STATE CIRCUITS, 1984, 19 (05) :682-689
[7]   IMPLEMENTING PRECISE INTERRUPTS IN PIPELINED PROCESSORS [J].
SMITH, JE ;
PLESZKUN, AR .
IEEE TRANSACTIONS ON COMPUTERS, 1988, 37 (05) :562-573
[8]  
Tamir Y., 1988, Eighteenth International Symposium on Fault-Tolerant Computing. Digest of Papers. FTCS-18 (Cat. No.88CH2543-7), P234, DOI 10.1109/FTCS.1988.5325
[9]  
[No title captured]