CONCURRENT ERROR-DETECTION USING WATCHDOG PROCESSORS - A SURVEY

被引:261
作者
MAHMOOD, A [1 ]
MCCLUSKEY, EJ [1 ]
机构
[1] STANFORD UNIV,COMP SYST LAB,CTR RELIABLE COMP,STANFORD,CA 94305
基金
美国国家科学基金会;
关键词
COMPUTERS; DIGITAL; -; Reliability;
D O I
10.1109/12.2145
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Concurrent system-level error detection techniques using a watchdog processor are surveyed. A watchdog processor is a small and simple coprocessor that detects errors by monitoring the behavior of a system. Like replication, it does not depend on any fault model for error detection. However, it requires less hardware than replication. It is shown that a large number of errors can be detected by monitoring the control flow and memory-access behavior. Two techniques for control-flow checking are discussed and compared to current error-detection techniques. A scheme for memory-access checking based on capability-based addressing is described. The design of a watchdog for performing reasonableness checks on the output of a main processor by executing assertions is discussed.
引用
收藏
页码:160 / 174
页数:15
相关论文
共 68 条
[1]  
Andrews D. M., 1981, 5th International Conference on Software Engineering, P254
[2]  
Andrews D. M., 1979, Ninth Annual International Symposium on Fault-Tolerant Computing, P102
[3]  
ANDREWS DM, 1978, 12TH AS C CIRC SYST, P641
[4]   SERVICE FEATURES AND CALL PROCESSING PLAN [J].
ANDREWS, RJ ;
DRISCOLL, JJ ;
HERNDON, JA ;
RICHARDS, PC ;
ROBERTS, LR .
BELL SYSTEM TECHNICAL JOURNAL, 1969, 48 (08) :2713-+
[5]  
AVIZIENIS A, 1984, COMPUTER, V17, P67, DOI 10.1109/MC.1984.1659219
[6]  
AVIZIENIS A, 1981, P AFIPS C, V50, P27
[7]  
BENDIXEN GE, 1983, 5TH P IEEE AIAA DIG
[8]   ADVANCED FAULT ISOLATION SYSTEM FOR DIGITAL LOGIC [J].
BENOWITZ, N ;
CALHOUN, DF ;
ALDERSON, GE ;
BAUER, JE ;
JOECKEL, CT .
IEEE TRANSACTIONS ON COMPUTERS, 1975, C 24 (05) :489-497
[9]  
Chen L., 1978, 8TH ANN INT S FAULT, P3
[10]  
CONNET JR, 1972, JUN INT S FAULT TOL, P94