Fault-tolerant on-board computing for robotic space missions

Cited by: 2
Authors
Zima, Hans P. [1, 2]
James, Mark L. [1]
Springer, Paul L. [1]
Affiliations
[1] CALTECH, Jet Prop Lab, Pasadena, CA 91109 USA
[2] Univ Vienna, Inst Comp Sci, Vienna, Austria
Funding
National Aeronautics and Space Administration (NASA)
Keywords
space-borne computing; fault tolerance; introspection
DOI
10.1002/cpe.1768
Chinese Library Classification number
TP31 [Computer software]
Discipline classification codes
081202; 0835
Abstract
This paper describes an approach to providing software fault tolerance for future deep-space robotic National Aeronautics and Space Administration missions, which will require a high degree of autonomy supported by an enhanced on-board computational capability. We focus on introspection-based adaptive fault tolerance guided by the specific requirements of applications. Introspection supports monitoring of the program execution with the goal of identifying, locating, and analyzing errors. Fault tolerance assertions for the introspection system can be derived from user input, domain-specific knowledge, or the results of static or dynamic program analysis. This work is part of an ongoing project at the Jet Propulsion Laboratory in Pasadena, California. Copyright (C) 2011 John Wiley & Sons, Ltd.
Pages: 2192-2204
Number of pages: 13