A PERFORMANCE ANALYSIS OF A BUDDY SYSTEM FOR FAULT TOLERANCE

被引:1
|
作者
FINKEL, D [1 ]
TRIPATHI, SK [1 ]
机构
[1] UNIV MARYLAND,INST ADV COMP STUDIES,DEPT COMP SCI,COLLEGE PK,MD 20742
基金
美国国家科学基金会;
关键词
Bulk Arrivals; Distributed Systems; Fault Tolerance; Performance Evaluation; Queuing Models;
D O I
10.1016/0166-5316(90)90010-G
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
A model for fault tolerant computing in a distributed computing system is presented and analyzed. Each time a job is submitted, two copies of it are stored: one at its original node, where it will normally be executed, and the other at a second node, called the buddy node. If the original node fails, the copy at the buddy node will be executed, providing fault tolerance. By means of an iterative procedure, the average queue length and the average response time may be calculated, with some simplifying assumptions. Comparison with simulation results shows excellent agreement. Numerical results are presented to show the effects of varying the parameters on the performance of the system. © 1990.
引用
收藏
页码:177 / 185
页数:9
相关论文
共 50 条
  • [31] Increasing SCADA System Availability by Fault Tolerance Techniques
    Mikhail, Abrosimov
    Kamil, Iehab Abduljabbar
    Mahajan, Hemant
    2017 INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION, CONTROL AND AUTOMATION (ICCUBEA), 2017,
  • [32] Reliable Visual Exploration System with Fault Tolerance Structure
    Chen, Weinan
    Zhu, Lei
    He, Li
    Guan, Yisheng
    Zhang, Hong
    APPLIED SCIENCES-BASEL, 2019, 9 (04):
  • [33] Architectural Support for Fault Tolerance in a Teradevice Dataflow System
    Weis, Sebastian
    Garbade, Arne
    Fechner, Bernhard
    Mendelson, Avi
    Giorgi, Roberto
    Ungerer, Theo
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2016, 44 (02) : 208 - 232
  • [34] The applied study of fault tolerance of missile control system
    Yang, Q
    Liu, XC
    Han, DR
    He, XF
    ISTM/2003: 5TH INTERNATIONAL SYMPOSIUM ON TEST AND MEASUREMENT, VOLS 1-6, CONFERENCE PROCEEDINGS, 2003, : 3083 - 3086
  • [35] Fault tolerance: A means to provide reliable computing system
    Karim, Lutful
    Shorfuzzaman, Mohammad
    WMSCI 2005: 9th World Multi-Conference on Systemics, Cybernetics and Informatics, Vol 4, 2005, : 35 - 40
  • [36] Architectural Support for Fault Tolerance in a Teradevice Dataflow System
    Sebastian Weis
    Arne Garbade
    Bernhard Fechner
    Avi Mendelson
    Roberto Giorgi
    Theo Ungerer
    International Journal of Parallel Programming, 2016, 44 : 208 - 232
  • [37] A Distributed Fault Tolerance Mechanism for an IoT Healthcare system
    Zaiter, Meriem
    Hacini, Salima
    Moussa, Guedrez
    2020 21ST INTERNATIONAL ARAB CONFERENCE ON INFORMATION TECHNOLOGY (ACIT), 2020,
  • [38] Fault Tolerance Analysis of Spacecraft Formation Via Impulsive Dimension-Varying Switched System
    Pan, Jiao
    Yang, Hao
    Jiang, Bin
    2014 INTERNATIONAL CONFERENCE ON MECHATRONICS AND CONTROL (ICMC), 2014, : 153 - 158
  • [39] Design and Analysis of Peer-to-Peer Fault-Tolerance Approach in a Grid Computing System
    Tangmankhong, Thagorn
    Siripongwutikorn, Peerapon
    Achalakul, Tiranee
    CHIANG MAI JOURNAL OF SCIENCE, 2017, 44 (02): : 688 - 698
  • [40] A Novel Approach for Fault Tolerance Control System and Embedded System Security
    Khadse, Tushar S.
    Karmore, Swapnili P.
    1ST INTERNATIONAL CONFERENCE ON INFORMATION SECURITY & PRIVACY 2015, 2016, 78 : 799 - 806