Hypevisor-based fault-tolerance

被引:100
作者
Bressoud, TC [1 ]
Schneider, FB [1 ]
机构
[1] CORNELL UNIV,ITHACA,NY 14853
来源
ACM TRANSACTIONS ON COMPUTER SYSTEMS | 1996年 / 14卷 / 01期
关键词
algorithms; reliability; fault-tolerant computing system; primary/backup approach; virtual-machine manager;
D O I
10.1145/225535.225538
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Protocols to implement a fault-tolerant computing system are described. These protocols augment the hypervisor of a virtual-machine manager and coordinate a primary virtual machine with its backup. No modifications to the hardware, operating system, or application programs are required. A prototype system was constructed for HP's PA-RISC instruction-set architecture. Even though the prototype was not carefully tuned, it ran programs about a factor of 2 slower than a bare machine would.
引用
收藏
页码:80 / 107
页数:28
相关论文
共 50 条
  • [31] Computing Graph Spanners in Small Memory: Fault-Tolerance and Streaming
    Ausiello, Giorgio
    Franciosa, Paolo G.
    Italiano, Giuseppe F.
    Ribichini, Andrea
    COMPUTING AND COMBINATORICS, 2010, 6196 : 160 - +
  • [32] Reliability analysis of fault-tolerance voyage data recorder system
    Hao, Yanling
    Zhou, Wenjun
    2005 IEEE International Conference on Mechatronics and Automations, Vols 1-4, Conference Proceedings, 2005, : 2190 - 2193
  • [33] Design of a 12-Pulse Cycloconverter with Fault-Tolerance Capability
    Guerrero Barria, Victor
    Pontt Olivares, Jorge
    PROCEEDINGS OF THE 2011-14TH EUROPEAN CONFERENCE ON POWER ELECTRONICS AND APPLICATIONS (EPE 2011), 2011,
  • [34] Pars network: A multistage interconnection network with fault-tolerance capability
    Bistouni, Fathollah
    Jahanshahi, Mohsen
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2015, 75 : 168 - 183
  • [35] A scalable fault-tolerance framework for mobile intelligent agent systems
    Vuong, S
    Chen, J
    6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL XV, PROCEEDINGS: MOBILE/WIRELESS COMPUTING AND COMMUNICATION SYSTEMS III, 2002, : 416 - 419
  • [36] ReBEC: A replacement-based energy-efficient fault-tolerance design for associative caches
    Gao, Xin
    Cui, Naiyuan
    Nian, Jiawei
    Liang, Zongnan
    Gao, Jiaxuan
    Liu, Hongjin
    Yang, Mengfei
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 155 : 39 - 52
  • [37] A Novel NoC-Based Design for Fault-Tolerance of Last-Level Caches in CMPs
    BanaiyanMofrad, Abbas
    Dutt, Nikil
    Girao, Gustavo
    CODES+ISSS'12:PROCEEDINGS OF THE TENTH ACM INTERNATIONAL CONFERENCE ON HARDWARE/SOFTWARE-CODESIGN AND SYSTEM SYNTHESIS, 2012, : 63 - 72
  • [38] Architecture-Based Reliability-Sensitive Criticality Measure for Fault-Tolerance Cloud Applications
    Wang, Lei
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2019, 30 (11) : 2408 - 2421
  • [39] Reliability and Fault-Tolerance Assessment of PMSM Drive for Electric Aircraft Applications
    Siadatan, Alireza
    Kalantarikhalilabad, Ali
    Rezaei-Zare, Afshin
    2023 IEEE INTERNATIONAL ELECTRIC MACHINES & DRIVES CONFERENCE, IEMDC, 2023,
  • [40] Toward a Smart Cloud: A Review of Fault-Tolerance Methods in Cloud Systems
    Mukwevho, Mukosi Abraham
    Celik, Turgay
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2021, 14 (02) : 589 - 605