Efficient causal message logging protocol integrated with asynchronous checkpointing

被引:0
|
作者
Ahn, Jinho [1 ]
机构
[1] Kyonggi Univ, Dept Comp Sci, Suwon 443760, Gyeonggido, South Korea
关键词
distributed systems; message passing; fault-tolerance; asynchronous checkpointing; causal message logging; recovery;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Log-based rollback recovery is a well-known fault-tolerance technique to combine message logging with checkpointing. Among log-based recovery approaches, causal message logging has failure-free performance advantage of optimistic message logging while ensuring the always-no-orphans property in case of failures like pessimistic message logging. However, most previous causal message logging protocols may not progress surviving processes' execution while incurring a number of stable storage accesses during recovery. A previous protocol attempts to addresses these issues, but charaterizes centralized recovery behavior and may make the system's global state inconsistent when recovering concurrent process crashes. This paper proposes an efficient causal message logging protocol to enable surviving processes to progress their execution regardless of simultaneous process crashes and alleviate the limitation of the previous one by performing synchronous and distributed recovery. Also, the proposed protocol has each process keep only its latest checkpoint on the stable storage and perform globally consistent recovery in case of being integrated with asynchronous checkpointing because it forces each recovering process to obtain recovery information related to the process from the other recovering processes as well as all live processes.
引用
收藏
页码:300 / 305
页数:6
相关论文
共 50 条
  • [41] An efficient communication induced rollforward checkpointing and recovery protocol for distributed systems
    Gu, MM
    Zeng, L
    Liang, ZH
    Gupta, B
    COMPUTERS AND THEIR APPLICATIONS, 2000, : 298 - 302
  • [42] An efficient termination protocol for asynchronous iterative algorithms
    ElBaz, D
    PARALLEL AND DISTRIBUTED COMPUTING SYSTEMS - PROCEEDINGS OF THE ISCA 9TH INTERNATIONAL CONFERENCE, VOLS I AND II, 1996, : 1 - 7
  • [43] Efficient reachability testing of asynchronous message-passing programs
    Lei, Y
    Tai, KC
    EIGHTH IEEE INTERNATIONAL CONFERENCE ON ENGINEERING OF COMPLEX COMPUTER SYSTEMS, PROCEEDINGS, 2002, : 35 - 44
  • [44] A Communication-Efficient Causal Broadcast Protocol
    de Araujo, Joao Paulo
    Arantes, Luciana
    Duarte Junior, Elias P.
    Rodrigues, Luiz A.
    Sens, Pierre
    PROCEEDINGS OF THE 47TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, 2018,
  • [45] MOCAVI: An Efficient Causal Protocol for Cellular Networks
    Lopez Dominguez, Eduardo
    Pomares Hernandez, Saul E.
    Rodriguez Gomez, Gustavo
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2008, 8 (01): : 136 - 144
  • [46] An efficient causal logging scheme for recoverable distributed shared memory systems
    Park, T
    Lee, I
    Yeom, HY
    PARALLEL COMPUTING, 2002, 28 (11) : 1549 - 1572
  • [47] An efficient message broadcasting MAC protocol for VANETs
    Zhiping Lin
    Yanglong Sun
    Yuliang Tang
    Zhaohui Liu
    Wireless Networks, 2020, 26 : 6043 - 6057
  • [48] Reasons for a Pessimistic or Optimistic Message Logging Protocol in MPI Uncoordinated Failure Recovery
    Bouteiller, Aurelien
    Ropars, Thomas
    Bosilca, George
    Morin, Christine
    Dongarra, Jack
    2009 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING AND WORKSHOPS, 2009, : 336 - +
  • [49] Group Sender-based Message Logging Protocol for Conquering Simultaneous Failures
    Ahn, Jinho
    ADVANCES IN DIGITAL TECHNOLOGIES, 2015, 275 : 28 - 38
  • [50] Efficient logging algorithm for incremental replay of message-passing applications
    Zambonelli, Franco
    Netzer, Robert H.B.
    Proceedings of the International Parallel Processing Symposium, IPPS, 1999, : 392 - 398