A causal message logging protocol with asynchronous checkpointing for distributed systems

被引:0
|
作者
Ahn, J [1 ]
Kim, K [1 ]
Hwang, C [1 ]
机构
[1] Korea Univ, Dept Comp Sci & Engn, Seoul 136701, South Korea
关键词
distributed systems; fault-tolerance; asynchronous checkpointing; causal message logging; recovery;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Causal message logging is an efficient approach for tolerating failures of processes in distributed systems because it has the advantages of both pessimistic and optimistic message logging approach. However, traditional causal message logging protocols prevent live processes from executing continuously their computation and require some synchronous logging to the stable storage during recovery. Although Elnozahy protocol solves the problems, it has the central recovery leader problem. Additionally, if it were integrated with asynchronous checkpointing, it may result in inconsistency problems in case of concurrent failures. In this paper we present a new causal message logging protocol with asynchronous checkpointing to need to maintain only the latest checkpoint of each process and allow live processes to execute continuously their computation even in concurrent failures during recovery. Moreover the protocol solves the problems of Elnozahy protocol and improves asynchrony during recovery because the protocol enables each recovering process to be responsible for only its recovery.
引用
收藏
页码:523 / 528
页数:6
相关论文
共 50 条
  • [21] Scalable Sender-Based Message Logging Protocol with Little Communication Overhead for Distributed Systems
    Ahn, Jinho
    PARALLEL PROCESSING LETTERS, 2019, 29 (02)
  • [22] Soft-Checkpointing Based Hybrid Synchronous Checkpointing Protocol for Mobile Distributed Systems
    Kumar, Parveen
    Garg, Rachit
    INTERNATIONAL JOURNAL OF DISTRIBUTED SYSTEMS AND TECHNOLOGIES, 2011, 2 (01) : 1 - 13
  • [23] Movement-Based Checkpointing and Message Logging for Recovery in MANETs
    Parmeet Kaur Jaggi
    Awadhesh Kumar Singh
    Wireless Personal Communications, 2015, 83 : 1971 - 1993
  • [24] A communication-induced checkpointing and asynchronous recovery algorithm for multithreaded distributed systems
    Tantikul, T
    Manivannan, D
    PARALLEL AND DISTRIBUTED COMPUTING: APPLICATIONS AND TECHNOLOGIES, PROCEEDINGS, 2004, 3320 : 284 - 292
  • [25] A communication-induced checkpointing and asynchronous recovery protocol for mobile computing systems
    Tantikul, T
    Manivannan, D
    PDCAT 2005: SIXTH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES, PROCEEDINGS, 2005, : 70 - 74
  • [26] An efficient algorithm for causal message logging
    Lee, B
    Park, T
    Yeom, HY
    Cho, Y
    SEVENTEENTH IEEE SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS, PROCEEDINGS, 1998, : 19 - 25
  • [27] Movement-Based Checkpointing and Message Logging for Recovery in MANETs
    Jaggi, Parmeet Kaur
    Singh, Awadhesh Kumar
    WIRELESS PERSONAL COMMUNICATIONS, 2015, 83 (03) : 1971 - 1993
  • [28] A causal multicast protocol for mobile distributed systems
    Chi, KH
    Yen, LH
    Tseng, CC
    Huang, TL
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2000, E83D (12) : 2065 - 2074
  • [29] A causal broadcast protocol for distributed mobile systems
    Ohori, Chikara
    Inoue, Michiko
    Masuzawa, Toshimitsu
    Fujiwara, Hideo
    2001, John Wiley and Sons Inc. (32)
  • [30] An efficient communication induced rollforward checkpointing and recovery protocol for distributed systems
    Gu, MM
    Zeng, L
    Liang, ZH
    Gupta, B
    COMPUTERS AND THEIR APPLICATIONS, 2000, : 298 - 302