Non-blocking atomic commitment in distributed systems: A tutorial based on a generic protocol

被引:0
作者
Raynal, M [1 ]
机构
[1] Inst Rech Informat & Syst Aleatoires, F-35042 Rennes, France
来源
COMPUTER SYSTEMS SCIENCE AND ENGINEERING | 2000年 / 15卷 / 02期
关键词
atomic commitment; consensus problem; distributed systems; failure detection; fault-tolerant protocols; non-blocking protocols; reliable multicast; transaction;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Agreement problems allow a set of processes to agree on a common output value. These problems are of primary importance in distributed systems and difficult to solve in presence of failures. This paper is a tutorial on one of these problems whose practical interest is well known, namely the Non-Blocking Atomic Commitment Problem This tutorial is based on an original generic protocol that solves the problem at an abstract level. Instantiations of its generic statements are given for both synchronous and asynchronous distributed systems. It is shown that the main problem to ensure the "non-blocking" property lies in failures detection. While failures can be safely detected in synchronous distributed systems, they can only be "suspected" (maybe falsely) in asynchronous distributed systems. So, this tutorial also visits problems such as reliable communication, failures detection and consensus whose solutions are used when instantiating the generic protocol in a particular context. Seen that way, this paper can be seen as a state-of-the-art introduction to concepts, problems and protocols related to fault-tolerance in distributed systems.
引用
收藏
页码:77 / 86
页数:10
相关论文
共 18 条
  • [1] Non-Blocking Atomic Commitment with Byzantine Faults
    Luiz, Aldelir Fernando
    Lung, Lau Cheuk
    Correia, Miguel
    Stumm Junior, Valdir
    2014 BRAZILIAN SYMPOSIUM ON COMPUTER NETWORKS AND DISTRIBUTED SYSTEMS (SBRC), 2014, : 156 - 164
  • [2] FNB: Fast Non-Blocking Coordinated Checkpointing Protocol for Distributed Systems
    Zohra Abdelhafidi
    Mohamed Djoudi
    Nasreddine Lagraa
    Mohamed Bachir Yagoubi
    Theory of Computing Systems, 2015, 57 : 397 - 425
  • [3] Non-blocking atomic commit in asynchronous distributed systems with failure detectors
    Guerraoui, R
    DISTRIBUTED COMPUTING, 2002, 15 (01) : 17 - 25
  • [4] FNB: Fast Non-Blocking Coordinated Checkpointing Protocol for Distributed Systems
    Abdelhafidi, Zohra
    Djoudi, Mohamed
    Lagraa, Nasreddine
    Yagoubi, Mohamed Bachir
    THEORY OF COMPUTING SYSTEMS, 2015, 57 (02) : 397 - 425
  • [5] A non-blocking Checkpointing algorithm for distributed systems
    Guoliang L.
    Shuyu C.
    Xiaoqin Z.
    International Journal of Digital Content Technology and its Applications, 2011, 5 (07) : 230 - 238
  • [6] A new non-blocking synchronous checkpointing scheme for distributed systems
    Gupta, B
    Rahimi, S
    Naskar, P
    Proceedings of the ISCA 20th International Conference on Computers and Their Applications, 2005, : 26 - 31
  • [7] Software Model Checking for Distributed Systems with Selector-Based, Non-blocking Communication
    Artho, Cyrille
    Hagiya, Masami
    Potter, Richard
    Tanabe, Yoshinori
    Weitl, Franz
    Yamamoto, Mitsuharu
    2013 28TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING (ASE), 2013, : 169 - 179
  • [8] A Distributed Counter-based Non-blocking Coordinated Checkpoint Algorithm for Grid Computing Applications
    El-Sayed, Gamal A.
    Hossny, Khadra A.
    2012 2ND INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTATIONAL TOOLS FOR ENGINEERING APPLICATIONS (ACTEA), 2012, : 80 - 85
  • [9] Non-Blocking One-Phase Commit Made Possible for Distributed Transactions over Replicated Data
    Zhu, Yuqing
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 2874 - 2876
  • [10] An atomic commit protocol for gigabit-networked distributed database systems
    Al-Houmaily, YJ
    Chrysanthis, PK
    JOURNAL OF SYSTEMS ARCHITECTURE, 2000, 46 (09) : 809 - 833