Optimal Message-Driven Implementations of Omega with Mute Processes

被引:7
作者
Biely, Martin [1 ]
Widder, Josef [1 ]
机构
[1] Vienna Univ Technol, Ecole Polytech, Vienna, Austria
基金
奥地利科学基金会;
关键词
Algorithms; Reliability; Theory; Fault tolerance; message-driven distributed algorithm; unreliable failure detectors; lower bound; IMPOSSIBILITY; CONSENSUS;
D O I
10.1145/1462187.1462191
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We investigate the complexity of algorithms in message-driven models. In such models, events in the computation can only be caused by message receptions, but not by the passage of time. Hutle and Widder [2005a] have shown that there is no deterministic message-driven self-stabilizing implementation of the eventually strong failure detector and thus Omega in systems with uncertainty in message delays and channels of unknown capacity using only bounded space. Under stronger assumptions it was shown that even the eventually perfect failure detector can be implemented in message-driven systems consisting of at least f + 2 processes (f being the upper bound on the number of processes that crash during an execution). In this article we show that f + 2 is in fact a lower bound in message-driven systems, even if nonstabilizing algorithms are considered. This contrasts time-driven models where f + 1 is sufficient for failure detector implementations. Moreover, we investigate algorithms where not all processes send message, that is, are active, but some (in a predetermined set) remain passive. Here, we show that the f + 2 processes required for message-driven systems must be active, while in time-driven systems it suffices that f processes are active. We also provide message-driven implementations of Omega. Our algorithms are efficient in the sense that not all processes have to send messages forever, which is an improvement to previous message-driven failure detector implementations.
引用
收藏
页数:22
相关论文
共 31 条
  • [1] Aguilera M.K., 2004, Proceedings of the 23rd Annual ACM Symposium on Principles of Distributed Computing, P328, DOI DOI 10.1145/1011767.1011816
  • [2] Aguilera M.K., 2003, Proceedings of the 22nd Annual Symposium on Principles of Distributed Computing, P306, DOI [10.1145/872035.872081, DOI 10.1145/872035.872081]
  • [3] Fault-tolerance and self-stabilization: impossibility results and solutions using self-stabilizing failure detectors
    Beauquier, J
    KekkonenMoneta, S
    [J]. INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 1997, 28 (11) : 1177 - 1187
  • [4] Unreliable failure detectors for reliable distributed systems
    Chandra, TD
    Toueg, S
    [J]. JOURNAL OF THE ACM, 1996, 43 (02) : 225 - 267
  • [5] The weakest failure detector for solving Consensus
    Chandra, TD
    Hadzilacos, V
    Toueg, S
    [J]. JOURNAL OF THE ACM, 1996, 43 (04) : 685 - 722
  • [6] ON THE MINIMAL SYNCHRONISM NEEDED FOR DISTRIBUTED CONSENSUS
    DOLEV, D
    DWORK, C
    STOCKMEYER, L
    [J]. JOURNAL OF THE ACM, 1987, 34 (01) : 77 - 97
  • [7] CONSENSUS IN THE PRESENCE OF PARTIAL SYNCHRONY
    DWORK, C
    LYNCH, N
    STOCKMEYER, L
    [J]. JOURNAL OF THE ACM, 1988, 35 (02) : 288 - 323
  • [8] A FORMAL APPROACH TO DESIGNING DELAY-INSENSITIVE CIRCUITS
    EBERGEN, JC
    [J]. DISTRIBUTED COMPUTING, 1991, 5 (03) : 107 - 119
  • [9] Einstein A, 1905, ANN PHYS-BERLIN, V17, P891
  • [10] On the possibility of consensus in asynchronous systems with finite average response times
    Fetzer, C
    Schmid, U
    Süsskraut, M
    [J]. 25TH IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, PROCEEDINGS, 2005, : 271 - 280