Optimal Message-Driven Implementations of Omega with Mute Processes

被引：7

作者：

Biely, Martin ^{[1
]}

Widder, Josef ^{[1
]}

机构：

[1] Vienna Univ Technol, Ecole Polytech, Vienna, Austria

来源：

ACM TRANSACTIONS ON AUTONOMOUS AND ADAPTIVE SYSTEMS | 2009年 / 4卷 / 01期

基金：

奥地利科学基金会;

关键词：

Algorithms; Reliability; Theory; Fault tolerance; message-driven distributed algorithm; unreliable failure detectors; lower bound; IMPOSSIBILITY; CONSENSUS;

D O I：

10.1145/1462187.1462191

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We investigate the complexity of algorithms in message-driven models. In such models, events in the computation can only be caused by message receptions, but not by the passage of time. Hutle and Widder [2005a] have shown that there is no deterministic message-driven self-stabilizing implementation of the eventually strong failure detector and thus Omega in systems with uncertainty in message delays and channels of unknown capacity using only bounded space. Under stronger assumptions it was shown that even the eventually perfect failure detector can be implemented in message-driven systems consisting of at least f + 2 processes (f being the upper bound on the number of processes that crash during an execution). In this article we show that f + 2 is in fact a lower bound in message-driven systems, even if nonstabilizing algorithms are considered. This contrasts time-driven models where f + 1 is sufficient for failure detector implementations. Moreover, we investigate algorithms where not all processes send message, that is, are active, but some (in a predetermined set) remain passive. Here, we show that the f + 2 processes required for message-driven systems must be active, while in time-driven systems it suffices that f processes are active. We also provide message-driven implementations of Omega. Our algorithms are efficient in the sense that not all processes have to send messages forever, which is an improvement to previous message-driven failure detector implementations.

引用

页数：22

共 31 条

[1]

Aguilera M.K., 2004, Proceedings of the 23rd Annual ACM Symposium on Principles of Distributed Computing, P328, DOI DOI 10.1145/1011767.1011816

[2]

Aguilera M.K., 2003, Proceedings of the 22nd Annual Symposium on Principles of Distributed Computing, P306, DOI [10.1145/872035.872081, DOI 10.1145/872035.872081]

[3] Fault-tolerance and self-stabilization: impossibility results and solutions using self-stabilizing failure detectors [J].

Beauquier, J ;

KekkonenMoneta, S .

INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 1997, 28 (11) :1177-1187

[4] Unreliable failure detectors for reliable distributed systems [J].

Chandra, TD ;

Toueg, S .

JOURNAL OF THE ACM, 1996, 43 (02) :225-267

[5] The weakest failure detector for solving Consensus [J].

Chandra, TD ;

Hadzilacos, V ;

Toueg, S .

JOURNAL OF THE ACM, 1996, 43 (04) :685-722

[6] ON THE MINIMAL SYNCHRONISM NEEDED FOR DISTRIBUTED CONSENSUS [J].

DOLEV, D ;

DWORK, C ;

STOCKMEYER, L .

JOURNAL OF THE ACM, 1987, 34 (01) :77-97

[7] CONSENSUS IN THE PRESENCE OF PARTIAL SYNCHRONY [J].

DWORK, C ;

LYNCH, N ;

STOCKMEYER, L .

JOURNAL OF THE ACM, 1988, 35 (02) :288-323

[8] A FORMAL APPROACH TO DESIGNING DELAY-INSENSITIVE CIRCUITS [J].

EBERGEN, JC .

DISTRIBUTED COMPUTING, 1991, 5 (03) :107-119

[9]

Einstein A, 1905, ANN PHYS-BERLIN, V17, P891

[10] On the possibility of consensus in asynchronous systems with finite average response times [J].

Fetzer, C ;

Schmid, U ;

Süsskraut, M .

25TH IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, PROCEEDINGS, 2005, :271-280

← 1 2 3 4 →