Asynchronous implementation of failure detectors

被引:55
|
作者
Mostefaoui, A [1 ]
Mourgaya, E [1 ]
Raynal, M [1 ]
机构
[1] Univ Rennes 1, IRISA, F-35042 Rennes, France
来源
2003 INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS, PROCEEDINGS | 2003年
关键词
D O I
10.1109/DSN.2003.1209946
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Unreliable failure detectors introduced by Chandra and Toueg are abstract mechanisms that provide information on process failures. On the one hand, failure detectors allow to state the minimal requirements on process failures that allow to solve problems that cannot be solved in purely asynchronous systems. But, on the other hand, they cannot be implemented in such. systems: their implementation requires that the underlying distributed system be enriched with additional assumptions. The usual failure detector implementations rely on additional synchrony assumptions (e.g., partial synchrony). This paper proposes a new look at the implementation of failure detectors and more specifically at Chandra-Toueg's failure detectors. The proposed approach does not rely on synchrony assumptions (e.g., it allows the communication delays to always increase). It is based on a query-response mechanism and assumes that the query/response messages exchanged obey a pattern where the responses from some processes to a query arrive among the (n - f) first ones (n being the total number of processes, f the maximum number of them that can crash, with 1 less than or equal to f < n). When we consider the particular case f = 1, and the implementation of a failure detector of the class denoted lozengeS (the weakest class that allows to solve the consensus problem), the additional assumption the underlying system has to satisfy boils down to a simple channel property, namely, there is eventually a pair of processes (p(i), p(j)) such that the channel connecting them is never the slowest among the channels connecting p(i) or p(j) to the other processes. A probabilistic analysis shows that this requirement is practically met in asynchronous distributed systems.
引用
收藏
页码:351 / 360
页数:10
相关论文
共 50 条
  • [1] Implementable failure detectors in asynchronous systems
    Garg, VK
    Mitchell, JR
    FOUNDATIONS OF SOFTWARE TECHNOLOGY AND THEORETICAL COMPUTER SCIENCE, 1998, 1530 : 158 - 169
  • [2] Asynchronous bounded lifetime failure detectors
    Friedman, R
    Mostefaoui, A
    Raynal, M
    INFORMATION PROCESSING LETTERS, 2005, 94 (02) : 85 - 91
  • [3] Anonymous Asynchronous Systems: The Case of Failure Detectors
    Bonnet, Francois
    Raynal, Michel
    DISTRIBUTED COMPUTING, 2010, 6343 : 206 - 220
  • [4] Anonymous asynchronous systems: the case of failure detectors
    François Bonnet
    Michel Raynal
    Distributed Computing, 2013, 26 : 141 - 158
  • [5] Mutual exclusion in asynchronous systems with failure detectors
    Delporte-Gallet, C
    Fauconnier, H
    Guerraoui, R
    Kouznetsov, P
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2005, 65 (04) : 492 - 505
  • [6] Anonymous asynchronous systems: the case of failure detectors
    Bonnet, Francois
    Raynal, Michel
    DISTRIBUTED COMPUTING, 2013, 26 (03) : 141 - 158
  • [7] Muteness failure detectors: Specification and implementation
    Doudou, A
    Garbinato, B
    Guerraoui, R
    Schiper, A
    DEPENDABLE COMPUTING - EDCC-3, 1999, 1667 : 71 - 87
  • [8] Implementation aspects of linear multiuser detectors in asynchronous CDMA systems
    Juntti, MJ
    Lilleberg, JO
    IEEE ISSSTA '96 - IEEE FOURTH INTERNATIONAL SYMPOSIUM ON SPREAD SPECTRUM TECHNIQUES & APPLICATIONS, PROCEEDINGS, VOLS 1-3, 1996, : 842 - 846
  • [9] Leader election in asynchronous distributed systems with unreliable failure detectors
    Park, SH
    Yamashita, M
    PDPTA '04: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS 1-3, 2004, : 687 - 693
  • [10] On the implementation of communication-optimal failure detectors
    Larrea, Mikel
    Lafuente, Alberto
    Soraluze, Iratxe
    Cortinas, Roberto
    Wieland, Joachim
    DEPENDABLE COMPUTING, PROCEEDINGS, 2007, 4746 : 25 - +