Consensus-based fault-tolerant total order multicast

被引:5
作者
Fritzke, U [1 ]
Ingels, P [1 ]
Mostefaoui, A [1 ]
Raynal, M [1 ]
机构
[1] Inst Rech Informat & Syst Aleatoires, F-35042 Rennes, France
关键词
asynchronous systems; consensus; groups; reliable multicast; total order; group multicast;
D O I
10.1109/71.910870
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
While Total Order Broadcast (or Atomic Broadcast) primitives have received a lot of attention, this paper concentrates on Total Order Multicast to Multiple Groups in the context of asynchronous distributed systems in which processes may suffer crash failures. "Multicast to Multiple Groups" means that each message is sent to a subset of the process groups composing the system, distinct messages possibly having distinct destination groups. "Total Order" means that all message deliveries must be totally ordered. This paper investigates a consensus-based approach to solve this problem and proposes a corresponding protocol to implement this multicast primitive. This protocol is based on two underlying building blocks, namely, Uniform Reliable Multicast and Uniform Consensus. Its design characteristics lie in the two following properties: The first one is a Minimality property, more precisely, only the sender of a message and processes of its destination groups have to participate in the total order multicast of the message. The second property is a Locality property: No execution of a consensus has to involve processes belonging to distinct groups (i.e., consensus is executed on a "per group" basis). This Locality property is particularly useful when one is interested in using the Total Order Multicast primitive in large-scale distributed systems. In addition to a correctness proof, an improvement that reduces the cost of the protocol is also suggested.
引用
收藏
页码:147 / 156
页数:10
相关论文
共 23 条
  • [1] THE TOTEM SINGLE-RING ORDERING AND MEMBERSHIP PROTOCOL
    AMIR, Y
    MOSER, LE
    MELLIARSMITH, PM
    AGARWAL, DA
    CIARFELLA, P
    [J]. ACM TRANSACTIONS ON COMPUTER SYSTEMS, 1995, 13 (04): : 311 - 342
  • [2] [Anonymous], P 23 INT S FAULT TOL
  • [3] ARG VK, 1999, PARALLEL PROCESSING
  • [4] BIRMAN K, 1991, ACM T COMPUT SYST, V9, P272, DOI 10.1145/128738.128742
  • [5] RELIABLE COMMUNICATION IN THE PRESENCE OF FAILURES
    BIRMAN, KP
    JOSEPH, TA
    [J]. ACM TRANSACTIONS ON COMPUTER SYSTEMS, 1987, 5 (01): : 47 - 76
  • [6] Unreliable failure detectors for reliable distributed systems
    Chandra, TD
    Toueg, S
    [J]. JOURNAL OF THE ACM, 1996, 43 (02) : 225 - 267
  • [7] The weakest failure detector for solving Consensus
    Chandra, TD
    Hadzilacos, V
    Toueg, S
    [J]. JOURNAL OF THE ACM, 1996, 43 (04) : 685 - 722
  • [8] Multipoint communication: A survey of protocols, functions, and mechanisms
    Diot, C
    Dabbous, W
    Crowcroft, J
    [J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 1997, 15 (03) : 277 - 290
  • [9] IMPOSSIBILITY OF DISTRIBUTED CONSENSUS WITH ONE FAULTY PROCESS
    FISCHER, MJ
    LYNCH, NA
    PATERSON, MS
    [J]. JOURNAL OF THE ACM, 1985, 32 (02) : 374 - 382
  • [10] FRITZKE U, THESIS U RENNES