Deterministic Fault-Tolerant Distributed Computing in Linear Time and Communication

被引:1
|
作者
Chlebus, Bogdan S. [1 ]
Kowalski, Dariusz R. [1 ]
Olkowski, Jan [2 ]
机构
[1] Augusta Univ, Augusta, GA 30912 USA
[2] Univ Maryland, Baltimore, MD 21201 USA
关键词
crash failure; Byzantine fault; authentication; consensus; check-pointing; gossiping; Ramanujan graph; lower bound; RANDOMIZED CONSENSUS; MESSAGE COMPLEXITY; BYZANTINE; AGREEMENT; ALGORITHMS; PROTOCOLS; BOUNDS;
D O I
10.1145/3583668.3594599
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We develop deterministic algorithms for the problems of consensus, gossiping and checkpointing with nodes prone to failing. Distributed systems are modeled as synchronous complete networks. Failures are represented either as crashes or Byzantine faults with authentication. The algorithmic goal is to have both linear running time and linear amount of communication for as large an upper bound t on the number of faults as possible, with respect to the number of nodes n. For crash failures, these bounds of optimality are t = O (n/log n) for consensus and t = O (n/log(2) n) for gossiping and checkpointing. For the model of Byzantine faults with authentication, we show how to reach consensus in both linear running time and communication for t = O(root n). We show how to implement the consensus algorithm for crash failures in the single-port model such as to preserve the range of t for which both the running time and communication are optimal. We prove a lower bound Omega(l +log n) on the running time of algorithms for each of the considered problems.
引用
收藏
页码:344 / 354
页数:11
相关论文
共 50 条
  • [11] COMMUNICATION STRUCTURES IN FAULT-TOLERANT DISTRIBUTED SYSTEMS
    PRADHAN, DK
    MEYER, FJ
    NETWORKS, 1993, 23 (04) : 379 - 389
  • [13] A hybrid and adaptive model for fault-tolerant distributed computing
    Gorender, S
    Macêdo, R
    Raynal, M
    2005 INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS, PROCEEDINGS, 2005, : 412 - 421
  • [14] Active fault-tolerant system for open distributed computing
    Lanka, Rodrigo
    Oda, Kentaro
    Yoshida, Takaichi
    AUTONOMIC AND TRUSTED COMPUTING, PROCEEDINGS, 2006, 4158 : 581 - 590
  • [15] Fundamentals of fault-tolerant distributed computing in asynchronous environments
    Gärtner, FC
    ACM COMPUTING SURVEYS, 1999, 31 (01) : 1 - 26
  • [16] Fault-tolerant distributed mass storage for LHC computing
    Wiebalck, A
    Breuer, PT
    Lindenstruth, V
    Steinbeck, TM
    CCGRID 2003: 3RD IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID, PROCEEDINGS, 2003, : 266 - 273
  • [17] An adaptive programming model for fault-tolerant distributed computing
    Gorender, Sergio
    Macedo, Raimundo Jose de Araujo
    Raynal, Michel
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2007, 4 (01) : 18 - 31
  • [18] A dynamic fault-tolerant model for open distributed computing
    Lanka, Rodrigo
    Oda, Kentaro
    Najima, Horoki
    Yoshida, Takaichi
    SEVENTEENTH INTERNATIONAL CONFERENCE ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2006, : 25 - +
  • [19] FAULT-TOLERANT COMPUTING
    TOY, WN
    ADVANCES IN COMPUTERS, 1987, 26 : 201 - 279
  • [20] FAULT-TOLERANT COMPUTING
    PRADHAN, DK
    COMPUTER, 1980, 13 (03) : 6 - 7