Deterministic Fault-Tolerant Distributed Computing in Linear Time and Communication

被引:1
|
作者
Chlebus, Bogdan S. [1 ]
Kowalski, Dariusz R. [1 ]
Olkowski, Jan [2 ]
机构
[1] Augusta Univ, Augusta, GA 30912 USA
[2] Univ Maryland, Baltimore, MD 21201 USA
来源
PROCEEDINGS OF THE 2023 ACM SYMPOSIUM ON PRINCIPLES OF DISTRIBUTED COMPUTING, PODC 2023 | 2023年
关键词
crash failure; Byzantine fault; authentication; consensus; check-pointing; gossiping; Ramanujan graph; lower bound; RANDOMIZED CONSENSUS; MESSAGE COMPLEXITY; BYZANTINE; AGREEMENT; ALGORITHMS; PROTOCOLS; BOUNDS;
D O I
10.1145/3583668.3594599
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We develop deterministic algorithms for the problems of consensus, gossiping and checkpointing with nodes prone to failing. Distributed systems are modeled as synchronous complete networks. Failures are represented either as crashes or Byzantine faults with authentication. The algorithmic goal is to have both linear running time and linear amount of communication for as large an upper bound t on the number of faults as possible, with respect to the number of nodes n. For crash failures, these bounds of optimality are t = O (n/log n) for consensus and t = O (n/log(2) n) for gossiping and checkpointing. For the model of Byzantine faults with authentication, we show how to reach consensus in both linear running time and communication for t = O(root n). We show how to implement the consensus algorithm for crash failures in the single-port model such as to preserve the range of t for which both the running time and communication are optimal. We prove a lower bound Omega(l +log n) on the running time of algorithms for each of the considered problems.
引用
收藏
页码:344 / 354
页数:11
相关论文
共 50 条
  • [21] Anonymous and fault-tolerant shared-memory computing
    Guerraoui, Rachid
    Ruppert, Eric
    DISTRIBUTED COMPUTING, 2007, 20 (03) : 165 - 177
  • [22] On the Message Complexity of Fault-Tolerant Computation: Leader Election and Agreement
    Kumar, Manish
    Molla, Anisur Rahaman
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2023, 34 (04) : 1115 - 1127
  • [23] LINEAR-TIME ALGORITHMS FOR FAULT-TOLERANT ROUTING IN HYPERCUBES AND STAR GRAPHS
    GU, QP
    PENG, ST
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1995, E78D (09) : 1171 - 1177
  • [24] DMap: A fault-tolerant and scalable distributed data structure
    Benz, Samuel
    Pedone, Fernando
    2018 IEEE 37TH INTERNATIONAL SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS (SRDS), 2018, : 153 - 160
  • [25] EFFICIENT CHECKPOINTING PROCEDURES FOR FAULT-TOLERANT DISTRIBUTED SYSTEMS
    SALEH, K
    AGARWAL, A
    MICROPROCESSING AND MICROPROGRAMMING, 1994, 40 (06): : 427 - 438
  • [26] Fault-tolerant computation of distributed regular path queries
    Shoaran, Maryam
    Thomo, Alex
    THEORETICAL COMPUTER SCIENCE, 2009, 410 (01) : 62 - 77
  • [27] Distributed Adaptive Fault-Tolerant Formation Control for Heterogeneous Multiagent Systems With Communication Link Faults
    Gong, Jianye
    Jiang, Bin
    Ma, Yajie
    Mao, Zehui
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2023, 59 (02) : 784 - 795
  • [28] Fault-tolerant Consensus in Anonymous Dynamic Network
    Zhang, Qinzi
    Tseng, Lewis
    2024 IEEE 44TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, ICDCS 2024, 2024, : 128 - 138
  • [29] Distributed Fault-Tolerant Containment Control for Linear Heterogeneous Multiagent Systems: A Hierarchical Design Approach
    Xiao, Shuyi
    Dong, Jiuxiang
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (02) : 971 - 981
  • [30] Distributed Active Fault-Tolerant Cooperative Control for Multiagent Systems With Communication Delays and External Disturbances
    Zhong, Yujiang
    Lyu, Guangran
    He, Xiao
    Zhang, Youmin
    Ge, Shuzhi Sam
    IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (07) : 4642 - 4652