Byzantine disk paxos: optimal resilience with byzantine shared memory

被引:46
作者
Abraham, I [1 ]
Chockler, G
Keidar, I
Malkhi, D
机构
[1] Hebrew Univ Jerusalem, Sch Engn & Comp Sci, IL-91905 Jerusalem, Israel
[2] MIT, Lab Comp Sci & Artificial Intelligence, Cambridge, MA 02139 USA
[3] Technion Israel Inst Technol, Dept Elect Engn, IL-32000 Haifa, Israel
关键词
shared-memory emulations; T-tolerant object implementations; Byzantine failures; wait freedom; consensus; lower bounds;
D O I
10.1007/s00446-005-0151-6
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We present Byzantine Disk Paxos, an asynchronous shared-memory consensus algorithm that uses a collection of n > 3t disks, t of which may fail by becoming non-responsive or arbitrarily corrupted. We give two constructions of this algorithm; that is, we construct two different t-tolerant (i.e., tolerating up to t disk failures) building blocks, each of which can be used, along with a leader oracle, to solve consensus. One building block is a t-tolerant wait-free shared safe register. The second building block is a t-tolerant regular register that satisfies a weaker termination (liveness) condition than wait freedom: its write operations are wait-free, whereas its read operations are guaranteed to return only in executions with a finite number of writes. We call this termination condition finite writes (FW), and show that wait-free consensus is solvable with FW-terminating registers and a leader oracle. We construct each of these t-tolerant registers from n > 3t base registers, t of which can be non-responsive or Byzantine. All the previous t-tolerant wait-free constructions in this model used at least 4t + I fault-prone registers, and we are not familiar with any prior FW-terminating constructions in this model. We further show tight lower bounds on the number of invocation rounds required for optimal resilience reliable register constructions, or more generally, constructions that use less than 4t + 1 fault-prone registers. Our lower bounds show that such constructions are inherently more costly than constructions that use 4t + 1 registers, and that our constructions have optimal round complexity. Furthermore, our wait-free construction is early-stopping, and it achieves the optimal round complexity with any number of actual failures.
引用
收藏
页码:387 / 408
页数:22
相关论文
共 31 条
  • [1] AFEK Y, 1993, LECT NOTES COMPUT SC, V725, P69
  • [2] Computing with faulty shared objects
    Afek, Y
    Greenberg, DS
    Merrit, M
    Taubenfeld, G
    [J]. JOURNAL OF THE ACM, 1995, 42 (06) : 1231 - 1274
  • [3] ATTIYA H, 2003, 22 S REL DISTR SYST
  • [4] Synchronous Byzantine quorum systems
    Bazzi, RA
    [J]. DISTRIBUTED COMPUTING, 2000, 13 (01) : 45 - 52
  • [5] Boichat R., 2003, SIGACT News, V34, P47, DOI 10.1145/637437.637447
  • [6] ASYNCHRONOUS CONSENSUS AND BROADCAST PROTOCOLS
    BRACHA, G
    TOUEG, S
    [J]. JOURNAL OF THE ACM, 1985, 32 (04) : 824 - 840
  • [7] The weakest failure detector for solving Consensus
    Chandra, TD
    Hadzilacos, V
    Toueg, S
    [J]. JOURNAL OF THE ACM, 1996, 43 (04) : 685 - 722
  • [8] Chockler G, 2001, INT CON DISTR COMP S, P11, DOI 10.1109/ICDSC.2001.918928
  • [9] Chockler Gregory, 2002, P 21 ACM S PRINC DIS
  • [10] The timed asynchronous distributed system model
    Cristian, F
    Fetzer, C
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1999, 10 (06) : 642 - 657