HQ replication: A hybrid quorum protocol for byzantine fault tolerance

被引:0
作者
Cowling, James [1 ]
Myers, Daniel [1 ]
Liskov, Barbara [1 ]
Rodrigues, Rodrigo [2 ]
Shrira, Liuba [3 ]
机构
[1] MIT, CSAIL, Cambridge, MA 02139 USA
[2] INESC ID, Inst Super Tecn, Lisbon, Portugal
[3] Brandeis Univ, Waltham, MA 02454 USA
来源
USENIX ASSOCIATION 7TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION | 2006年
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
There are currently two approaches to providing Byzantine-fault-tolerant state machine replication: a replica-based approach, e.g., BFT, that uses communication between replicas to agree on a proposed ordering of requests, and a quorum-based approach, such as Q/U, in which clients contact replicas directly to optimistically execute operations. Both approaches have shortcomings: the quadratic cost of inter-replica communication is unnecessary when there is no contention, and Q/U requires a large number of replicas and performs poorly under contention. We present HQ, a hybrid Byzantine-fault-tolerant state machine replication protocol that overcomes these problems. HQ employs a lightweight quorum-based protocol when there is no contention, but uses BFT to resolve contention when it arises. Furthermore, HQ uses only 3f + 1 replicas to tolerate f faults, providing optimal resilience to node failures. We implemented a prototype of HQ, and we compare its performance to BFT and Q/U analytically and experimentally. Additionally, in this work we use a new implementation of BFT designed to scale as the number of faults increases. Our results show that both HQ and our new implementation of BFT scale as f increases; additionally our hybrid approach of using BFT to handle contention works well.
引用
收藏
页码:177 / +
页数:2
相关论文
共 21 条
  • [1] Abd-El-Malek Michael, 2005, Proceedings of the twentieth ACM Symposium on Operating Systems Principles (SOSP), V39, P59, DOI DOI 10.1145/1095810.1095817
  • [2] Practical byzantine fault tolerance and proactive recovery
    Castro, M
    Liskov, B
    [J]. ACM TRANSACTIONS ON COMPUTER SYSTEMS, 2002, 20 (04): : 398 - 461
  • [3] CHOCKLER G, 2001, P IEEE INT C DISTR C
  • [4] COWLING J, 2006, UNPUB HQ REPLICATION
  • [5] FRY C, 2004, P IEEE S REL DISTR S
  • [6] HERLIHY MP, 1987, C REC 14 ANN ACM S P
  • [7] KISTLER JJ, 1991, 13 ACM S OP SYST PRI, V25, P213
  • [8] IMPLEMENTATION OF RELIABLE DISTRIBUTED MULTIPROCESS SYSTEMS
    LAMPORT, L
    [J]. COMPUTER NETWORKS AND ISDN SYSTEMS, 1978, 2 (02): : 95 - 114
  • [9] THE BYZANTINE GENERALS PROBLEM
    LAMPORT, L
    SHOSTAK, R
    PEASE, M
    [J]. ACM TRANSACTIONS ON PROGRAMMING LANGUAGES AND SYSTEMS, 1982, 4 (03): : 382 - 401
  • [10] LISKOV B, 2005, MITLCSTR994