HQ replication: A hybrid quorum protocol for byzantine fault tolerance

被引：0

作者：

Cowling, James ^{[1
]}

Myers, Daniel ^{[1
]}

Liskov, Barbara ^{[1
]}

Rodrigues, Rodrigo ^{[2
]}

Shrira, Liuba ^{[3
]}

机构：

[1] MIT, CSAIL, Cambridge, MA 02139 USA

[2] INESC ID, Inst Super Tecn, Lisbon, Portugal

[3] Brandeis Univ, Waltham, MA 02454 USA

来源：

USENIX ASSOCIATION 7TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION | 2006年

关键词：

D O I：

暂无

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

There are currently two approaches to providing Byzantine-fault-tolerant state machine replication: a replica-based approach, e.g., BFT, that uses communication between replicas to agree on a proposed ordering of requests, and a quorum-based approach, such as Q/U, in which clients contact replicas directly to optimistically execute operations. Both approaches have shortcomings: the quadratic cost of inter-replica communication is unnecessary when there is no contention, and Q/U requires a large number of replicas and performs poorly under contention. We present HQ, a hybrid Byzantine-fault-tolerant state machine replication protocol that overcomes these problems. HQ employs a lightweight quorum-based protocol when there is no contention, but uses BFT to resolve contention when it arises. Furthermore, HQ uses only 3f + 1 replicas to tolerate f faults, providing optimal resilience to node failures. We implemented a prototype of HQ, and we compare its performance to BFT and Q/U analytically and experimentally. Additionally, in this work we use a new implementation of BFT designed to scale as the number of faults increases. Our results show that both HQ and our new implementation of BFT scale as f increases; additionally our hybrid approach of using BFT to handle contention works well.

引用

页码：177 / +

页数：2

共 21 条

[1] Abd-El-Malek Michael, 2005, Proceedings of the twentieth ACM Symposium on Operating Systems Principles (SOSP), V39, P59, DOI DOI 10.1145/1095810.1095817
[2] Practical byzantine fault tolerance and proactive recovery
Castro, M
Liskov, B
[J]. ACM TRANSACTIONS ON COMPUTER SYSTEMS, 2002, 20 (04): : 398 - 461
[3] CHOCKLER G, 2001, P IEEE INT C DISTR C
[4] COWLING J, 2006, UNPUB HQ REPLICATION
[5] FRY C, 2004, P IEEE S REL DISTR S
[6] HERLIHY MP, 1987, C REC 14 ANN ACM S P
[7] KISTLER JJ, 1991, 13 ACM S OP SYST PRI, V25, P213
[8] IMPLEMENTATION OF RELIABLE DISTRIBUTED MULTIPROCESS SYSTEMS
LAMPORT, L
[J]. COMPUTER NETWORKS AND ISDN SYSTEMS, 1978, 2 (02): : 95 - 114
[9] THE BYZANTINE GENERALS PROBLEM
LAMPORT, L
SHOSTAK, R
PEASE, M
[J]. ACM TRANSACTIONS ON PROGRAMMING LANGUAGES AND SYSTEMS, 1982, 4 (03): : 382 - 401
[10] LISKOV B, 2005, MITLCSTR994

← 1 2 3 →