Latency-Aware Leader Selection for Geo-Replicated Byzantine Fault-Tolerant Systems

被引：13

作者：

Eischer, Michael ^{[1
]}

Distler, Tobias ^{[1
]}

机构：

[1] Friedrich Alexander Univ Erlangen Nurnberg FAU, Erlangen, Germany

来源：

2018 48TH ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS WORKSHOPS (DSN-W) | 2018年

关键词：

D O I：

10.1109/DSN-W.2018.00053

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

In a geo-replicated setting, the response time of a leader-based Byzantine fault-tolerant (BFT) protocol often differs significantly depending on which of the replicas in the system is currently acting as leader. Identifying a single optimal leader position in general is impossible due to workload characteristics usually varying over the course of the day. As a consequence, the approach used in many existing BFT replication protocols, which assign the leader role in a static manner and only change the leader in case of suspected or detect faulty behavior, results in unnecessarily high latency in wide-area environments. In this paper we address this problem with ARCHER, a latency-aware mechanism to select the leader of a geo-replicated BFT system based on end-to-end response times measured by clients. To prevent faulty replicas from gaining an unfair advantage by sending protocol messages early, ARCHER relies on a hash-chain-based approach that enables clients to detect if a protocol phase has been skipped. In addition, ARCHER offers means to tolerate incorrect latency values reported by faulty clients and can also be extended to solve other selection problems such as the placement of active and passive replicas in resource-efficient BFT systems.

引用

页码：140 / 145

页数：6

共 50 条

[21] Replicated servers for fault-tolerant real-time systems using transputers
Sinha, A
Das, PK
Basu, D
INFORMATION AND SOFTWARE TECHNOLOGY, 1996, 38 (10) : 633 - 645
[22] LOW-COST MANAGEMENT OF REPLICATED DATA IN FAULT-TOLERANT DISTRIBUTED SYSTEMS
JOSEPH, TA
BIRMAN, KP
ACM TRANSACTIONS ON COMPUTER SYSTEMS, 1986, 4 (01): : 54 - 70
[23] Increasing Performance in Byzantine Fault-Tolerant Systems with On-Demand Replica Consistency
Distler, Tobias
Kapitza, Ruediger
EUROSYS 11: PROCEEDINGS OF THE EUROSYS 2011 CONFERENCE, 2011, : 91 - 105
[24] Byzantine Fault-tolerant State-machine Replication from a Systems Perspective
Distler, Tobias
ACM COMPUTING SURVEYS, 2021, 54 (01)
[25] Proactive service migration for long-running Byzantine fault-tolerant systems
Zhao, W.
Zhang, H.
IET SOFTWARE, 2009, 3 (02) : 154 - 164
[26] Rigorous design of fault-tolerant transactions for replicated database systems using event B
Yadav, Divakar
Butler, Michael
RIGOROUS DEVELOPMENT OF COMPLEX FAULT-TOLERANT SYSTEMS, 2006, 4157 : 343 - +
[27] Implementing network partition-aware fault-tolerant CORBA systems
Beyer, Stefan
Munoz-Escoi, Francesc D.
Galdamez, Pablo
ARES 2007: SECOND INTERNATIONAL CONFERENCE ON AVAILABILITY, RELIABILITY AND SECURITY, PROCEEDINGS, 2007, : 69 - +
[28] A Novel Low-Latency Regional Fault-Aware Fault-Tolerant Routing Algorithm for Wireless NoC
Ouyang, Yiming
Wang, Qi
Ru, Mengxuan
Liang, Huaguo
Li, Jianhua
IEEE ACCESS, 2020, 8 : 22650 - 22663
[29] Cost-aware Capacity Provisioning for Fault-tolerant Geo-distributed Data Centers
Tripathi, Rakesh
Vignesh, S.
Tamarapalli, Venkatesh
2016 8TH INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS AND NETWORKS (COMSNETS), 2016,
[30] Fault-tolerant Consensus of Leader-follower Multi-agent Systems
Zhao, Dan
Wu, Yonghong
Yang, Chao
Ling, Guang
Shen, Yanjun
Guan, Zhi-Hong
PROCEEDINGS OF THE 28TH CHINESE CONTROL AND DECISION CONFERENCE (2016 CCDC), 2016, : 3288 - 3293

← 1 2 3 4 5 →