Latency-Aware Leader Selection for Geo-Replicated Byzantine Fault-Tolerant Systems

被引:13
|
作者
Eischer, Michael [1 ]
Distler, Tobias [1 ]
机构
[1] Friedrich Alexander Univ Erlangen Nurnberg FAU, Erlangen, Germany
来源
2018 48TH ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS WORKSHOPS (DSN-W) | 2018年
关键词
D O I
10.1109/DSN-W.2018.00053
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In a geo-replicated setting, the response time of a leader-based Byzantine fault-tolerant (BFT) protocol often differs significantly depending on which of the replicas in the system is currently acting as leader. Identifying a single optimal leader position in general is impossible due to workload characteristics usually varying over the course of the day. As a consequence, the approach used in many existing BFT replication protocols, which assign the leader role in a static manner and only change the leader in case of suspected or detect faulty behavior, results in unnecessarily high latency in wide-area environments. In this paper we address this problem with ARCHER, a latency-aware mechanism to select the leader of a geo-replicated BFT system based on end-to-end response times measured by clients. To prevent faulty replicas from gaining an unfair advantage by sending protocol messages early, ARCHER relies on a hash-chain-based approach that enables clients to detect if a protocol phase has been skipped. In addition, ARCHER offers means to tolerate incorrect latency values reported by faulty clients and can also be extended to solve other selection problems such as the placement of active and passive replicas in resource-efficient BFT systems.
引用
收藏
页码:140 / 145
页数:6
相关论文
共 50 条
  • [21] Replicated servers for fault-tolerant real-time systems using transputers
    Sinha, A
    Das, PK
    Basu, D
    INFORMATION AND SOFTWARE TECHNOLOGY, 1996, 38 (10) : 633 - 645
  • [22] LOW-COST MANAGEMENT OF REPLICATED DATA IN FAULT-TOLERANT DISTRIBUTED SYSTEMS
    JOSEPH, TA
    BIRMAN, KP
    ACM TRANSACTIONS ON COMPUTER SYSTEMS, 1986, 4 (01): : 54 - 70
  • [23] Increasing Performance in Byzantine Fault-Tolerant Systems with On-Demand Replica Consistency
    Distler, Tobias
    Kapitza, Ruediger
    EUROSYS 11: PROCEEDINGS OF THE EUROSYS 2011 CONFERENCE, 2011, : 91 - 105
  • [24] Byzantine Fault-tolerant State-machine Replication from a Systems Perspective
    Distler, Tobias
    ACM COMPUTING SURVEYS, 2021, 54 (01)
  • [25] Proactive service migration for long-running Byzantine fault-tolerant systems
    Zhao, W.
    Zhang, H.
    IET SOFTWARE, 2009, 3 (02) : 154 - 164
  • [26] Rigorous design of fault-tolerant transactions for replicated database systems using event B
    Yadav, Divakar
    Butler, Michael
    RIGOROUS DEVELOPMENT OF COMPLEX FAULT-TOLERANT SYSTEMS, 2006, 4157 : 343 - +
  • [27] Implementing network partition-aware fault-tolerant CORBA systems
    Beyer, Stefan
    Munoz-Escoi, Francesc D.
    Galdamez, Pablo
    ARES 2007: SECOND INTERNATIONAL CONFERENCE ON AVAILABILITY, RELIABILITY AND SECURITY, PROCEEDINGS, 2007, : 69 - +
  • [28] A Novel Low-Latency Regional Fault-Aware Fault-Tolerant Routing Algorithm for Wireless NoC
    Ouyang, Yiming
    Wang, Qi
    Ru, Mengxuan
    Liang, Huaguo
    Li, Jianhua
    IEEE ACCESS, 2020, 8 : 22650 - 22663
  • [29] Cost-aware Capacity Provisioning for Fault-tolerant Geo-distributed Data Centers
    Tripathi, Rakesh
    Vignesh, S.
    Tamarapalli, Venkatesh
    2016 8TH INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS AND NETWORKS (COMSNETS), 2016,
  • [30] Fault-tolerant Consensus of Leader-follower Multi-agent Systems
    Zhao, Dan
    Wu, Yonghong
    Yang, Chao
    Ling, Guang
    Shen, Yanjun
    Guan, Zhi-Hong
    PROCEEDINGS OF THE 28TH CHINESE CONTROL AND DECISION CONFERENCE (2016 CCDC), 2016, : 3288 - 3293