Design and implementation of a consistent time service for fault-tolerant distributed systems

Cited by: 2
Authors
Zhao, W [1]
Moser, LE [1]
Melliar-Smith, PM [1]
Affiliations
[1] Eternal Syst Inc, Santa Barbara, CA 93111 USA
Source
2003 INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS, PROCEEDINGS | 2003
Keywords
DOI
10.1109/DSN.2003.1209945
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Clock-related operations are one of the many sources of replica non-determinism and of replica inconsistency in fault-tolerant distributed systems. In passive replication, if the primary server crashes, the next clock value returned by the new primary server might have actually rolled back in time, which can lead to undesirable consequences for the replicated application. The same problem can happen for active replication where the result of the first replica to respond is taken as the next clock value. In this paper, we describe the design and implementation of a consistent time service for fault-tolerant distributed systems. The consistent time service introduces a group clock that is consistent across the replicas and that ensures the determinism of the replicas with respect to clock-related operations. The group clock is monotonically increasing, is transparent to the application and is fault-tolerant. The consistent time service guarantees the consistency of the group clock even when faults occur, when new replicas are added into the group and when failed replicas recover.
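The rollback problem and the group-clock idea from the abstract can be illustrated with a minimal sketch. This is not the paper's algorithm: the `Replica` class, the `run_round` helper, and the rule "the proposal ordered first wins" are assumptions made for illustration, and a totally ordered multicast among the replicas is assumed rather than implemented.

```python
from dataclasses import dataclass


@dataclass
class Replica:
    """Hypothetical replica with its own, possibly skewed, local clock."""
    name: str
    offset: float = 0.0                # group clock minus local clock, learned so far
    last_group: float = float("-inf")  # last group clock value returned

    def propose(self, local_now: float) -> float:
        # Proposal = local clock corrected by the offset agreed so far.
        return local_now + self.offset

    def deliver(self, local_now: float, agreed: float) -> float:
        # Every replica applies the same agreed value, clamped so that
        # the group clock never moves backwards.
        value = max(agreed, self.last_group)
        self.offset = value - local_now
        self.last_group = value
        return value


def run_round(replicas, local_now, first):
    """Simulate one clock read.

    `first` names the replica whose proposal is ordered first; a real
    system would obtain this from a totally ordered multicast (assumed).
    """
    agreed = replicas[first].propose(local_now[first])
    return {name: r.deliver(local_now[name], agreed)
            for name, r in replicas.items()}


if __name__ == "__main__":
    replicas = {"A": Replica("A"), "B": Replica("B")}
    # Primary A reads 100.0 locally; backup B's local clock is behind at 90.0.
    print(run_round(replicas, {"A": 100.0, "B": 90.0}, first="A"))
    # A crashes. B's raw clock (91.0) would roll time back; the group clock does not.
    del replicas["A"]
    print(run_round(replicas, {"B": 91.0}, first="B"))
```

In this sketch the backup learns its offset from the primary's agreed value, so after failover its next reading continues from the last group clock value instead of falling back to its own slower local clock.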
Pages: 341-350
Number of pages: 10