Booting clock synchronization in partially synchronous systems with hybrid process and link failures

Cited by: 17
Authors
Widder, Josef
Schmid, Ulrich
Affiliations
[1] Vienna Univ Technol, Embedded Comp Syst Grp, A-1040 Vienna, Austria
[2] Ecole Polytech, Lab Informat LIX, F-91128 Palaiseau, France
Keywords
fault-tolerant distributed algorithms; initial clock synchronization; system start-up; hybrid failure models; link failures; partially synchronous systems
DOI
10.1007/s00446-007-0026-0
Chinese Library Classification
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
This paper describes and analyzes a new clock synchronization algorithm for synchronous and partially synchronous systems with unknown upper and lower bounds on delays. It is purely message-driven and timer-free, and relies on a hybrid failure model incorporating both process and link failures, in both the time and value domains. Unlike existing solutions, our algorithm works during both system start-up and normal operation: whereas bounded precision (the mutual deviation of any two clocks) can always be guaranteed, accuracy (clocks being within a linear envelope of real time), and hence progress, is only ensured when sufficiently many correct processes are eventually up and running. By means of a detailed analysis, we provide formulas for resilience, precision and envelope bounds.
Pages: 115-140
Number of pages: 26