Fault-containing self-stabilizing distributed protocols

被引:0
作者
Sukumar Ghosh
Arobinda Gupta
Ted Herman
Sriram V. Pemmaraju
机构
[1] The University of Iowa,Department of Computer Science
[2] Indian Institute of Technology,Department of Computer Science and Engineering
来源
Distributed Computing | 2007年 / 20卷
关键词
Distributed algorithms; Self-stabilization; Fault-containment; Transformer;
D O I
暂无
中图分类号
学科分类号
摘要
Self-stabilization is an elegant approach for designing a class of fault-tolerant distributed protocols. A self-stabilizing protocol is guaranteed to eventually converge to a legitimate state after a transient fault. However, even a minor transient fault can cause vast disruption in the system before legitimacy is reached. This paper introduces the notion of fault-containment to address this particular weakness of self-stabilizing systems. Informally, a fault-containing self-stabilizing protocol, in addition to providing self- stabilization, contains the effects of faults. This ensures that disruption during recovery from faults, is proportional to the extent of the faults. The paper begins with a formal framework for specifying and evaluating fault-containing self-stabilizing protocols. The main result of the paper is a transformer that converts any non-reactive self-stabilizing protocol into an equivalent fault-containing self-stabilizing protocol that can repair any single fault in the system in O(1) time. For a large class of input protocols, the corresponding output protocols produced by the transformer have O(1) space overhead. The small time and space overhead make the fault-containing self-stabilizing protocol a practical alternative to the original self-stabilizing protocol. The transformer is based on a novel stabilizing timer paradigm that significantly simplifies the task of fault-containment.
引用
收藏
页码:53 / 73
页数:20
相关论文
共 24 条
  • [1] Arora A.(1994)Distributed reset IEEE Trans. Comput. 43 1026-1038
  • [2] Gouda M.G.(1997)Component based design of multitolerance IEEE Trans. Softw. Eng. 43 1026-1038
  • [3] Arora A.(1985)Complexity of network synchronization J. ACM 32 804-823
  • [4] Kulkarni S.S.(1999)Self-stabilizing algorithms for finding centers and medians of trees SIAM J. Comput. 29 600-614
  • [5] Awerbuch B.(1991)A self-stabilizing algorithm for constructing spanning trees Informat. Process. Lett. 39 147-151
  • [6] Bruell S.C.(1974)Self stabilizing systems in spite of distributed control Communi. ACM 17 643-644
  • [7] Ghosh S.(1996)An exercise in fault-containment: Self-stabilizing leader election Informat. Process. Lett. 5 281-288
  • [8] Karaata M.H.(1996)A fault-containing self-stabilizing algorithm for spanning trees J. Comput. Informat. 2 322-338
  • [9] Pemmaraju S.V.(1997)A self-stabilizing algorithm for the maximum flow problem Distributed Comput. 10 167-180
  • [10] Chen N.S.(1999)Stabilizing time-adaptive protocols Theor. Comput. Sci. 220 93-111