An adaptive programming model for fault-tolerant distributed computing

被引:14
|
作者
Gorender, Sergio
Macedo, Raimundo Jose de Araujo
Raynal, Michel
机构
[1] Univ Fed Bahia, Dept Comp Sci, Distributed Syst Lab, BR-40170110 Salvador, BA, Brazil
[2] Univ Rennes 1, IRISA, F-35042 Rennes, France
关键词
adaptability; asynchronous/synchronous distributed system; consensus; distributed computing model; fault tolerance; quality of service;
D O I
10.1109/TDSC.2007.3
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The capability of dynamically adapting to distinct runtime conditions is an important issue when designing distributed systems where negotiated quality of service (QoS) cannot always be delivered between processes. Providing fault tolerance for such dynamic environments is a challenging task. Considering such a context, this paper proposes an adaptive programming model for fault-tolerant distributed computing, which provides upper-layer applications with process state information according to the current system synchrony ( or QoS). The underlying system model is hybrid, composed by a synchronous part ( where there are time bounds on processing speed and message delay) and an asynchronous part ( where there is no time bound). However, such a composition can vary over time, and, in particular, the system may become totally asynchronous ( e. g., when the underlying system QoS degrade) or totally synchronous. Moreover, processes are not required to share the same view of the system synchrony at a given time. To illustrate what can be done in this programming model and how to use it, the consensus problem is taken as a benchmark problem. This paper also presents an implementation of the model that relies on a negotiated quality of service ( QoS) for communication channels.
引用
收藏
页码:18 / 31
页数:14
相关论文
共 50 条
  • [31] Fault-Tolerant Consensus of Multi-Agent System With Distributed Adaptive Protocol
    Chen, Shun
    Ho, Daniel W. C.
    Li, Lulu
    Liu, Ming
    IEEE TRANSACTIONS ON CYBERNETICS, 2015, 45 (10) : 2142 - 2155
  • [32] SUPPORTING FAULT-TOLERANT PARALLEL PROGRAMMING IN LINDA
    BAKKEN, DE
    SCHLICHTING, RD
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1995, 6 (03) : 287 - 302
  • [33] Efficient Fault-Tolerant Consensus for Collaborative Services in Edge Computing
    Jing, Guanlin
    Zou, Yifei
    Yu, Dongxiao
    Luo, Chuanwen
    Cheng, Xiuzhen
    IEEE TRANSACTIONS ON COMPUTERS, 2023, 72 (08) : 2139 - 2150
  • [34] CLOSURE AND CONVERGENCE - A FOUNDATION OF FAULT-TOLERANT COMPUTING
    ARORA, A
    GOUDA, M
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1993, 19 (11) : 1015 - 1027
  • [35] A public blockchain consensus mechanism for fault-tolerant distributed computing in LEO satellite communications
    Zhang, Zhen
    Guo, Bing
    Zhu, Lidong
    Shen, Yan
    Qin, Chaoxia
    Li, Chengjie
    CHINA COMMUNICATIONS, 2022, 19 (07) : 110 - 123
  • [36] Fault-Tolerant Dynamic Deduplication for Utility Computing
    Leesakul, Waraporn
    Townend, Paul
    Garraghan, Peter
    Xu, Jie
    2014 IEEE 17TH INTERNATIONAL SYMPOSIUM ON OBJECT/COMPONENT/SERVICE-ORIENTED REAL-TIME DISTRIBUTED COMPUTING (ISORC), 2014, : 397 - 404
  • [37] Fault-tolerant configuration of distributed discrete controllers
    Fujimoto, Y
    Sekiguchi, T
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2003, 50 (01) : 86 - 93
  • [38] Fault-Tolerant Multiplayer Tracking Control for Autonomous Vehicle via Model-Free Adaptive Dynamic Programming
    Pan, Huihui
    Zhang, Chi
    Sun, Weichao
    IEEE TRANSACTIONS ON RELIABILITY, 2023, 72 (04) : 1395 - 1406
  • [39] A Novel Fault-Tolerant Scheme for Distributed Systems
    Zhang, Xiaoqin
    Wei, Zhidong
    Zhang, Fenggui
    Liu, Guoliang
    CEIS 2011, 2011, 15
  • [40] DETECTING UNREALIZABILITY OF DISTRIBUTED FAULT-TOLERANT SYSTEMS
    Finkbeiner, Bernd
    Tentrup, Leander
    LOGICAL METHODS IN COMPUTER SCIENCE, 2015, 11 (03)