Engineering Adaptive Fault-Tolerance Mechanisms for Resilient Computing on ROS

Cited by: 9
Authors
Lauer, Michael [2]
Amy, Matthieu [3]
Fabre, Jean-Charles [3]
Roy, Matthieu [1]
Excoffon, William [3]
Stoicescu, Miruna [4]
Affiliations
[1] CNRS, LAAS, Ave Colonel Roche, F-31400 Toulouse, France
[2] Univ Toulouse, UPS, LAAS, F-31400 Toulouse, France
[3] Univ Toulouse, INP, LAAS, F-31400 Toulouse, France
[4] ESA, ESOC, Darmstadt, Germany
Source
2016 IEEE 17TH INTERNATIONAL SYMPOSIUM ON HIGH ASSURANCE SYSTEMS ENGINEERING (HASE) | 2016
DOI
10.1109/HASE.2016.30
Chinese Library Classification
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Systems are expected to evolve during their service life in order to cope with changes of various kinds, ranging from fluctuations in available resources to additional features requested by users. For dependable embedded systems, the challenge is even greater, as evolution must not impair dependability attributes. Resilient computing implies maintaining dependability properties when facing changes. Resilience encompasses several aspects, among them evolvability, i.e., the capacity of a system to evolve during its service life. In this paper, we discuss the evolution of systems with respect to their dependability mechanisms, and show how such mechanisms can evolve accordingly. Starting from a component-based approach that clarifies the concepts, the process, and the techniques needed to address resilient computing, in particular the adaptation of fault tolerance (or safety) mechanisms, we show how Adaptive Fault Tolerance (AFT) can be implemented with ROS. Beyond implementation, we draw lessons from this work and discuss the limits of this runtime support for implementing such resilient computing features in embedded systems.
Pages: 94 - 101
Number of pages: 8
Related papers
50 in total
  • [1] Resilient computing on ROS using adaptive fault tolerance
    Lauer, Michael
    Amy, Matthieu
    Fabre, Jean-Charles
    Roy, Matthieu
    Excoffon, William
    Stoicescu, Miruna
    JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2018, 30 (03)
  • [2] Multicenter Hierarchical Federated Learning With Fault-Tolerance Mechanisms for Resilient Edge Computing Networks
    Chen, Xiaohong
    Xu, Guanying
    Xu, Xuesong
    Jiang, Haichong
    Tian, Zhiping
    Ma, Tao
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 47 - 61
  • [3] Analysis of Adaptive Fault Tolerance for Resilient Computing
    Excoffon, William
    Fabre, Jean-Charles
    Lauer, Michael
    2017 13TH EUROPEAN DEPENDABLE COMPUTING CONFERENCE (EDCC 2017), 2017, : 50 - 57
  • [4] Local decisions and triggering mechanisms for adaptive fault-tolerance
    Stanley-Marbell, P
    Marculescu, D
    DESIGN, AUTOMATION AND TEST IN EUROPE CONFERENCE AND EXHIBITION, VOLS 1 AND 2, PROCEEDINGS, 2004, : 968 - 973
  • [5] A Fault-Tolerance Shim for Serverless Computing
    Sreekanti, Vikram
    Wu, Chenggang
    Chhatrapati, Saurav
    Gonzalez, Joseph E.
    Hellerstein, Joseph M.
    Faleiro, Jose M.
    PROCEEDINGS OF THE FIFTEENTH EUROPEAN CONFERENCE ON COMPUTER SYSTEMS (EUROSYS'20), 2020,
  • [6] Fault-Tolerance in the Scope of Cloud Computing
    Rehman, A. U.
    Aguiar, Rui L.
    Barraca, Joao Paulo
    IEEE ACCESS, 2022, 10 : 63422 - 63441
  • [7] Exploit Failure Prediction for Adaptive Fault-Tolerance in Cluster Computing
    Li, Yawei
    Lan, Zhiling
    SIXTH IEEE INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID: SPANNING THE WORLD AND BEYOND, 2006, : 531 - +
  • [8] Adaptive fault-tolerance QoS for whiteboard errors based on RCSM for ubiquitous computing
    Ko, Eung Nam
    Kim, SoonGohn
    ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS: WITH ASPECTS OF CONTEMPORARY INTELLIGENT COMPUTING TECHNIQUES, 2007, 2 : 571 - +
  • [9] A new fault-tolerance framework for grid computing
    Derbal, Youcef
    MULTIAGENT AND GRID SYSTEMS, 2006, 2 (02) : 115 - 133
  • [10] METHODS AND MODELS FOR COMPUTING SURVIVABILITY AND FAULT-TOLERANCE OF A NETWORK
    GAGIN, AA
    MICROELECTRONICS AND RELIABILITY, 1993, 33 (10) : 1533 - 1552