High performance fault-tolerance for clouds

被引:0
|
作者
Kyriazis, Dimosthenis [1 ]
Anagnostopoulos, Vasileios [1 ]
Arcangeli, Andrea [2 ]
Gilbert, David [2 ]
Kalogeras, Dimitrios [3 ]
Kat, Ronen [4 ]
Klein, Cristian [5 ]
Kokkinos, Panagiotis [3 ]
Kuperman, Yossi [4 ]
Nider, Joel [4 ]
Svard, Petter [5 ]
Tomas, Luis [5 ]
Varvarigos, Emmanuel [3 ]
Varvarigou, Theodora [1 ]
机构
[1] Natl Tech Univ Athens, Iroon Polytech 9, Athens, Greece
[2] Red Hat Ltd, Cork, Ireland
[3] Patras Univ Campus, Comp Technol Inst & Press Diophantus, Rion, Greece
[4] IBM Haifa Res Lab, Haifa, Israel
[5] Umea Univ, SE-90187 Umea, Sweden
来源
2015 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATION (ISCC) | 2015年
关键词
cloud computing; fault-tolerance; high-performance; live-migration; resource consolidation;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Cloud computing and virtualized infrastructures are currently the baseline environments for the provision of services in different application domains. While the number of service consumers increasingly grows, service providers aim at exploiting infrastructures that enable non-disruptive service provisioning, thus minimizing or even eliminating downtime. Nonetheless, to achieve the latter current approaches are either application-specific or cost inefficient, requiring the use of dedicated hardware. In this paper we present the reference architecture of a fault-tolerance scheme, which not only enhances cloud environments with the aforementioned capabilities but also achieves high-performance as required by mission critical every day applications. To realize the proposed approach, a new paradigm for memory and I/O externalization and consolidation is introduced, while current implementation references are also provided.
引用
收藏
页码:251 / 257
页数:7
相关论文
共 50 条
  • [1] Resilience for Collaborative Applications on Clouds Fault-Tolerance for Distributed HPC Applications
    Toan Nguyen
    Desideri, Jean-Antoine
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2012, PT IV, 2012, 7336 : 418 - 433
  • [2] A Two-Level Fault-Tolerance Technique for High Performance Computing Applications
    Aseeri, Aishah M.
    Fadel, Mai A.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (12) : 46 - 54
  • [3] Fault-Tolerance in the Scope of Cloud Computing
    Rehman, A. U.
    Aguiar, Rui L.
    Barraca, Joao Paulo
    IEEE ACCESS, 2022, 10 : 63422 - 63441
  • [4] Modeling and simulation of high redundancy actuator for fault-tolerance
    Manohar, G. Arun
    Vasu, V.
    Srikanth, K.
    MATERIALS TODAY-PROCEEDINGS, 2018, 5 (09) : 18867 - 18873
  • [5] High-performance and energy-efficient fault-tolerance core mapping in NoC
    Beechu, Naresh Kumar Reddy
    Harishchandra, Vasantha Moodabettu
    Balachandra, Nithin Kumar Yernad
    SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS, 2017, 16 : 1 - 10
  • [6] Simulation relations for fault-tolerance
    Demasi, Ramiro
    Castro, Pablo F.
    Maibaum, Thomas S. E.
    Aguirre, Nazareno
    FORMAL ASPECTS OF COMPUTING, 2017, 29 (06) : 1013 - 1050
  • [7] A unified fault-tolerance protocol
    Miner, P
    Geser, A
    Pike, L
    Maddalon, J
    FORMAL TECHNIQUES, MODELLING AND ANALYSIS OF TIMED AND FAULT-TOLERANT SYSTEMS, PROCEEDINGS, 2004, 3253 : 167 - 182
  • [8] Fault simulation to validate fault-tolerance in Ada
    Napier, J
    Chen, LP
    May, J
    Hughes, G
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2000, 15 (01): : 61 - 67
  • [9] Fault-tolerance of a Laboratory Computer Cluster
    Mollova, Stoyanka
    Georgieva, Penka
    Kostadinov, Atanas
    2018 20TH INTERNATIONAL SYMPOSIUM ON ELECTRICAL APPARATUS AND TECHNOLOGIES (SIELA), 2018,
  • [10] The global fault-tolerance of interconnection networks
    Harutyunyan, Hovhannes A.
    Morosan, Calin D.
    SNPD 2006: SEVENTH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING ARTIFICIAL INTELLIGENCE, NETWORKING, AND PARALLEL/DISTRIBUTED COMPUTING, PROCEEDINGS, 2006, : 171 - +