High performance fault-tolerance for clouds

被引：0

作者：

Kyriazis, Dimosthenis ^{[1
]}

Anagnostopoulos, Vasileios ^{[1
]}

Arcangeli, Andrea ^{[2
]}

Gilbert, David ^{[2
]}

Kalogeras, Dimitrios ^{[3
]}

Kat, Ronen ^{[4
]}

Klein, Cristian ^{[5
]}

Kokkinos, Panagiotis ^{[3
]}

Kuperman, Yossi ^{[4
]}

Nider, Joel ^{[4
]}

Svard, Petter ^{[5
]}

Tomas, Luis ^{[5
]}

Varvarigos, Emmanuel ^{[3
]}

Varvarigou, Theodora ^{[1
]}

机构：

[1] Natl Tech Univ Athens, Iroon Polytech 9, Athens, Greece

[2] Red Hat Ltd, Cork, Ireland

[3] Patras Univ Campus, Comp Technol Inst & Press Diophantus, Rion, Greece

[4] IBM Haifa Res Lab, Haifa, Israel

[5] Umea Univ, SE-90187 Umea, Sweden

来源：

2015 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATION (ISCC) | 2015年

关键词：

cloud computing; fault-tolerance; high-performance; live-migration; resource consolidation;

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Cloud computing and virtualized infrastructures are currently the baseline environments for the provision of services in different application domains. While the number of service consumers increasingly grows, service providers aim at exploiting infrastructures that enable non-disruptive service provisioning, thus minimizing or even eliminating downtime. Nonetheless, to achieve the latter current approaches are either application-specific or cost inefficient, requiring the use of dedicated hardware. In this paper we present the reference architecture of a fault-tolerance scheme, which not only enhances cloud environments with the aforementioned capabilities but also achieves high-performance as required by mission critical every day applications. To realize the proposed approach, a new paradigm for memory and I/O externalization and consolidation is introduced, while current implementation references are also provided.

引用

页码：251 / 257

页数：7

共 50 条

[1] Resilience for Collaborative Applications on Clouds Fault-Tolerance for Distributed HPC Applications
Toan Nguyen
Desideri, Jean-Antoine
COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2012, PT IV, 2012, 7336 : 418 - 433
[2] A Two-Level Fault-Tolerance Technique for High Performance Computing Applications
Aseeri, Aishah M.
Fadel, Mai A.
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (12) : 46 - 54
[3] Fault-Tolerance in the Scope of Cloud Computing
Rehman, A. U.
Aguiar, Rui L.
Barraca, Joao Paulo
IEEE ACCESS, 2022, 10 : 63422 - 63441
[4] Modeling and simulation of high redundancy actuator for fault-tolerance
Manohar, G. Arun
Vasu, V.
Srikanth, K.
MATERIALS TODAY-PROCEEDINGS, 2018, 5 (09) : 18867 - 18873
[5] High-performance and energy-efficient fault-tolerance core mapping in NoC
Beechu, Naresh Kumar Reddy
Harishchandra, Vasantha Moodabettu
Balachandra, Nithin Kumar Yernad
SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS, 2017, 16 : 1 - 10
[6] Simulation relations for fault-tolerance
Demasi, Ramiro
Castro, Pablo F.
Maibaum, Thomas S. E.
Aguirre, Nazareno
FORMAL ASPECTS OF COMPUTING, 2017, 29 (06) : 1013 - 1050
[7] A unified fault-tolerance protocol
Miner, P
Geser, A
Pike, L
Maddalon, J
FORMAL TECHNIQUES, MODELLING AND ANALYSIS OF TIMED AND FAULT-TOLERANT SYSTEMS, PROCEEDINGS, 2004, 3253 : 167 - 182
[8] Fault simulation to validate fault-tolerance in Ada
Napier, J
Chen, LP
May, J
Hughes, G
COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2000, 15 (01): : 61 - 67
[9] Fault-tolerance of a Laboratory Computer Cluster
Mollova, Stoyanka
Georgieva, Penka
Kostadinov, Atanas
2018 20TH INTERNATIONAL SYMPOSIUM ON ELECTRICAL APPARATUS AND TECHNOLOGIES (SIELA), 2018,
[10] The global fault-tolerance of interconnection networks
Harutyunyan, Hovhannes A.
Morosan, Calin D.
SNPD 2006: SEVENTH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING ARTIFICIAL INTELLIGENCE, NETWORKING, AND PARALLEL/DISTRIBUTED COMPUTING, PROCEEDINGS, 2006, : 171 - +

← 1 2 3 4 5 →