High-availability clusters: A taxonomy, survey, and future directions

被引:8
|
作者
Somasekaram, Premathas [1 ]
Calinescu, Radu [1 ]
Buyya, Rajkumar [2 ]
机构
[1] Univ York, Dept Comp Sci, Deramore Lane, York YO10 5GH, N Yorkshire, England
[2] Univ Melbourne, Sch Comp & Informat Syst, Cloud Comp & Distributed Syst CLOUDS Lab, Melbourne, Vic, Australia
关键词
Clustering; Dependability; Enterprise system; High availability; High availability clusters; Reliability; CLOUD; REPLICATION; ARCHITECTURE; SYSTEMS;
D O I
10.1016/j.jss.2021.111208
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The delivery of key services in domains ranging from finance and manufacturing to healthcare and transportation is underpinned by a rapidly growing number of mission-critical enterprise applications. Ensuring the continuity of these complex applications requires the use of software-managed infras-tructures called high-availability clusters (HACs). HACs employ sophisticated techniques to monitor the health of key enterprise application layers and of the resources they use, and to seamlessly restart or relocate application components after failures. In this paper, we first describe the manifold uses of HACs to protect essential layers of a critical application and present the architecture of high availability clusters. We then propose a taxonomy that covers all key aspects of HACs-deployment patterns, application areas, types of cluster, topology, cluster management, failure detection and recovery, consistency and integrity, and data synchronisation; and we use this taxonomy to provide a comprehensive survey of the end-to-end software solutions available for the HAC deployment of enterprise applications. Finally, we discuss the limitations and challenges of existing HAC solutions, and we identify opportunities for future research in the area. (C) 2021 Elsevier Inc. All rights reserved.
引用
收藏
页数:32
相关论文
共 50 条
  • [21] A systematic survey on fault-tolerant solutions for distributed data analytics: Taxonomy, comparison, and future directions
    Isukapalli, Sucharitha
    Srirama, Satish Narayana
    COMPUTER SCIENCE REVIEW, 2024, 53
  • [22] A High-availability Urban Rail Cloud Platform Based on OpenStack: Design, Implementation and Availability Analysis
    Zhu L.
    Li Z.
    Tang T.
    Wang X.
    Tiedao Xuebao/Journal of the China Railway Society, 2024, 46 (02): : 94 - 104
  • [23] Edge AI: A Taxonomy, Systematic Review and Future Directions
    Gill, Sukhpal Singh
    Golec, Muhammed
    Hu, Jianmin
    Xu, Minxian
    Du, Junhui
    Wu, Huaming
    Walia, Guneet Kaur
    Murugesan, Subramaniam Subramanian
    Ali, Babar
    Kumar, Mohit
    Ye, Kejiang
    Verma, Prabal
    Kumar, Surendra
    Cuadrado, Felix
    Uhlig, Steve
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2025, 28 (01):
  • [24] High-Availability Computing Platform with Sensor Fault Resilience
    Lee, Yen-Lin
    Arizky, Shinta Nuraisya
    Chen, Yu-Ren
    Liang, Deron
    Wang, Wei-Jen
    SENSORS, 2021, 21 (02) : 1 - 16
  • [25] A comprehensive survey of DDoS defense solutions in SDN: Taxonomy, research challenges, and future directions
    Kaur, Sukhveer
    Kumar, Krishan
    Aggarwal, Naveen
    Singh, Gurdeep
    COMPUTERS & SECURITY, 2021, 110 (110)
  • [26] A comprehensive survey on the reliability of mobile wireless sensor networks: Taxonomy, challenges, and future directions
    Yue, Ying-Gao
    He, Ping
    INFORMATION FUSION, 2018, 44 : 188 - 204
  • [27] Efficient and Adaptive Stateful Replication for Stream Processing Engines in High-Availability Cluster
    Feng, Yi-Hsuan
    Huang, Nen-Fu
    Wu, Yen-Min
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2011, 22 (11) : 1788 - 1796
  • [28] Construction of High-Availability Bank System in Virtualized Environments
    Neng, Huang
    2017 IEEE SECOND INTERNATIONAL CONFERENCE ON DATA SCIENCE IN CYBERSPACE (DSC), 2017, : 561 - 568
  • [29] Towards an Environment Supporting Resilience, High-Availability, Reproducibility and Reliability for Cloud Applications
    Stankovski, Vlado
    Taherizadeh, Salman
    Taylor, Ian
    Jones, Andrew
    Mastroianni, Carlo
    Becker, Bruce
    Suhartanto, Heru
    2015 IEEE/ACM 8TH INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING (UCC), 2015, : 383 - 386
  • [30] Sustaining Web services high-availability using communities
    Maamar, Zakaria
    Sheng, Quan Z.
    Bensliman, Djamal
    ARES 2008: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON AVAILABILITY, SECURITY AND RELIABILITY, 2008, : 834 - +