Fault tolerance in big data storage and processing systems: A review on challenges and solutions

被引:16
|
作者
Saadoon, Muntadher [1 ]
Ab Hamid, Siti Hafizah [1 ]
Sofian, Hazrina [1 ]
Altarturi, Hamza H. M. [1 ]
Azizul, Zati Hakim [1 ]
Nasuha, Nur [1 ]
机构
[1] Univ Malaya, Fac Comp Sci & Informat Technol, Dept Software Engn, Kuala Lumpur 50603, Malaysia
关键词
Fault tolerance; Fault detection; Fault recovery; Big data storage; Big data processing; FAILURE RECOVERY; MAPREDUCE; RELIABILITY; AVAILABILITY; REPLICATION; NETWORKS; HADOOP;
D O I
10.1016/j.asej.2021.06.024
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Big data systems are sufficiently stable to store and process a massive volume of rapidly changing data. However, big data systems are composed of large-scale hardware resources that make their subspecies easily fail. Fault tolerance is the main property of such systems because it maintains availability, reliability, and constant performance during faults. Achieving an efficient fault tolerance solution in a big data system is challenging because fault tolerance must meet some constraints related to the system performance and resource consumption. This study aims to provide a consistent understanding of fault tolerance in big data systems and highlights common challenges that hinder the improvement in fault tolerance efficiency. The fault tolerance solutions applied by previous studies intended to address the identified challenges are reviewed. The paper also presents a perceptive discussion of the findings derived from previous studies and proposes a list of future directions to address the fault tolerance challenges. (C) 2021 THE AUTHORS. Published by Elsevier BV on behalf of Faculty of Engineering, Ain Shams University.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Cost of Fault-Tolerance on Data Stream Processing
    Vianello, Valerio
    Patino-Martinez, Marta
    Azqueta-Alzuar, Ainhoa
    Jimenez-Peris, Ricardo
    EURO-PAR 2018: PARALLEL PROCESSING WORKSHOPS, 2019, 11339 : 17 - 27
  • [22] A comprehensive study on fault tolerance in stream processing systems
    Wang, Xiaotong
    Zhang, Chunxi
    Fang, Junhua
    Zhang, Rong
    Qian, Weining
    Zhou, Aoying
    FRONTIERS OF COMPUTER SCIENCE, 2022, 16 (02)
  • [23] A comprehensive study on fault tolerance in stream processing systems
    Xiaotong Wang
    Chunxi Zhang
    Junhua Fang
    Rong Zhang
    Weining Qian
    Aoying Zhou
    Frontiers of Computer Science, 2022, 16
  • [24] A comprehensive study on fault tolerance in stream processing systems
    Xiaotong WANG
    Chunxi ZHANG
    Junhua FANG
    Rong ZHANG
    Weining QIAN
    Aoying ZHOU
    Frontiers of Computer Science, 2022, 16 (02) : 80 - 97
  • [25] Big Data Distributed Storage and Processing Case Studies
    Islam, Tariqul
    Abid, Mehedi Hasan
    THIRD INTERNATIONAL CONFERENCE ON IMAGE PROCESSING AND CAPSULE NETWORKS (ICIPCN 2022), 2022, 514 : 826 - 837
  • [26] Performance Challenges and Solutions in Big Data Platform Hadoop
    Singh B.
    Verma H.K.
    Madaan V.
    Recent Advances in Computer Science and Communications, 2023, 16 (09)
  • [27] Review on LDPC Codes for Big Data Storage
    Bhuvaneshwari, P. V.
    Tharini, C.
    WIRELESS PERSONAL COMMUNICATIONS, 2021, 117 (02) : 1601 - 1625
  • [28] Processing Big Data in streaming for fault prediction: an industrial application
    Corallo, Angelo
    Crespino, Annamaria
    DiBiccari, Carla
    Lazoi, Mariangela
    Lezzi, Marianna
    2018 14TH INTERNATIONAL CONFERENCE ON SIGNAL IMAGE TECHNOLOGY & INTERNET BASED SYSTEMS (SITIS), 2018, : 730 - 736
  • [29] Optimization of Management and Processing of Big Data on a Platform for Distributed Data Storage
    Neric, Vedrana
    Sarajlic, Nermin
    Hadzic, Dulaga
    ELEKTROTEHNISKI VESTNIK, 2024, 91 (05): : 272 - 283
  • [30] Parallel Processing Systems for Big Data: A Survey
    Zhang, Yunquan
    Cao, Ting
    Li, Shigang
    Tian, Xinhui
    Yuan, Liang
    Jia, Haipeng
    Vasilakos, Athanasios V.
    PROCEEDINGS OF THE IEEE, 2016, 104 (11) : 2114 - 2136