Fault tolerance in big data storage and processing systems: A review on challenges and solutions

被引:16
|
作者
Saadoon, Muntadher [1 ]
Ab Hamid, Siti Hafizah [1 ]
Sofian, Hazrina [1 ]
Altarturi, Hamza H. M. [1 ]
Azizul, Zati Hakim [1 ]
Nasuha, Nur [1 ]
机构
[1] Univ Malaya, Fac Comp Sci & Informat Technol, Dept Software Engn, Kuala Lumpur 50603, Malaysia
关键词
Fault tolerance; Fault detection; Fault recovery; Big data storage; Big data processing; FAILURE RECOVERY; MAPREDUCE; RELIABILITY; AVAILABILITY; REPLICATION; NETWORKS; HADOOP;
D O I
10.1016/j.asej.2021.06.024
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Big data systems are sufficiently stable to store and process a massive volume of rapidly changing data. However, big data systems are composed of large-scale hardware resources that make their subspecies easily fail. Fault tolerance is the main property of such systems because it maintains availability, reliability, and constant performance during faults. Achieving an efficient fault tolerance solution in a big data system is challenging because fault tolerance must meet some constraints related to the system performance and resource consumption. This study aims to provide a consistent understanding of fault tolerance in big data systems and highlights common challenges that hinder the improvement in fault tolerance efficiency. The fault tolerance solutions applied by previous studies intended to address the identified challenges are reviewed. The paper also presents a perceptive discussion of the findings derived from previous studies and proposes a list of future directions to address the fault tolerance challenges. (C) 2021 THE AUTHORS. Published by Elsevier BV on behalf of Faculty of Engineering, Ain Shams University.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Big Data Technologies and Analytics: A Review of Emerging Solutions
    Abdelhafez, Hoda Ahmed
    INTERNATIONAL JOURNAL OF BUSINESS ANALYTICS, 2014, 1 (02) : 1 - 17
  • [42] Fault-Tolerance Implementation in Typical Distributed Stream Processing Systems
    Chen, Wuhong
    Tsai, Jichiang
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2014, 30 (04) : 1167 - 1186
  • [43] Big Graph Processing Systems: State-of-the-art and Open Challenges
    Elshawi, Radwa
    Batarfi, Omar
    Fayoumi, Ayman
    Barnawi, Ahmed
    Sakr, Sherif
    2015 IEEE FIRST INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND APPLICATIONS (BIGDATASERVICE 2015), 2015, : 24 - 33
  • [44] Intelligent Identification over Power Big Data: Opportunities, Solutions, and Challenges
    Luo, Liang
    Li, Xingmei
    Yang, Kaijiang
    Wei, Mengyang
    Chen, Jiong
    Yang, Junqian
    Yao, Liang
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2023, 134 (03): : 1565 - 1595
  • [45] Fault Tolerance in Distributed Systems Using Fused Data Structures
    Balasubramanian, Bharath
    Garg, Vijay K.
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2013, 24 (04) : 701 - 715
  • [46] Task Scheduling in Big Data - Review, Research Challenges, and Prospects
    Govindarajan, Kannan
    Kamburugamuve, Supun
    Wickramasinghe, Pulasthi
    Abeykoon, Vibhatha
    Fox, Geoffrey
    2017 NINTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING (ICOAC), 2017, : 165 - 173
  • [47] A Mini-Review of Machine Learning in Big Data Analytics: Applications, Challenges, and Prospects
    Nti, Isaac Kofi
    Quarcoo, Juanita Ahia
    Aning, Justice
    Fosu, Godfred Kusi
    BIG DATA MINING AND ANALYTICS, 2022, 5 (02): : 81 - 97
  • [48] Achieving privacy-preserving big data aggregation with fault tolerance in smart grid
    Guan, Zhitao
    Si, Guanlin
    DIGITAL COMMUNICATIONS AND NETWORKS, 2017, 3 (04) : 242 - 249
  • [49] A Review on Recent Trends in Query Processing and Optimization in Big Data
    Deepak Kumar
    Vijay Kumar Jha
    Wireless Personal Communications, 2022, 124 : 633 - 654
  • [50] A Review on Recent Trends in Query Processing and Optimization in Big Data
    Kumar, Deepak
    Jha, Vijay Kumar
    WIRELESS PERSONAL COMMUNICATIONS, 2022, 124 (01) : 633 - 654