A classification framework for straggler mitigation and management in a heterogeneous Hadoop cluster: A state-of-art survey

被引:2
|
作者
Bawankule, Kamalakant Laxman [1 ]
Dewang, Rupesh Kumar [1 ]
Singh, Anil Kumar [1 ]
机构
[1] Motilal Nehru Natl Inst Technol Allahabad, Dept Comp Sci & Engn, Prayagraj, India
关键词
Big Data; Hadoop; HDFS; MapReduce; Stragglers; Data placement; Speculative execution; Heterogeneous environment; Contents; IMPROVING MAPREDUCE PERFORMANCE; DATA PLACEMENT STRATEGY; SPECULATIVE EXECUTION; SCHEDULING ALGORITHM; DATA LOCALITY; AWARE; DESIGN;
D O I
10.1016/j.jksuci.2022.02.021
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Hadoop is the most economical and cheap software framework that allows distributed storage and par-allel processing of more extensive data sets. Hadoop distributed file system (HDFS) allows distributed storage and parallel processing of vast data sets using MapReduce. However, Hadoop's current imple-mentation believes that computing nodes connected in a cluster are homogeneous and distribute the tasks equally. This equal load distribution creates the load imbalance during storage, resource contention during task scheduling, hardware degradation by its excess use, and software misconfiguration during cluster management which are the leading causes of stragglers in heterogeneous Hadoop clusters. Due to hardware heterogeneity, Hadoop's performance degrades in the heterogeneous environment. In our study, the paper reviews and analyzes significant studies. It presents the new classification taxonomy to broadly classify the existing straggler management and mitigation techniques into two approaches: proactive and reactive. It analyses and compares the state of art studies and identifies their limitation based on their results. Finally, the systematic review discusses the open issues and the potential direc-tions for future work to manage and mitigate stragglers from the heterogeneous Hadoop clusters.(c) 2022 The Authors. Published by Elsevier B.V. on behalf of King Saud University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
引用
收藏
页码:7621 / 7644
页数:24
相关论文
共 50 条
  • [1] Straggler Mitigation in Hadoop MapReduce Framework: A Review
    Ajibade, Lukuman Saheed
    Abu Bakar, Kamalrulnizam
    Aliyu, Ahmed
    Danish, Tasneem
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (08) : 870 - 878
  • [2] INTEGER PROGRAMMING ALGORITHMS - FRAMEWORK AND STATE-OF-ART SURVEY
    GEOFFRION, AM
    MARSTEN, RE
    MANAGEMENT SCIENCE SERIES A-THEORY, 1972, 18 (09): : 465 - 491
  • [3] Historical data based approach for straggler avoidance in a heterogeneous Hadoop cluster
    Bawankule, Kamalakant Laxman
    Dewang, Rupesh Kumar
    Singh, Anil Kumar
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 12 (10) : 9573 - 9589
  • [4] Historical data based approach for straggler avoidance in a heterogeneous Hadoop cluster
    Kamalakant Laxman Bawankule
    Rupesh Kumar Dewang
    Anil Kumar Singh
    Journal of Ambient Intelligence and Humanized Computing, 2021, 12 : 9573 - 9589
  • [5] LEVITATION MELTING - A SURVEY OF STATE-OF-ART
    PEIFER, WA
    JOURNAL OF METALS, 1965, 17 (05): : 487 - &
  • [6] COMPUTER MEMORIES - SURVEY OF STATE-OF-ART
    RAJCHMAN, JA
    PROCEEDINGS OF THE INSTITUTE OF RADIO ENGINEERS, 1961, 49 (01): : 104 - &
  • [7] Human Tracking: A State-of-Art Survey
    Watada, Junzo
    Musa, Zalili
    Jain, Lakhmi C.
    Fulcher, John
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT II, 2010, 6277 : 454 - +
  • [8] An Empirical Analysis of State-of-Art Classification Models in an IT Incident Severity Prediction Framework
    Ahmed, Salman
    Singh, Muskaan
    Doherty, Brendan
    Ramlan, Effirul
    Harkin, Kathryn
    Bucholc, Magda
    Coyle, Damien
    APPLIED SCIENCES-BASEL, 2023, 13 (06):
  • [9] ANIMAL WASTE MANAGEMENT - STATE-OF-ART
    不详
    JOURNAL OF THE ENVIRONMENTAL ENGINEERING DIVISION-ASCE, 1978, 104 (06): : 1239 - 1262
  • [10] Image Segmentation - A State-Of-Art Survey for Prediction
    Raut, Shital
    Raghuvanshi, M.
    Dharaskar, R.
    Raut, Adarsh
    INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER CONTROL : ICACC 2009 - PROCEEDINGS, 2009, : 420 - +