An improved data placement strategy in a heterogeneous Hadoop cluster

被引:0
作者
Zhao, Wentao [1 ,2 ]
Meng, Lingjun [1 ]
Sun, Jiangfeng [1 ,2 ]
Ding, Yang [1 ]
Zhao, Haohao [1 ]
Wang, Lina [1 ,2 ]
机构
[1] School of Computer Science and Technology, Henan Polytechnic University, Jiaozuo
[2] Opening Project of Key Laboratory of Mine Informatization, Henan Polytechnic University, Jiaozuo, 454000, Henan
来源
Open Cybernetics and Systemics Journal | 2014年 / 8卷 / 01期
关键词
Data placement; Disk space utilization; HDFS; Network load; Nodes heterogeneity;
D O I
10.2174/1874110X01408010957
中图分类号
学科分类号
摘要
Hadoop Distributed File System (HDFS) is designed to store big data reliably, and to stream these data at high bandwidth to user applications. However, the default HDFS block placement policy assumes that all nodes in the cluster are homogeneous, and randomly place blocks without considering any nodes’ resource characteristics, which decreases self-adaptability of the system. In this paper, we take account nodes heterogeneities, such as utilization of nodes’ disk space, and put forward an improved blocks placement strategy for solving some drawbacks in the default HDFS. The simulation experiments indicate that our improved strategy performs much better not only in the data distribution but also significantly saves more time than the default blocks placement. © Zhao et al.
引用
收藏
页码:957 / 963
页数:6
相关论文
共 50 条
  • [1] An Improved data placement strategy in a heterogeneous hadoop cluster
    Zhao, Wentao
    Meng, Lingjun
    Sun, Jiangfeng
    Ding, Yang
    Zhao, Haohao
    Wang, Lina
    Open Cybernetics and Systemics Journal, 2015, 9 (01): : 792 - 798
  • [2] On a Dynamic Data Placement Strategy for Heterogeneous Hadoop Clusters
    Liu, Yang
    Wu, Chase Q.
    Wang, Meng
    Hou, Aiqin
    Wang, Yongqiang
    2018 INTERNATIONAL SYMPOSIUM ON NETWORKS, COMPUTERS AND COMMUNICATIONS (ISNCC 2018), 2018,
  • [3] An improved data placement strategy for hadoop
    Lin, Wei-Wei
    Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2012, 40 (01): : 152 - 158
  • [4] A Dynamic Data Placement Policy for Heterogeneous Hadoop Cluster
    Shithil, Santa Maria
    Saha, Tushar Kanti
    Sharma, Tanusree
    2017 4TH INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRICAL ENGINEERING (ICAEE), 2017, : 302 - 307
  • [5] A Dynamic Data Placement Strategy for Hadoop in Heterogeneous Environments
    Lee, Chia-Wei
    Hsieh, Kuang-Yu
    Hsieh, Sun-Yuan
    Hsiao, Hung-Chang
    BIG DATA RESEARCH, 2014, 1 : 14 - 22
  • [6] Optimizing data placement in heterogeneous Hadoop clusters
    Runqun Xiong
    Junzhou Luo
    Fang Dong
    Cluster Computing, 2015, 18 : 1465 - 1480
  • [7] Optimizing data placement in heterogeneous Hadoop clusters
    Xiong, Runqun
    Luo, Junzhou
    Dong, Fang
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2015, 18 (04): : 1465 - 1480
  • [8] New Data Placement Strategy in the HADOOP Framework
    Elomari, Akram
    Hassouni, Larbi
    Maizate, Abderrahim
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (07) : 676 - 684
  • [9] IDaPS - Improved data-locality aware data placement strategy based on Markov clustering to enhance MapReduce performance on Hadoop
    Vengadeswaran, S.
    Balasundaram, S. R.
    Dhavakumar, P.
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2024, 36 (03)
  • [10] HaDaap: A hotness-aware data placement strategy for improving storage efficiency in heterogeneous Hadoop clusters
    Xiong, Runqun
    Du, Yao
    Jin, Jiahui
    Luo, Junzhou
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2018, 30 (20)