MrHeter: improving MapReduce performance in heterogeneous environments

被引:0
|
作者
Xiao Zhang
Yanjun Wu
Chen Zhao
机构
来源
Cluster Computing | 2016年 / 19卷
关键词
MapReduce; Heterogeneous cluster; Scheduling; Performance;
D O I
暂无
中图分类号
学科分类号
摘要
As GPUs, ARM CPUs and even FPGAs are widely used in modern computing, a data center gradually develops towards the heterogeneous clusters. However, many well-known programming models such as MapReduce are designed for homogeneous clusters and have poor performance in heterogeneous environments. In this paper, we reconsider the problem and make four contributions: (1) We analyse the causes of MapReduce poor performance in heterogeneous clusters, and the most important one is unreasonable task allocation between nodes with different computing ability. (2) Based on this, we propose MrHeter, which separates MapReduce process into map-shuffle stage and reduce stage, then constructs optimization model separately for them and gets different task allocation mlij,mrij,rij\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$ml_{ij}, mr_{ij}, r_{ij}$$\end{document} for heterogeneous nodes based on computing ability.(3) In order to make it suitable for dynamic execution, we propose D-MrHeter, which includes monitor and feedback mechanism. (4) Finally, we prove that MrHeter and D-MrHeter can greatly decrease total execution time of MapReduce from 30 to 70 % in heterogeneous cluster comparing with original Hadoop, having better performance especially in the condition of heavy-workload and large-difference between nodes computing ability.
引用
收藏
页码:1691 / 1701
页数:10
相关论文
共 50 条
  • [31] Optimizing Power and Performance Trade-offs of MapReduce Job Processing with Heterogeneous Multi-Core Processors
    Yan, Feng
    Cherkasova, Ludmila
    Zhang, Zhuoyao
    Smirni, Evgenia
    2014 IEEE 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD), 2014, : 240 - 247
  • [32] Improving Resource Utilization in MapReduce
    Guo, Zhenhua
    Fox, Geoffrey
    Zhou, Mo
    Ruan, Yang
    2012 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2012, : 402 - 410
  • [33] Data Preloading and Data Placement for MapReduce Performance Improving
    Spivak, Anton
    Nasonov, Denis
    5TH INTERNATIONAL YOUNG SCIENTIST CONFERENCE ON COMPUTATIONAL SCIENCE, YSC 2016, 2016, 101 : 379 - 387
  • [34] HAT: history-based auto-tuning MapReduce in heterogeneous environments
    Chen, Quan
    Guo, Minyi
    Deng, Qianni
    Zheng, Long
    Guo, Song
    Shen, Yao
    JOURNAL OF SUPERCOMPUTING, 2013, 64 (03): : 1038 - 1054
  • [35] Improving MapReduce Performance via Heterogeneity-Load-Aware Partition Function
    Sun, Huifeng
    Chen, Junliang
    Liu, ChuanChang
    Zheng, Zibin
    Yu, Nan
    Yang, Zhi
    2011 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2011, : 557 - 560
  • [36] HAT: history-based auto-tuning MapReduce in heterogeneous environments
    Quan Chen
    Minyi Guo
    Qianni Deng
    Long Zheng
    Song Guo
    Yao Shen
    The Journal of Supercomputing, 2013, 64 : 1038 - 1054
  • [37] Adaptive MapReduce Scheduling in Shared Environments
    Polo, Jorda
    Becerra, Yolanda
    Carrera, David
    Torres, Jordi
    Ayguade, Eduard
    Steinder, Malgorzata
    2014 14TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2014, : 61 - 70
  • [38] Tarazu: Optimizing MapReduce On Heterogeneous Clusters
    Ahmad, Faraz
    Chakradhar, Srimat
    Raghunathan, Anand
    Vijaykumar, T. N.
    ACM SIGPLAN NOTICES, 2012, 47 (04) : 61 - 74
  • [39] MapReduce Task Scheduling in Heterogeneous Geo-Distributed Data Centers
    Li, Xiaoping
    Chen, Fuchao
    Ruiz, Ruben
    Zhu, Jie
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2022, 15 (06) : 3317 - 3329
  • [40] Task failure resilience technique for improving the performance of MapReduce in Hadoop
    Kavitha, C.
    Anita, X.
    ETRI JOURNAL, 2020, 42 (05) : 751 - 763