MrHeter: improving MapReduce performance in heterogeneous environments

被引:0
|
作者
Xiao Zhang
Yanjun Wu
Chen Zhao
机构
来源
Cluster Computing | 2016年 / 19卷
关键词
MapReduce; Heterogeneous cluster; Scheduling; Performance;
D O I
暂无
中图分类号
学科分类号
摘要
As GPUs, ARM CPUs and even FPGAs are widely used in modern computing, a data center gradually develops towards the heterogeneous clusters. However, many well-known programming models such as MapReduce are designed for homogeneous clusters and have poor performance in heterogeneous environments. In this paper, we reconsider the problem and make four contributions: (1) We analyse the causes of MapReduce poor performance in heterogeneous clusters, and the most important one is unreasonable task allocation between nodes with different computing ability. (2) Based on this, we propose MrHeter, which separates MapReduce process into map-shuffle stage and reduce stage, then constructs optimization model separately for them and gets different task allocation mlij,mrij,rij\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$ml_{ij}, mr_{ij}, r_{ij}$$\end{document} for heterogeneous nodes based on computing ability.(3) In order to make it suitable for dynamic execution, we propose D-MrHeter, which includes monitor and feedback mechanism. (4) Finally, we prove that MrHeter and D-MrHeter can greatly decrease total execution time of MapReduce from 30 to 70 % in heterogeneous cluster comparing with original Hadoop, having better performance especially in the condition of heavy-workload and large-difference between nodes computing ability.
引用
收藏
页码:1691 / 1701
页数:10
相关论文
共 50 条
  • [41] Improving MapReduce Performance Using Smart Speculative Execution Strategy
    Chen, Qi
    Liu, Cheng
    Xiao, Zhen
    IEEE TRANSACTIONS ON COMPUTERS, 2014, 63 (04) : 954 - 967
  • [42] Job Classification for MapReduce Scheduler in Heterogeneous Environment
    Deshmukh, Shyam
    Aghav, J. V.
    Chakravarthy, Rohan
    2013 INTERNATIONAL CONFERENCE ON CLOUD & UBIQUITOUS COMPUTING & EMERGING TECHNOLOGIES (CUBE 2013), 2013, : 26 - +
  • [43] Insight and Reduction of MapReduce Stragglers in Heterogeneous Environment
    Zhao, Xia
    Kang, Kai
    Sun, YuZhong
    Song, Yin
    Xu, Minhao
    Pan, Tao
    2013 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2013,
  • [44] Performance-Driven Task Co-Scheduling for MapReduce Environments
    Polo, Jorda
    Carrera, David
    Becerra, Yolanda
    Torres, Jordi
    Ayguade, Eduard
    Steinder, Malgorzata
    Whalley, Ian
    PROCEEDINGS OF THE 2010 IEEE-IFIP NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM, 2010, : 373 - 380
  • [45] Dynamic Scheduling for Speculative Execution to Improve MapReduce Performance in Heterogeneous Environment
    Jung, Hyungjae
    Nakazato, Hidenori
    2014 IEEE 34TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS WORKSHOPS (ICDCSW), 2014, : 119 - 124
  • [46] Improving the efficiency of MapReduce scheduling algorithm in Hadoop
    Thangaselvi, R.
    Ananthbabu, S.
    Jagadeesh, S.
    Aruna, R.
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON APPLIED AND THEORETICAL COMPUTING AND COMMUNICATION TECHNOLOGY (ICATCCT), 2015, : 63 - 68
  • [47] Improving the Performance of Heterogeneous Hadoop Cluster
    VishnuVardhan, Ch. Bhaskar
    Baruah, Pallav Kumar
    2016 FOURTH INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (PDGC), 2016, : 225 - 230
  • [48] Adapting MapReduce for HPC Environments
    Fadika, Zacharia
    Dede, Elif
    Govindaraju, Madhusudhan
    Ramakrishnan, Lavanya
    HPDC 11: PROCEEDINGS OF THE 20TH INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE DISTRIBUTED COMPUTING, 2011, : 263 - 264
  • [49] Virtual Hadoop: MapReduce over Docker Containers with an Auto-Scaling Mechanism for Heterogeneous Environments
    Chen, Yi-Wei
    Hung, Shih-Hao
    Tu, Chia-Heng
    Yeh, Chih Wei
    2016 RESEARCH IN ADAPTIVE AND CONVERGENT SYSTEMS, 2016, : 201 - 206
  • [50] Improving the Performance of MapReduce-based Change Detection Using Sampling
    SaatiAlsoruji, Eihab
    2019 IEEE CONFERENCE ON BIG DATA AND ANALYTICS (ICBDA), 2019, : 12 - 18