Moving Hadoop into the Cloud with Flexible Slot Management and Speculative Execution

Cited by: 27
Authors
Guo, Yanfei [1]
Rao, Jia [1]
Jiang, Changjun [2]
Zhou, Xiaobo [1]
Affiliations
[1] Univ Colorado, Dept Comp Sci, Colorado Springs, CO 80918 USA
[2] Tongji Univ, Key Lab Embedded Syst & Serv Comp, Minist Educ, Shanghai 201804, Peoples R China
Funding
National Science Foundation (USA)
Keywords
MapReduce; cloud; data skew; stragglers; task slot management; speculative execution; performance
DOI
10.1109/TPDS.2016.2587641
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Load imbalance is a major source of overhead in parallel programs such as MapReduce. Due to the uneven distribution of input data, tasks with more data become stragglers and delay overall job completion. Running Hadoop in a private cloud opens up opportunities for expediting stragglers with more resources, but it also introduces problems that often outweigh the performance gain: (1) performance interference from co-running jobs may create new stragglers; (2) a semantic gap between Hadoop task management and resource pool-based virtual cluster management prevents tasks from using resources efficiently. In this paper, we strive to make Hadoop more resilient to data skew and more efficient in cloud environments. We present FlexSlot, a user-transparent task slot management scheme that automatically identifies map stragglers and resizes their slots accordingly to accelerate task execution. FlexSlot adaptively changes the number of slots on each virtual node to balance resource usage so that the pool of resources can be efficiently utilized. FlexSlot further improves mitigation of data skew with an adaptive speculative execution strategy. Experimental results show that FlexSlot effectively reduces job completion time by up to 47.2 percent compared to stock Hadoop and two recently proposed skew mitigation and speculative execution approaches.
Pages: 798-812
Page count: 15