Dynamic Scheduling for Speculative Execution to Improve MapReduce Performance in Heterogeneous Environment

被引:5
作者
Jung, Hyungjae [1 ]
Nakazato, Hidenori [1 ]
机构
[1] Waseda Univ, Grad Sch Global Informat & Telecommun Studies, Tokyo, Japan
来源
2014 IEEE 34TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS WORKSHOPS (ICDCSW) | 2014年
关键词
Cloud Computing; MapReduce; Speculative Execution; Heterogeneous environment; DSSE;
D O I
10.1109/ICDCSW.2014.23
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
MapReduce framework allows users to quickly develop big-data applications and process big-data effectively. However, unexpected malfunction may be found in cloud environment because a distributed system consists of several hardware, and this malfunction often causes delay of overall processing. MapReduce framework provides Speculative Execution (SE). SE reduces delay in a homogeneous environment by assigning delayed tasks to additional nodes. As cloud computing prevails, cloud computing environment is moving from homogeneous to heterogeneous. Original SE is not perfect and sometimes produces inefficient result in a heterogeneous environment. This paper proposes Dynamic Scheduling for Speculative Execution (DSSE) which enhances performance in a heterogeneous environment by improving existing SE. DSSE prevents wasted SE since it calculates processing capability of each node more objectively and precisely. DSSE has reduced entire processing time approximately 10% compared to original SE. Success rate of SE was 100%.
引用
收藏
页码:119 / 124
页数:6
相关论文
共 19 条
[1]   A View of Cloud Computing [J].
Armbrust, Michael ;
Fox, Armando ;
Griffith, Rean ;
Joseph, Anthony D. ;
Katz, Randy ;
Konwinski, Andy ;
Lee, Gunho ;
Patterson, David ;
Rabkin, Ariel ;
Stoica, Ion ;
Zaharia, Matei .
COMMUNICATIONS OF THE ACM, 2010, 53 (04) :50-58
[2]   Tiled-MapReduce: Optimizing Resource Usages of Data-parallel Applications on Multicore with Tiling [J].
Chen, Rong ;
Chen, Haibo ;
Zang, Binyu .
PACT 2010: PROCEEDINGS OF THE NINETEENTH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, 2010, :523-534
[3]  
Dean J, 2008, P 5 S OP SYST DES IM, P137
[4]   Toward Efficient and Simplified Distributed Data Intensive Computing [J].
Gu, Yunhong ;
Grossman, Robert .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2011, 22 (06) :974-984
[5]  
Jiang Wei., 2009, Proceedings of the 2009 IEEE Cluster, P1
[6]  
Kambatla K., 2009, P 1 WORKSH HOT TOP C
[7]  
Kim SG, 2011, J INF SCI ENG, V27, P1137
[8]  
Kruijf M., 2009, IBM J RES DEV
[9]  
Minqi Zhou, 2010, Proceedings 2010 Sixth International Conference on Semantics Knowledge and Grid (SKG 2010), P97, DOI 10.1109/SKG.2010.18
[10]   Cloud Computing The New Frontier of Internet Computing [J].
Pallis, George .
IEEE INTERNET COMPUTING, 2010, 14 (05) :70-73