An Optimized Speculative Execution Strategy Based on Local Data Prediction in a Heterogeneous Hadoop Environment

被引:0
作者
Liu, Xiaodong [1 ]
Liu, Qi [1 ]
机构
[1] Edinburgh Napier Univ, Sch Comp, 10 Colinton Rd, Edinburgh EH10 5DT, Midlothian, Scotland
来源
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE) AND IEEE/IFIP INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (EUC), VOL 2 | 2017年
基金
欧盟地平线“2020”;
关键词
Hadoop; Speculative Execution; Straggling Task; LWR; Prediction Accuracy; IMPROVING MAPREDUCE PERFORMANCE;
D O I
10.1109/CSE-EUC.2017.208
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Hadoop is a famous distributed computing framework that is applied to process large-scale data. "Straggling tasks" have a serious impact on Hadoop performance due to imbalance of slow tasks distribution. Speculative execution (SE) presents a way to deal with Straggling tasks by monitoring the real-time progress of running tasks and replicating potential "Stragglers" on another node to increase the opportunity of completing backup tasks ahead of original. Current proposed SE strategies meet their challenges such as misjudgment of "Straggling tasks", improper selection of backup nodes, etc., which result in inefficient performance of the SE and its Hadoop system. In this paper, we propose an optimized SE strategy based on local data prediction, which collects task execution information in real time and uses Locally Weighted Regression (LWR) to predict remaining time of each running tasks, and selects an appropriate backup task node according to the actual requirements. It also combines a cost-benefit model to maximize the effectiveness of SE. According to the results, the proposed SE strategy implemented in Hadoop-2.6.0 enhances the accuracy of selecting potential Straggler task candidates, and shows better performance in various situations in a heterogeneous Hadoop environment.
引用
收藏
页码:128 / 131
页数:4
相关论文
共 12 条
  • [1] [Anonymous], 2008, 8 USENIX S OP SYST D
  • [2] Improving MapReduce Performance Using Smart Speculative Execution Strategy
    Chen, Qi
    Liu, Cheng
    Xiao, Zhen
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2014, 63 (04) : 954 - 967
  • [3] Huang X., 2015, COMPUTERS ELECT ENG
  • [4] Li Y, 2015, INTELLIGENT COMPUTAT, P284
  • [5] An Exploration of Designing a Hybrid Scale-Up/Out Hadoop Architecture Based on Performance Measurements
    Li, Zhuozhao
    Shen, Haiying
    Ligon, Walter
    Denton, Jeffrey
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2017, 28 (02) : 386 - 400
  • [6] Naik NS, 2014, 2014 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), P677, DOI 10.1109/ICACCI.2014.6968497
  • [7] DynamicMR: A Dynamic Slot Allocation Optimization Framework for MapReduce Clusters
    Tang, Shanjiang
    Lee, Bu-Sung
    He, Bingsheng
    [J]. IEEE TRANSACTIONS ON CLOUD COMPUTING, 2014, 2 (03) : 333 - 347
  • [8] Vaquero LM, 2009, ACM SIGCOMM COMP COM, V39, P50, DOI 10.1145/1496091.1496100
  • [9] Improving MapReduce Performance with Partial Speculative Execution
    Wang, Yaoguang
    Lu, Weiming
    Lou, Renjie
    Wei, Baogang
    [J]. JOURNAL OF GRID COMPUTING, 2015, 13 (04) : 587 - 604
  • [10] A Heuristic Speculative Execution Strategy in Heterogeneous Distributed Environments
    Wu, Huicheng
    Li, Kenli
    Tang, Zhuo
    Zhang, Longxin
    [J]. 2014 SIXTH INTERNATIONAL SYMPOSIUM ON PARALLEL ARCHITECTURES, ALGORITHMS AND PROGRAMMING (PAAP), 2014, : 268 - 273