A Task-Based Greedy Scheduling Algorithm for Minimizing Energy of MapReduce Jobs

被引:8
作者
Yousefi, Mostafa Hadadian Nejad [1 ]
Goudarzi, Maziar [1 ]
机构
[1] Sharif Univ Technol, Comp Engn Dept, Tehran, Iran
关键词
Big data; Energy-aware; MapReduce; Scheduling; Heterogeneous systems; PERFORMANCE;
D O I
10.1007/s10723-018-9464-0
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
MapReduce and its open source implementation, Hadoop, have gained widespread adoption for parallel processing of big data jobs. Since the number of such big data jobs is also rapidly rising, reducing their energy consumption is increasingly more important to reduce environmental impact as well as operational costs. Prior work by Mashayekhy et al. (IEEE Trans. Parallel Distributed Syst. 26, 2720-2733, 2016), has tackled the problem of energy-aware scheduling of a single MapReduce job but we provide a far more efficient heuristic in this paper. We first model the problem as an Integer Linear Program to find the optimal solution using ILP solvers. Then we present a task-based greedy scheduling algorithm, TGSAVE, to select a slot for each task to minimize the total energy consumption of the MapReduce job for big data applications in heterogeneous environments without significant performance loss while satisfying the service level agreement (SLA). We perform several experiments on a Hadoop cluster to measure characteristics of tasks for nine different applications to evaluate our proposed algorithm. The results show that the total energy consumption of MapReduce jobs obtained by TGSAVE is up to 35% less than that achieved by EMRSA proposed in Mashayekhy et al. (IEEE Trans. Parallel Distributed Syst. 26, 2720-2733, 2016), its closest rival, for same workloads. Besides, TGSAVE is capable of finding a solution in same order of time for up to 74% tighter deadlines than the tightest deadline that EMRSA can find a feasible one. On average, TGSAVE solution is approximately 1.4% far from the optimal solution, and it can meet deadlines as tight as 12%, on average, above the energy-oblivious minimum makespan in the benchmarks we examined.
引用
收藏
页码:535 / 551
页数:17
相关论文
共 50 条
  • [1] A Task-Based Greedy Scheduling Algorithm for Minimizing Energy of MapReduce Jobs
    Mostafa Hadadian Nejad Yousefi
    Maziar Goudarzi
    Journal of Grid Computing, 2018, 16 : 535 - 551
  • [2] Energy-aware Scheduling of MapReduce Jobs
    Mashayekhy, Lena
    Nejad, Mahyar Movahed
    Grosu, Daniel
    Lu, Dajun
    Shi, Weisong
    2014 IEEE INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS), 2014, : 32 - 39
  • [3] Energy-aware Task Scheduling of MapReduce Cluster
    Wang, Jia
    Li, Xiaoping
    Yang, Jie
    2015 INTERNATIONAL CONFERENCE ON SERVICE SCIENCE (ICSS), 2015, : 187 - 194
  • [4] Energy-Aware Scheduling of MapReduce Jobs for Big Data Applications
    Mashayekhy, Lena
    Nejad, Mahyar Movahed
    Grosu, Daniel
    Zhang, Quan
    Shi, Weisong
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2015, 26 (10) : 2720 - 2733
  • [5] Task Scheduling for MapReduce Based on Heterogeneous Networks
    Wang, Jia
    Li, Xiaoping
    HUMAN CENTERED COMPUTING, HCC 2014, 2015, 8944 : 278 - 289
  • [6] Energy Utilization Task Scheduling for MapReduce in Heterogeneous Clusters
    Wang, Jia
    Li, Xiaoping
    Ruiz, Ruben
    Yang, Jie
    Chu, Dianhui
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2022, 15 (02) : 931 - 944
  • [7] Shuffle Scheduling for MapReduce Jobs Based on Periodic Network Status
    Fan, Yuqi
    Liu, Wenlong
    Guo, Dan
    Wu, Weili
    Du, Dingzhu
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2020, 28 (04) : 1832 - 1844
  • [8] A MapReduce task scheduling algorithm for deadline constraints
    Zhuo Tang
    Junqing Zhou
    Kenli Li
    Ruixuan Li
    Cluster Computing, 2013, 16 : 651 - 662
  • [9] A MapReduce task scheduling algorithm for deadline constraints
    Tang, Zhuo
    Zhou, Junqing
    Li, Kenli
    Li, Ruixuan
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2013, 16 (04): : 651 - 662
  • [10] Comparison of Time and Energy Oriented Scheduling for Task-Based Programs
    Rauber, Thomas
    Ruenger, Gudula
    PARALLEL PROCESSING AND APPLIED MATHEMATICS (PPAM 2017), PT I, 2018, 10777 : 185 - 196