Comparative Analysis of Energy-Efficient Scheduling Algorithms for Big Data Applications

被引:4
作者
Li, Hongjian [1 ,2 ]
Wang, Huochen [1 ]
Xiong, Anping [1 ]
Lai, Jun [1 ]
Tian, Wenhong [2 ,3 ]
机构
[1] Chongqing Univ Posts & Telecommun, Dept Comp Sci & Technol, Chongqing 400065, Peoples R China
[2] Univ Elect Sci & Technol China, Dept Informat & Software Engn, Chengdu 610054, Sichuan, Peoples R China
[3] Chinese Acad Sci, Chongqing Inst Green & Intelligent Technol, Chongqing 401122, Peoples R China
来源
IEEE ACCESS | 2018年 / 6卷
基金
中国国家自然科学基金;
关键词
Big data; deadline-constrained; energy-efficient; Spark application; tasks scheduling algorithm; SPARK;
D O I
10.1109/ACCESS.2018.2855720
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Nowadays, big data analytics has been widely applied in addressing the growing cybercrime threats. However, energy consumption is explosive increasing with the fast growth of big data processing in anti-cybercrime. In this paper, an energy-efficient framework for big data applications is proposed to reduce energy consumption while satisfying deadline constrains. First, the problem of energy-efficient tasks scheduling of a single Spark job is modeled as an integer program. We design an energy-efficient tasks scheduling algorithm to minimize the energy consumption for big data application in Spark. To avoid service-level agreement violations for execution time, we propose an optimal task scheduling algorithm with deadline constrains by tradingoff execution time and energy consumption. Experiments on a Spark cluster are performed to determine the energy consumption and execution time for several workloads from the HiBench benchmark suite. Our algorithms consume less energy on average than FIFO and FAIR under deadlines. The optimal algorithm is able to find near optimal tasks schedules to trade off energy consumed and response time benefit in small shuffle partitions.
引用
收藏
页码:40073 / 40084
页数:12
相关论文
共 22 条
  • [1] Adedayo OM, 2016, 2016 IEEE INTERNATIONAL CONFERENCE ON CYBERCRIME AND COMPUTER FORENSIC (ICCCF)
  • [2] [Anonymous], 2013, P INT GREEN COMP C
  • [3] [Anonymous], SPARK INTERNALS DESI
  • [4] Cloud computing and emerging IT platforms: Vision, hype, and reality for delivering computing as the 5th utility
    Buyya, Rajkumar
    Yeo, Chee Shin
    Venugopal, Srikumar
    Broberg, James
    Brandic, Ivona
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2009, 25 (06): : 599 - 616
  • [5] Chen H, 2015, 2015 IEEE 40TH LOCAL COMPUTER NETWORKS CONFERENCE WORKSHOPS (LCN WORKSHOPS), P708, DOI 10.1109/LCNW.2015.7365918
  • [6] A Parallel Random Forest Algorithm for Big Data in a Spark Cloud Computing Environment
    Chen, Jianguo
    Li, Kenli
    Tang, Zhuo
    Bilal, Kashif
    Yu, Shui
    Weng, Chuliang
    Li, Keqin
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2017, 28 (04) : 919 - 933
  • [7] Dhaka P, 2016, 2016 IEEE INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND AUTOMATION (ICCCA), P117, DOI 10.1109/CCAA.2016.7813701
  • [8] Gibilisco GP, 2016, IEEE INT CONF CLOUD, P188, DOI [10.1109/CLOUD.2016.32, 10.1109/CLOUD.2016.0034]
  • [9] Dynamic Configuration of Partitioning in Spark Applications
    Gounaris, Anastasios
    Kougka, Georgia
    Tous, Ruben
    Montes, Carlos Tripiana
    Torres, Jordi
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2017, 28 (07) : 1891 - 1904
  • [10] Huang JD, 2017, ACSR ADV COMPUT, V81, P1