Performance-Aware Fair Scheduling: Exploiting Demand Elasticity of Data Analytics Jobs

被引:0
|
作者
Chen, Chen [1 ]
Wang, Wei [1 ]
Li, Bo [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Efficient resource management is of paramount importance in today's production clusters. In this paper, we identify the demand elasticity of data-parallel jobs. Demand elasticity allows jobs to run with a significantly less amount of resources than they ideally need, at the expense of only a modest performance penalty. Our EC2 experiment using popular Spark benchmark suites confirms that running a job using 50% of demanded slots is sufficient to achieve at least 75% of the ideal performance. We show that such an elasticity is an intrinsic property of data-parallel jobs and can be exploited to speed up average job completion. In this regard, we propose Performance Aware Fair (PAF) scheduler to identify the demand elasticity and use it to improve the average job performance, while still attaining near-optimal isolation guarantee close to fair sharing. PAF starts with a fair allocation and iteratively adjusts it by transferring resources from one job to another, improving the performance of resource-taker without penalizing resource-giver by a noticeable amount. We implemented PAF in Spark and evaluated its effectiveness through both EC2 experiments and large-scale simulations. Evaluation results show that compared with fair allocation, PAF improves the average job performance by 13%, while penalizing resource-givers by no more than 1%.
引用
收藏
页码:504 / 512
页数:9
相关论文
共 50 条
  • [1] PAS: Performance-Aware Job Scheduling for Big Data Processing Systems
    Li, Yiren
    Li, Tieke
    Shen, Pei
    Hao, Liang
    Yang, Jin
    Zhang, Zhengtong
    Chen, Junhao
    Bao, Liang
    SECURITY AND COMMUNICATION NETWORKS, 2022, 2022
  • [2] Performance-Aware Thermal Management via Task Scheduling
    Zhou X.
    Yang J.
    Chrobak M.
    Zhang Y.
    Transactions on Architecture and Code Optimization, 2010, 7 (01): : 1 - 31
  • [3] Performance-Aware Thermal Management via Task Scheduling
    Zhou, Xiuyi
    Yang, Jun
    Chrobak, Marek
    Zhang, Youtao
    ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2010, 7 (01)
  • [4] SPO: A Secure and Performance-aware Optimization for MapReduce Scheduling
    Maleki, Neda
    Rahmani, Amir Masoud
    Conti, Mauro
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2021, 176
  • [5] Performance-aware scheduling of streaming applications using genetic algorithm
    Smirnov, Pavel
    Melnik, Mikhail
    Nasonov, Denis
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE (ICCS 2017), 2017, 108 : 2240 - 2249
  • [6] Performance-aware Scheduling of Multicore Time-critical Systems
    Boudjadar, Jalil
    Kim, Jin Hyun
    Nadjm-Tehrani, Simin
    2016 ACM/IEEE INTERNATIONAL CONFERENCE ON FORMAL METHODS AND MODELS FOR SYSTEM DESIGN (MEMOCODE), 2016, : 105 - 114
  • [7] Towards an Energy Efficient Computing With Coordinated Performance-Aware Scheduling in Large Scale Data Clusters
    Hamandawana, Prince
    Mativenga, Ronnie
    Kwon, Se Jin
    Chung, Tae-Sun
    IEEE ACCESS, 2019, 7 : 140261 - 140277
  • [8] A performance comparison of data-aware heuristics for scheduling jobs in mobile Grids
    Hirsch, Matias
    Mateos, Cristian
    Rodriguez, Juan M.
    Zunino, Alejandro
    Gari, Yisel
    Monge, David A.
    2017 XLIII LATIN AMERICAN COMPUTER CONFERENCE (CLEI), 2017,
  • [9] Performance-Aware Energy Saving for Data Center Networks
    Al-Tarazi, Motassem
    Chang, J. Morris
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2019, 16 (01): : 206 - 219
  • [10] An Efficient and Performance-Aware Big Data Storage System
    Li, Yang
    Guo, Li
    Guo, Yike
    CLOUD COMPUTING AND SERVICES SCIENCE, CLOSER 2012, 2013, 367 : 102 - 116