Resource Aware Scheduling for EDA Regression Jobs

被引:0
|
作者
Nanda, Saurav [1 ]
Parthasarathy, Ganapathy [1 ]
Choudhary, Parivesh [1 ]
Venkatachar, Arun [1 ]
机构
[1] Synopsys Inc, Mountain View, CA 94043 USA
来源
EURO-PAR 2019: PARALLEL PROCESSING WORKSHOPS | 2020年 / 11997卷
关键词
Job scheduling; Machine learning; K-means; Adaptive binning; Regression testing; Electronic Design Automation;
D O I
10.1007/978-3-030-48340-1_49
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Typical Integrated Circuit (IC) design projects use Electronic Design Automation (EDA) tool flows to launch thousands of regressions every day on shared compute grids to complete the IC design verification process. These regressions in turn launch compute jobs with varied resource requirements and inter-job dependency constraints. Traditional grid schedulers, such as the Univa Grid Engine (UGE) [12] prioritize fairness over performance to maximize the number of jobs run with equal distribution of resources at any time. A constant challenge in day-to-day operations is to schedule these jobs for minimum overall job completion time so that developers can expect predictable regression turn-around time (TAT). We propose a resource-aware scheduling mechanism that balances performance and fairness for real-word EDA-centric workloads. We present an analysis of historical profile information from a set of regressions with complex inter-job dependencies and highly variable resource requirements to show that many of these regression jobs are well suited for efficient packing on grid machines. We formulate the regression scheduling problem as a variant of the bin packing problem, where the size of bins and balls may vary according to job-resource requirements and differing server configurations on the grid. We propose using two analytic techniques - namely k-means clustering [8] and adaptive binning [10], to solve this problem. We then evaluate the performance of our proposed solution using real workloads from daily regressions on an enterprise compute grid.
引用
收藏
页码:639 / 651
页数:13
相关论文
共 50 条
  • [1] Holistic Slowdown Driven Scheduling and Resource Management for Malleable Jobs
    D'Amico, Marco
    Jokanovic, Ana
    Corbalan, Julita
    PROCEEDINGS OF THE 48TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP 2019), 2019,
  • [2] Resource intensity aware job scheduling in a distributed cloud
    Huang Daochao
    Zhu Chunge
    Zhang Hong
    Liu Xinran
    CHINA COMMUNICATIONS, 2014, 11 (02) : 175 - 184
  • [3] ScHeduling of jobs and Adaptive Resource Provisioning (SHARP) approach in cloud computing
    Dinesh Komarasamy
    Vijayalakshmi Muthuswamy
    Cluster Computing, 2018, 21 : 163 - 176
  • [4] ScHeduling of jobs and Adaptive Resource Provisioning (SHARP) approach in cloud computing
    Komarasamy, Dinesh
    Muthuswamy, Vijayalakshmi
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2018, 21 (01): : 163 - 176
  • [5] A performance comparison of data-aware heuristics for scheduling jobs in mobile Grids
    Hirsch, Matias
    Mateos, Cristian
    Rodriguez, Juan M.
    Zunino, Alejandro
    Gari, Yisel
    Monge, David A.
    2017 XLIII LATIN AMERICAN COMPUTER CONFERENCE (CLEI), 2017,
  • [6] Resource Utilization Aware Job Scheduling to Mitigate Performance Variability
    Nichols, Daniel
    Marathe, Aniruddha
    Shoga, Kathleen
    Gamblin, Todd
    Bhatele, Abhinav
    2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2022), 2022, : 335 - 345
  • [7] Data-Representation Aware Resource Scheduling for Edge Intelligence
    Zeng, Zhi
    Liu, Yuan
    Tang, Weijun
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (12) : 13372 - 13376
  • [8] State-of-the-art Survey of Scheduling and Resource Management Technology for Colocation Jobs
    Wang K.-J.
    Jia T.
    Li Y.
    Ruan Jian Xue Bao/Journal of Software, 2020, 31 (10): : 3100 - 3119
  • [9] Deadline-Aware MapReduce Job Scheduling with Dynamic Resource Availability
    Cheng, Dazhao
    Zhou, Xiaobo
    Xu, Yinggen
    Liu, Liu
    Jiang, Changjun
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2019, 30 (04) : 814 - 826
  • [10] Fregata: A Low-Latency and Resource-Efficient Scheduling for Heterogeneous Jobs in Clouds
    Liu, Jinwei
    2022 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (IEEE BIGCOMP 2022), 2022, : 15 - 22