CDES: An approach to HPC Workload Modelling

被引:3
作者
Brennan, John [1 ]
Kureshi, Ibad [1 ]
Holmes, Violeta [2 ]
机构
[1] Univ Huddersfield, HPC Res Grp, Huddersfield, W Yorkshire, England
[2] Univ Durham, Inst Adv Res Comp, Durham, England
来源
2014 IEEE/ACM 18TH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED SIMULATION AND REAL TIME APPLICATIONS (DS-RT 2014) | 2014年
关键词
HPC; WMS; workload modelling; scheduler; HPC simulator;
D O I
10.1109/DS-RT.2014.15
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Computational science and complex system administration relies on being able to model user interactions. When it comes to managing HPC, HTC and grid systems user workloads - their job submission behaviour, is an important metric when designing systems or scheduling algorithms. Most simulators are either inflexible or tied in to proprietary scheduling systems. For system administrators being able to model how a scheduling algorithm behaves or how modifying system configurations can affect the job completion rates is critical. Within computer science research many algorithms are presented with no real description or verification of behaviour. In this paper we are presenting the Cluster Discrete Event Simulator (CDES) as an strong candidate for HPC workload simulation. Built around an open framework, CDES can take system definitions, multi-platform real usage logs and can be interfaced with any scheduling algorithm through the use of an API. CDES has been tested against 3 years of usage logs from a production level HPC system and verified to a greater than 95% accuracy.
引用
收藏
页码:47 / 54
页数:8
相关论文
共 13 条
[1]   Cluster-based static scheduling: Theory and practice [J].
Boeres, C ;
Rebello, VEF .
14TH SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING, PROCEEDINGS, 2002, :133-140
[2]  
Bonner Stephen, 2013, 2013 Science and Information Conference (SAI), P888
[3]  
Brennan J., 2013, SCALING CAMPUS GRIDS
[4]   GridSim: a toolkit for the modeling and simulation of distributed resource management and scheduling for Grid computing [J].
Buyya, R ;
Murshed, M .
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2002, 14 (13-15) :1175-1220
[5]  
Downey A.B., 1997, MODEL SPEEDUP PARALL
[6]  
Hernndez V, 2010, P INT C GREEN COMP, V2010
[7]  
Holmes V., 2010, J PHYS C SERIES, V256
[8]  
Kureshi I., 2012, LOC P BCI 2012 5 BAL, P51
[9]  
Kureshi I., 2013, INT J ADV COMPUTER S, V3, P64
[10]   Scheduling distributed applications: The SimGrid simulation framework [J].
Legrand, A ;
Marchal, L ;
Casanova, H .
CCGRID 2003: 3RD IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID, PROCEEDINGS, 2003, :138-145