Automatic On-Line Detection of MPI Application Structure with Event Flow Graphs

被引:5
作者
Aguilar, Xavier [1 ,2 ]
Fuerlinger, Karl [3 ]
Laure, Erwin [1 ,2 ]
机构
[1] KTH Royal Inst Technol, High Performance Comp & Visualizat Dept HPCViz, S-10044 Stockholm, Sweden
[2] Swedish E Sci Res Ctr, S-10044 Stockholm, Sweden
[3] Univ Munich, MNM Team, Dept Comp Sci, D-80538 Munich, Germany
来源
EURO-PAR 2015: PARALLEL PROCESSING | 2015年 / 9233卷
关键词
Application structure detection; Flow graph analysis; Performance monitoring; Online analysis; Automatic loop detection; IDENTIFYING LOOPS; STRUCTURE EXTRACTION;
D O I
10.1007/978-3-662-48096-0_6
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The deployment of larger and larger HPC systems challenges the scalability of both applications and analysis tools. Performance analysis toolsets provide users with means to spot bottlenecks in their applications by either collecting aggregated statistics or generating loss-less time-stamped traces. While obtaining detailed trace information is the best method to examine the behavior of an application in detail, it is infeasible at extreme scales due to the huge volume of data generated. In this context, knowing the application structure, and particularly the nesting of loops in iterative applications is of great importance as it allows, among other things, to reduce the amount of data collected by focusing on important sections of the code. In this paper we demonstrate how the loop nesting structure of an MPI application can be extracted on-line from its event flow graph without the need of any explicit source code instrumentation. We show how this knowledge on the application structure can be used to compute postmortem statistics as well as to reduce the amount of redundant data collected. To that end, we present a usage scenario where this structure information is utilized on-line (while the application runs) to intelligently collect fine-grained data for only a few iterations of an application, considerably reducing the amount of data gathered.
引用
收藏
页码:70 / 81
页数:12
相关论文
共 22 条
  • [1] Visual MPI Performance Analysis using Event Flow Graphs
    Aguilar, Xavier
    Fuerlinger, Karl
    Laure, Erwin
    [J]. INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, ICCS 2015 COMPUTATIONAL SCIENCE AT THE GATES OF NATURE, 2015, 51 : 1353 - 1362
  • [2] Aguilar X, 2014, LECT NOTES COMPUT SC, V8632, P1, DOI 10.1007/978-3-319-09873-9_1
  • [3] Alawneh L., 2012, 2012 IEEE 20th International Conference on Program Comprehension, P133
  • [4] [Anonymous], P 2011 INT C HIGH PE
  • [5] THE NAS PARALLEL BENCHMARKS
    BAILEY, DH
    BARSZCZ, E
    BARTON, JT
    BROWNING, DS
    CARTER, RL
    DAGUM, L
    FATOOHI, RA
    FREDERICKSON, PO
    LASINSKI, TA
    SCHREIBER, RS
    SIMON, HD
    VENKATAKRISHNAN, V
    WEERATUNGA, SK
    [J]. INTERNATIONAL JOURNAL OF SUPERCOMPUTER APPLICATIONS AND HIGH PERFORMANCE COMPUTING, 1991, 5 (03): : 63 - 73
  • [6] AutomaDeD: Automata-Based Debugging for Dissimilar Parallel Tasks
    Bronevetsky, Greg
    Laguna, Ignacio
    Bagchi, Saurabh
    de Supinski, Bronis R.
    Ahn, Dong H.
    Schulz, Martin
    [J]. 2010 IEEE-IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS DSN, 2010, : 231 - 240
  • [7] Casas M, 2007, LECT NOTES COMPUT SC, V4641, P3
  • [8] AUTOMATIC PHASE DETECTION AND STRUCTURE EXTRACTION OF MPI APPLICATIONS
    Casas, Marc
    Badia, Rosa M.
    Labarta, Jesus
    [J]. INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2010, 24 (03) : 335 - 360
  • [9] Fürlinger K, 2010, LECT NOTES COMPUT SC, V6043, P218
  • [10] Automatic Refinement of Parallel Applications Structure Detection
    Gonzalez, Juan
    Huck, Kevin
    Gimenez, Judit
    Labarta, Jesus
    [J]. 2012 IEEE 26TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS & PHD FORUM (IPDPSW), 2012, : 1680 - 1687