Performance measurement and analysis tools for extremely scalable systems

被引:8
|
作者
Mohr, B. [1 ]
Wylie, B. J. N. [1 ]
Wolf, F. [1 ,2 ,3 ]
机构
[1] Forschungszentrum Julich, Inst Adv Simulat, Julich Supercomp Ctr, D-52425 Julich, Germany
[2] German Res Sch Simulat Sci, Aachen, Germany
[3] Rhein Westfal TH Aachen, Dept Comp Sci, Aachen, Germany
关键词
performance analysis; parallel programming; scalability; VISUALIZATION;
D O I
10.1002/cpe.1585
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
High-performance computing systems continue to employ more and more processor cores. Current typical high-end machines in industry, university, and government research laboratory computing centers feature thousands of computing cores. While these machines promise ever more compute power and memory capacity to tackle today's complex simulation problems, they force application developers to greatly enhance the scalability of their codes to be able to exploit it. To better support them in their porting and tuning process, many parallel-tools research groups have already started to work on scaling their methods, techniques, and tools to extreme processor counts. In this paper, we survey existing profiling and tracing tools, report on our experience in using them in extreme scaling environments, review working and promising new methods and techniques, and discuss strategies for solving open issues and problems. Copyright (C) 2010 John Wiley & Sons, Ltd.
引用
收藏
页码:2212 / 2229
页数:18
相关论文
共 50 条
  • [1] Scalable parallel performance measurement and analysis tools - state-of-the-art and future challenges
    Mohr, B.
    Supercomputing Frontiers and Innovations, 2014, 1 (02) : 108 - 123
  • [2] Scalable Automatic Performance Analysis on IBM BlueGene/P Systems
    Oleynik, Yury
    Gerndt, Michael
    EURO-PAR 2011: PARALLEL PROCESSING WORKSHOPS, PT II, 2012, 7156 : 146 - 155
  • [3] A Scalable Infrastructure for Online Performance Analysis on CFD Application
    Hu Kai
    Ding Yi
    Zhang Xinyu
    Jiang Shu
    CHINESE JOURNAL OF AERONAUTICS, 2012, 25 (04) : 546 - 558
  • [4] Performance Measurement Analysis for Multi-Agent Systems
    Nagwani, Naresh Kumar
    IAMA: 2009 INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT & MULTI-AGENT SYSTEMS, 2009, : 14 - 17
  • [5] Reliability analysis of large circuits using scalable techniques and tools
    Bhaduri, Debayan
    Shukla, Sandeep K.
    Graham, Paul S.
    Gokhale, Maya B.
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2007, 54 (11) : 2447 - 2460
  • [6] Tools for scalable parallel program analysis -: Vampir NG and Dewiz
    Brunst, H
    Kranzlmüller, D
    Nagel, WE
    DISTRIBUTED AND PARALLEL SYSTEMS: CLUSTER AND GRID COMPUTING, 2005, 777 : 93 - 102
  • [7] Workload Characterization an Essential Step in Computer Systems Performance Analysis - Methodology and Tools
    Cheveresan, Razvan T.
    Holban, Stefan
    ADVANCES IN ELECTRICAL AND COMPUTER ENGINEERING, 2009, 9 (03) : 100 - 106
  • [8] Tools for scalable parallel program analysis: Vampir NG, MARMOT, and DeWiz
    Brunst, Holger
    Kranzlmueller, Dieter
    Mueller, Matthias S.
    Nagel, Wolfgang E.
    INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2009, 4 (03) : 149 - 161
  • [9] A survey of distributed systems performance evaluation tools
    Santha, S
    Pooch, UW
    INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS I-IV, PROCEEDINGS, 1998, : 1111 - 1119
  • [10] Performance Analysis of Scalable Attack Representation Models
    Hong, Jin B.
    Kim, Dong Seong
    SECURITY AND PRIVACY PROTECTION IN INFORMATION PROCESSING SYSTEMS, 2013, 405 : 330 - 343