Performance measurement and analysis tools for extremely scalable systems

被引:8
作者
Mohr, B. [1 ]
Wylie, B. J. N. [1 ]
Wolf, F. [1 ,2 ,3 ]
机构
[1] Forschungszentrum Julich, Inst Adv Simulat, Julich Supercomp Ctr, D-52425 Julich, Germany
[2] German Res Sch Simulat Sci, Aachen, Germany
[3] Rhein Westfal TH Aachen, Dept Comp Sci, Aachen, Germany
关键词
performance analysis; parallel programming; scalability; VISUALIZATION;
D O I
10.1002/cpe.1585
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
High-performance computing systems continue to employ more and more processor cores. Current typical high-end machines in industry, university, and government research laboratory computing centers feature thousands of computing cores. While these machines promise ever more compute power and memory capacity to tackle today's complex simulation problems, they force application developers to greatly enhance the scalability of their codes to be able to exploit it. To better support them in their porting and tuning process, many parallel-tools research groups have already started to work on scaling their methods, techniques, and tools to extreme processor counts. In this paper, we survey existing profiling and tracing tools, report on our experience in using them in extreme scaling environments, review working and promising new methods and techniques, and discuss strategies for solving open issues and problems. Copyright (C) 2010 John Wiley & Sons, Ltd.
引用
收藏
页码:2212 / 2229
页数:18
相关论文
共 50 条
  • [31] Distributed and scalable message transport service for high performance multi-agent systems
    Bashir, S
    Rehman, MU
    Ahmad, HF
    Ali, A
    Suguri, H
    2004 INTERNATIONAL NETWORKING AND COMMUNICATIONS CONFERENCE, PROCEEDINGS, 2004, : 152 - 157
  • [32] ENABLING SCALABLE HIGH-PERFORMANCE SYSTEMS WITH THE INTEL OMNI-PATH ARCHITECTURE
    Birrittella, Mark S.
    Debbage, Mark
    Huggahalli, Ram
    Kunz, James
    Lovett, Tom
    Rimmer, Todd
    Underwood, Keith D.
    Zak, Robert C.
    IEEE MICRO, 2016, 36 (04) : 38 - 47
  • [33] High Performance and Scalable Virtual Machine Storage I/O Stack for Multicore Systems
    Zhang, Diming
    Wu, Hao
    Xue, Fei
    Chen, Liangqiang
    Huang, Hao
    2017 IEEE 23RD INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2017, : 292 - 301
  • [34] Effective Performance Measurement and Analysis of Multithreaded Applications
    Tallent, Nathan R.
    Mellor-Crummey, John M.
    ACM SIGPLAN NOTICES, 2009, 44 (04) : 229 - 239
  • [35] Building scalable mediator systems
    Reynaud, C
    BUILDING THE INFORMATION SOCIETY, 2004, 156 : 25 - 30
  • [36] A Scalable Monitor for Large Systems
    Andreolini, Mauro
    Pietri, Marcello
    Tosi, Stefania
    Lancellotti, Riccardo
    CLOUD COMPUTING AND SERVICES SCIENCES, CLOSER 2014, 2015, 512 : 100 - 116
  • [37] Scalable Performance Analysis of Epidemic Routing Considering Skewed Location Visiting Preferences
    Rashidi, Leila
    Dalili-Yazdi, Amir
    Entezari-Maleki, Reza
    Sousa, Leonel
    Movaghar, Ali
    2019 IEEE 27TH INTERNATIONAL SYMPOSIUM ON MODELING, ANALYSIS, AND SIMULATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS (MASCOTS 2019), 2019, : 201 - 213
  • [38] Online Measurement-Based Adaptive Scalable Video Transmission in Energy Harvesting Aided Wireless Systems
    Yang, Jian
    Cai, Weizhe
    Ran, Yongyi
    Xi, Hongsheng
    Hanzo, Lajos
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2017, 66 (07) : 6231 - 6245
  • [39] Scalable subband subsampled radio architecture for millimetre wave communications with performance analysis
    Rakesh, R. T.
    Kutty, Shajahan
    Sen, Debarati
    Das, Goutam
    IET COMMUNICATIONS, 2016, 10 (16) : 2071 - 2083
  • [40] Analysis of task assignment policies in scalable distributed web-server systems
    Colajanni, M
    Yu, PS
    Dias, DM
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1998, 9 (06) : 585 - 600