Interactive debugging and performance analysis of massively parallel applications

被引:16
作者
Wismuller, R [1 ]
Oberhuber, M [1 ]
Krammer, J [1 ]
Hansen, O [1 ]
机构
[1] PARSYTEC COMP GMBH, D-52035 AACHEN, GERMANY
关键词
parallel programming; performance analysis; debugging; distributed monitoring;
D O I
10.1016/0167-8191(95)00070-4
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In the field of high performance computing, massively parallel processing systems (MPPs) get more and more important. A rising number of complex applications is parallelized for execution on these machines. Still a significant portion of the time needed for parallelization is spent for the process of debugging and performance tuning. A main reason for this fact is the absence of adequate tools supporting this phase of program development. In this article, we present a novel tool environment, consisting of a parallel debugger (DETOP), a performance analyzer (PATOP), and a common monitoring system for PowerPC-based parallel computers. The environment specifically addresses the topics of scalability, usability for dynamic, multithreaded programming models, minimal intrusion, debugging and tuning methodology and comfortable user interfaces. We derive requirements for tools monitoring the runtime behavior of parallel programs, present the concepts used to meet these requirements in our tool environment, and describe its implementation and its usage. DETOP is based on the event-action paradigm and supports both data parallel codes and programs based on functional decomposition. Special features are provided for applications that dynamically create new threads or consist of multiple executables. PATOP supports a systematic search for performance bottlenecks in massively parallel applications using the concept of attributed measurements and distributed evaluation. Both tools are based on a common, distributed on-line monitoring system providing the necessary runtime information.
引用
收藏
页码:415 / 442
页数:28
相关论文
共 21 条
[1]  
BEMMERL T, 1989, LECTURE NOTES COMPUT, V457, P840
[2]  
BEMMERL T, 1994, HIGH PERFORMANCE COM, V2, P394
[3]  
COUCH AL, 1993, J PARALLEL DISTRIBUT, V18, P129
[4]  
FIDGE J, 1988, 11TH P AUSTR COMP SC, P55
[5]   A PROBE EFFECT IN CONCURRENT PROGRAMS [J].
GAIT, J .
SOFTWARE-PRACTICE & EXPERIENCE, 1986, 16 (03) :225-233
[6]  
HABAN D, 1988, 21ST P HAW INT C SYS, V2, P166
[7]  
HANSEN O, 1994, HIGH PERFORMANCE COM, V2, P350
[8]   VISUALIZING THE PERFORMANCE OF PARALLEL PROGRAMS [J].
HEATH, MT ;
ETHERIDGE, JA .
IEEE SOFTWARE, 1991, 8 (05) :29-39
[9]  
HOLLINGSWORTH JK, 1993, P 7 ACM INT C SUP TO
[10]  
*INT CORP, 1991, IPSC 2 IPSC 860 INT