Parallel program performance prediction using deterministic task graph analysis

被引:39
作者
Adve, VS
Vernon, MK
机构
[1] Univ Illinois, Thomas M Siebel Ctr Comp Sci, Urbana, IL 61801 USA
[2] Univ Wisconsin, Dept Comp Sci, Madison, WI 53706 USA
来源
ACM TRANSACTIONS ON COMPUTER SYSTEMS | 2004年 / 22卷 / 01期
关键词
design; measurement; performance; analytical model; deterministic model; parallel program performance prediction; queueing network; shared memory; task graph; task scheduling;
D O I
10.1145/966785.966788
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this article, we consider analytical techniques for predicting detailed performance characteristics of a single shared memory parallel program for a particular input. Analytical models for parallel programs have been successful at providing simple qualitative insights and bounds on program scalability, but have been less successful in practice for providing detailed insights and metrics for program performance ( leaving these to measurement or simulation). We develop a conceptually simple modeling technique called deterministic task graph analysis that provides detailed performance prediction for shared-memory programs with arbitrary task graphs, a wide variety of task scheduling policies, and significant communication and resource contention. Unlike many previous models that are stochastic models, our model assumes deterministic task execution times ( while retaining the use of stochastic models for communication and resource contention). This assumption is supported by a previous study of the influence of nondeterministic delays in parallel programs. We evaluate our model in three ways. First, an experimental evaluation shows that our analysis technique is accurate and efficient for a variety of shared-memory programs, including programs with large and/or complex task graphs, sophisticated task scheduling, highly nonuniform task times, and significant communication and resource contention. The results also show that the deterministic assumption is crucial to permit accurate and yet efficient analysis of these programs. Second, we use three example programs to illustrate the predictive capabilities of the model. In two cases, broad insights and detailed metrics from the model are used to suggest improvements in load-balancing and the model quickly and accurately predicts the impact of these changes. In the third case, the model provides novel insights into the impact of program design changes that improve communication locality as well as load-balancing, via new ( but general-purpose) metrics. Finally, we present results from a comparison of our model and representative stochastic models, and use these to characterize the conditions under which a deterministic model or stochastic models would be appropriate.
引用
收藏
页码:94 / 136
页数:43
相关论文
共 65 条
[1]  
ADVE V, 2000, P 13 INT WORKSH LANG
[2]  
Adve V. S., 1993, THESIS U WISCONSIN M
[3]   PERFORMANCE ANALYSIS OF MESH INTERCONNECTION NETWORKS WITH DETERMINISTIC ROUTING [J].
ADVE, VS ;
VERNON, MK .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1994, 5 (03) :225-246
[4]  
Adve VS, 2000, IEEE T SOFTWARE ENG, V26, P1027, DOI 10.1109/32.881716
[5]  
ADVE VS, 1993, P 1993 ACM SIGMETRIC, P61
[6]  
ALEXANDROV A., 1995, P 7 ANN S PAR ALG AR
[7]  
Amdahl G., 1967, AFIPS C P, V30, P483, DOI DOI 10.1145/1465482.1465560
[8]  
AMMAR HH, P 1999 INT C PAR PRO, P90
[9]  
[Anonymous], P 4 ACM SIGPLAN S PR
[10]  
BALASUNDARAM V, 1991, P 3 ACM SIGPLAN S PR