Parallel programming with Polaris

被引:117
作者
Blume, W [1 ]
Doallo, R [1 ]
Eigenmann, R [1 ]
Grout, J [1 ]
Hoeflinger, J [1 ]
Lawrence, T [1 ]
Lee, J [1 ]
Padua, D [1 ]
Paek, Y [1 ]
Pottenger, B [1 ]
Rauchwerger, L [1 ]
Tu, P [1 ]
机构
[1] UNIV ILLINOIS,CTR SUPERCOMP RES & DEV,COMPILER GRP,URBANA,IL 61801
关键词
D O I
10.1109/2.546612
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
As we reach the technological limits of hardware improvement, we must rely on multiple processors to improve programming speed. Parallel programming tools are limited, making effective parallel programming difficult and cumbersome. Compilers that translate conventional sequential programs into parallel form would liberate programmers from the complexities of explicit, machine-oriented parallel programming. Polaris, an experimental translator of conventional Fort-ran programs that target machines such as the Cray T3D, is the first step toward this goal. The most important techniques implemented in Polaris resulted from a study of the effectiveness of commercial Fortran parallelizers. The authors compiled the Perfect Benchmarks, a collection of conventional Fortran programs representing the typical workload of high-performance computers, for the Alliant FX/80, an eight-processor multiprocessor popular in the late 1980s. For each program, they measured the quality of the parallelization by computing the speedup. With few exceptions, the Alliant Fortran compiler failed to deliver any significant speedup for the majority of the programs. The compiler failed to produce a speedup because it could not parallelize some of the most important loops in the Perfect Benchmarks. The study showed that extending the four most important analysis and transformation techniques traditionally used for vectorization leads to significant increases in speedup. Polaris detected much of the parallelism available in the set of benchmark codes. A careful analysis of the remaining loops that Polaris could parallelize highlights four areas for improvement.
引用
收藏
页码:78 / &
页数:6
相关论文
共 7 条
  • [1] Blume W., 1994, Proceedings Supercomputing '94 (Cat. No.94CH34819), P528, DOI 10.1109/SUPERC.1994.344316
  • [2] EIGENMANN R, 1993, CONCURRECY PRACT OCT, P553
  • [3] GROUT J, 1995, THESIS U ILLINOIS UR
  • [4] KUCK DJ, 1980, P COMPSAC 80, P709
  • [5] POTTENGER B, 1995, P 9 ACM INT C SUP, P444
  • [6] RAUCHWERGER L, 1995, P SIGPLAN 1995 C PRO, P218
  • [7] TU P, 1993, LECT NOTES COMPUTER, P500