A LOOP TRANSFORMATION THEORY AND AN ALGORITHM TO MAXIMIZE PARALLELISM

被引:243
|
作者
WOLF, ME [1 ]
LAM, MS [1 ]
机构
[1] STANFORD UNIV,COMP SYST LAB,STANFORD,CA 94305
关键词
D O I
10.1109/71.97902
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper proposes a new approach to transformations for general loop nests. In this approach, we unify all combinations of loop interchange, skewing and reversal as unimodular transformations. The use of matrices to model transformations has previously been applied only to those loop nests whose dependences can be summarized by distance vectors. Our technique is applicable to general loop nests where the dependences include both distances and directions. This theory provides the foundation for solving an open question in compilation for parallel machines: which loop transformations, and in what order, should be applied to achieve a particular goal, such as maximizing parallelism or data locality. This paper presents an efficient loop transformation algorithm based on this theory to maximize the degree of parallelism in a loop nest.
引用
收藏
页码:452 / 471
页数:20
相关论文
共 50 条
  • [1] Loop striping: Maximize parallelism for nested loops
    Xue, Chun
    Shao, Zili
    Liu, Meilin
    Qiu, Meikang
    Sha, Edwin H-M.
    EMBEDDED AND UBIQUITOUS COMPUTING, PROCEEDINGS, 2006, 4096 : 405 - 414
  • [2] Maximize Parallelism for Nested Loops via Loop Striping
    Xue, Chun
    Shao, Zili
    Zhuge, Qingfeng
    Liu, Meilin
    Qiu, Meikang
    Sha, Edwin H. -M.
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2006, 6 (5A): : 168 - 178
  • [3] Affine partitioning algorithm to maximize parallelism and minimize communication
    Lim, Amy W.
    Cheong, Gerald I.
    Lam, Monica S.
    Proceedings of the International Conference on Supercomputing, 1999, : 228 - 237
  • [4] Maximize Parallelism Minimize Overhead for Nested Loops via Loop Striping
    Chun Xue
    Zili Shao
    Edwin H.-M. Sha
    The Journal of VLSI Signal Processing Systems for Signal, Image, and Video Technology, 2007, 47 : 153 - 167
  • [5] Maximize parallelism minimize overhead for nested loops via loop striping
    Xue, Chun
    Sha, Edwin H. -M.
    Shao, Zili
    JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2007, 47 (02): : 153 - 167
  • [6] Loop-synthesizing transformation for maintaining parallelism and enhancing locality
    Lee, S
    Aso, H
    2003 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING WORKSHOPS, PROCEEDINGS, 2003, : 156 - 163
  • [7] Tiling Stencil Computations to Maximize Parallelism
    Bandishti, Vinayaka
    Pananilath, Irshad
    Bondhugula, Uday
    2012 INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SC), 2012,
  • [8] A parametrized loop fusion algorithm for improving parallelism and cache locality
    Singhai, SK
    McKinley, KS
    COMPUTER JOURNAL, 1997, 40 (06): : 340 - 355
  • [9] A loop transformation for maximizing parallelism from single loops with nonuniform dependencies
    Cho, CK
    Shim, JC
    Lee, MH
    HIGH PERFORMANCE COMPUTING ON THE INFORMATION SUPERHIGHWAY - HPC ASIA '97, PROCEEDINGS, 1997, : 696 - 699
  • [10] A Loop Transformation Algorithm for Communication Overlapping
    Kazuaki Ishizaki
    Hideaki Komatsu
    Toshio Nakatani
    International Journal of Parallel Programming, 2000, 28 : 135 - 154