Runtime adjustment of parallel nested loops

被引:0
|
作者
Duran, A
Silvera, R
Corbalán, J
Labarta, J
机构
[1] Univ Politecn Cataluna, CEPBA, IBM Res Inst, Barcelona, Spain
[2] IBM Toronto Lab, Markham, ON L6G 1C7, Canada
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
OpenMP allows programmers to specify nested parallelism in parallel applications. In the case of scientific applications, parallel loops are the most important source of parallelism. In this paper we present an automatic mechanism to dynamically detect the best way to exploit the parallelism when having nested parallel loops. This mechanism is based on the number of threads, the problem size, and the number of iterations on the loop. To do that, we claim that programmers must specify the potential application parallelism and give the runtime the responsibility to decide the best way to exploit it. We have implemented this mechanism inside the IBM XL runtime library. Evaluation shows that our mechanism dynamically adapts the parallelism generated to the application and runtime parameters, reaching the same speedup as the best static parallelization (with a priori information).
引用
收藏
页码:137 / 147
页数:11
相关论文
共 50 条
  • [1] GROUPING IN NESTED LOOPS FOR PARALLEL EXECUTION ON MULTICOMPUTERS
    KING, CT
    NI, LM
    PROCEEDINGS OF THE 1989 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, VOL 2: SOFTWARE, 1989, : 31 - 38
  • [2] Improving nested loops' ILP on a parallel ASIC design
    Light, R
    Maxfield, W
    Reed, B
    Passos, N
    Sha, EHM
    PARALLEL AND DISTRIBUTED COMPUTING SYSTEMS, 2000, : 105 - 110
  • [3] A PROCESSOR ALLOCATION ALGORITHM FOR NESTED COMBINATIONS OF PARALLEL LOOPS AND CASES
    YAMAMOTO, F
    PROCEEDINGS OF THE 1989 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, VOL 2: SOFTWARE, 1989, : 131 - 138
  • [4] Runtime predictability of loops
    de Alba, MR
    Kaeli, DR
    WWC-4: IEEE INTERNATIONAL WORKSHOP ON WORKLOAD CHARACTERIZATION, 2001, : 91 - 98
  • [5] External adjustment of runtime parameters in time warp synchronized parallel simulators
    Radhakrishnan, R
    Moore, L
    Wilsey, PA
    11TH INTERNATIONAL PARALLEL PROCESSING SYMPOSIUM, PROCEEDINGS, 1997, : 260 - 266
  • [6] DYNAMIC PROCESSOR SELF-SCHEDULING FOR GENERAL PARALLEL NESTED LOOPS
    FANG, ZX
    TANG, PY
    YEW, PC
    ZHU, CQ
    IEEE TRANSACTIONS ON COMPUTERS, 1990, 39 (07) : 919 - 929
  • [8] Optimized unrolling of nested loops
    IBM T. J. Watson Research Center, P.O. Box 704, Yorktown Heights, NY 10598
    Int J Parallel Program, 5 (545-581):
  • [9] Optimized unrolling of nested loops
    Sarkar, V
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2001, 29 (05) : 545 - 581
  • [10] Optimized Unrolling of Nested Loops
    Vivek Sarkar
    International Journal of Parallel Programming, 2001, 29 : 545 - 581