A proposal for a heterogeneous cluster ScaLAPACK (dense linear solvers)

Cited by: 52
Authors
Beaumont, O [1]
Boudet, V [1]
Petitet, A [1]
Rastello, F [1]
Robert, Y [1]
Affiliation
[1] Ecole Normale Super Lyon, INRIA, LIP, CNRS, UMR 5668, F-69364 Lyon 07, France
Keywords
heterogeneous network; heterogeneous grid; different-speed processors; load-balancing; data distribution; data allocation; numerical libraries; numerical linear algebra; heterogeneous platforms; cluster computing
DOI
10.1109/12.956091
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
In this paper, we study the implementation of dense linear algebra kernels, such as matrix multiplication or linear system solvers, on heterogeneous networks of workstations. The uniform block-cyclic data distribution scheme commonly used for homogeneous collections of processors limits the performance of these linear algebra kernels on heterogeneous grids to the speed of the slowest processor. We present and study more sophisticated data allocation strategies that balance the load on heterogeneous platforms with respect to the performance of the processors. When targeting unidimensional grids, the load-balancing problem can be solved rather easily. When targeting two-dimensional grids, which are the key to scalability and efficiency for numerical kernels, the problem turns out to be surprisingly difficult. We formally state the 2D load-balancing problem and prove its NP-completeness. Next, we introduce a data allocation heuristic, which turns out to be very satisfactory: Its practical usefulness is demonstrated by MPI experiments conducted with a heterogeneous network of workstations.
Pages: 1052-1070
Page count: 19
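
The abstract notes that a uniform distribution ties performance to the slowest processor, while the unidimensional load-balancing problem can be solved rather easily. The sketch below illustrates that contrast with a simple greedy allocation: each block goes to the processor that would finish it earliest. This is an illustrative assumption, not necessarily the allocation strategy used in the paper; the names `balance_1d`, `uniform_counts`, and `makespan`, and the cost model (time = number of blocks × per-block cycle time), are hypothetical.

```python
import heapq


def balance_1d(cycle_times, n_blocks):
    """Greedily assign n_blocks to processors with the given per-block cycle times.

    Assumed cost model: a processor holding c blocks needs c * t time units,
    where t is its cycle time. Returns the number of blocks per processor.
    """
    counts = [0] * len(cycle_times)
    # Heap entries: (finish time if this processor receives one more block, index).
    heap = [(t, i) for i, t in enumerate(cycle_times)]
    heapq.heapify(heap)
    for _ in range(n_blocks):
        _, i = heapq.heappop(heap)
        counts[i] += 1
        heapq.heappush(heap, ((counts[i] + 1) * cycle_times[i], i))
    return counts


def uniform_counts(n_procs, n_blocks):
    """Spread blocks as evenly as possible, ignoring processor speed."""
    base, extra = divmod(n_blocks, n_procs)
    return [base + (1 if i < extra else 0) for i in range(n_procs)]


def makespan(cycle_times, counts):
    """Time at which the last processor finishes its share."""
    return max(c * t for c, t in zip(counts, cycle_times))


if __name__ == "__main__":
    times = [1, 2, 4]  # hypothetical cycle times: the third processor is 4x slower
    balanced = balance_1d(times, 7)
    uniform = uniform_counts(len(times), 7)
    print("balanced:", balanced, "makespan:", makespan(times, balanced))
    print("uniform: ", uniform, "makespan:", makespan(times, uniform))
```

With the hypothetical cycle times 1, 2, and 4 and seven blocks, the greedy allocation gives the processors 4, 2, and 1 blocks (makespan 4), whereas the even split of 3, 2, and 2 blocks has makespan 8, set by the slowest processor.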