Architecture of an automatically tuned linear algebra library

被引:27
作者
Cuenca, J [1 ]
Giménez, D
González, J
机构
[1] Univ Murcia, Dept Ingn & Tecnol Computadores, E-30071 Murcia, Spain
[2] Univ Murcia, Dept Informat & Sistemas Informat, E-30071 Murcia, Spain
[3] Intel Barcelona Res Ctr, Intel Labs, Barcelona 08034, Spain
关键词
automatic tuning; high performance computing; linear algebra; block methods; polylibraries;
D O I
10.1016/j.parco.2003.11.002
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
One approach for a hierarchical architecture of a set of linear algebra libraries with self-optimisation capacity is shown. In previous works the optimisation of several routines was studied separately, and in this work the ideas applied to individual routines are combined with the classical hierarchy of linear algebra libraries. Each self-optimised library consists of the former routines of the library and additional special routines which obtain information of the characteristics on the system and tune certain parameters of the former routines accordingly. The relationship between libraries of the different levels of the hierarchy is also strengthened. Just as each routine has in its code different calls to lower levels, so this routine will use the self-optimisation information of these other routines to generate its own information. Experiments with routines of different levels and on different kinds of platforms with constant, variable and heterogeneous load have been carried out. The results obtained allow us to conclude that the proposed methodology is valid for obtaining self-optimised linear algebra libraries. (C) 2004 Elsevier B.V. All rights reserved.
引用
收藏
页码:187 / 210
页数:24
相关论文
共 29 条
[1]  
ALBERTI P, 2002, THESIS U POLITECNICA
[2]  
ANDERSEN BS, 1985, LECT NOTE COMPUTER S, V1947, P38
[3]   The GrADS project: Software support for high-level grid application development [J].
Berman, F ;
Chien, A ;
Cooper, K ;
Dongarra, J ;
Foster, I ;
Gannon, D ;
Johnsson, L ;
Kennedy, K ;
Kesselman, C ;
Mellor-Crummey, J ;
Reed, D ;
Torczon, L ;
Wolski, R .
INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2001, 15 (04) :327-344
[4]  
BILMES J, 1997, INT C SUPERCOMPUTING, V3, P340
[5]  
Blackford L. S., 1997, ScaLAPACK user's guide
[6]  
BOUDET V, 1999, P PAR DISTR COMP SYS, P561
[7]  
Boulet P., 1999, Parallel Processing Letters, V9, P197, DOI 10.1142/S0129626499000207
[8]  
BREWER EA, 1995, P 5 ACM SIGPLAN S PR, P80
[9]   Self-adapting software for numerical linear algebra and LAPACK for clusters [J].
Chen, ZZ ;
Dongarra, J ;
Luszczek, P ;
Roche, K .
PARALLEL COMPUTING, 2003, 29 (11-12) :1723-1743
[10]  
Choi J., 1996, SCI PROGRAMMING-NETH, V5, P173