FINE-GRAINED MULTITHREADING FOR THE MULTIFRONTAL QR FACTORIZATION OF SPARSE MATRICES

被引：33

作者：

Buttari, Alfredo ^{[1
]}

机构：

[1] ENSEEIHT, CNRS IRIT, F-31071 Toulouse, France

来源：

SIAM JOURNAL ON SCIENTIFIC COMPUTING | 2013年 / 35卷 / 04期

关键词：

QR factorization; sparse linear systems; multifrontal method; multicore processors; asynchronous parallelism; SCHEME; MEMORY;

D O I：

10.1137/110846427

中图分类号：

O29 [应用数学];

学科分类号：

070104 ;

摘要：

The advent of multicore processors represents a disruptive event in the history of computer science as conventional parallel programming paradigms are proving incapable of fully exploiting their potential for concurrent computations. The need for different or new programming models clearly arises from recent studies which identify fine-granularity and dynamic execution as the keys to achieving high efficiency on multicore systems. This work presents an approach to the parallelization of the multifrontal method for the QR factorization of sparse matrices specifically designed for multicore based systems. High efficiency is achieved through a fine-grained partitioning of data and a dynamic scheduling of computational tasks relying on a dataflow parallel programming model. Experimental results show that an implementation of the proposed approach achieves higher performance and better scalability than existing equivalent software.

引用

页码：C323 / C345

页数：23

共 31 条

[1]

AMESTOY P.R, 1947, LECT NOTES COMPUT SC, V2000, P122

[2]

Amestoy PR, 1996, NUMER LINEAR ALGEBR, V3, P275

[3]

[Anonymous], ACM T MATH SOFTW

[4]

Bjorck A., 1996, NUMERICAL METHODS LE, DOI DOI 10.1137/1.9781611971484

[5] hwloc: a Generic Framework for Managing Hardware Affinities in HPC Applications [J].

Broquedis, Francois ;

Clet-Ortega, Jerome ;

Moreaud, Stephanie ;

Furmento, Nathalie ;

Goglin, Brice ;

Mercier, Guillaume ;

Thibault, Samuel ;

Namyst, Raymond .

PROCEEDINGS OF THE 18TH EUROMICRO CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING, 2010, :180-186

[6] A portable programming interface for performance evaluation on modern processors [J].

Browne, S ;

Dongarra, J ;

Garner, N ;

Ho, G ;

Mucci, P .

INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2000, 14 (03) :189-204

[7]

Buttari A, 2012, LECT NOTES COMPUT SC, V7134, P226

[8] A class of parallel tiled linear algebra algorithms for multicore architectures [J].

Buttari, Alfredo ;

Langou, Julien ;

Kurzak, Jakub ;

Dongarra, Jack .

PARALLEL COMPUTING, 2009, 35 (01) :38-53

[9] Algorithm 832: UMFPACK V4.3 - An unsymmetric-pattern multifrontal method [J].

Davis, TA .

ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2004, 30 (02) :196-199

[10] A column approximate minimum degree ordering algorithm [J].

Davis, TA ;

Gilbert, JR ;

Larimore, SI ;

Ng, EG .

ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2004, 30 (03) :353-376

← 1 2 3 4 →