Exploiting a Parametrized Task Graph Model for the Parallelization of a Sparse Direct Multifrontal Solver

被引：3

作者：

Agullo, Emmanuel ^{[1
]}

Bosilca, George ^{[5
]}

Buttari, Alfredo ^{[2
]}

Guermouche, Abdou ^{[3
]}

Lopez, Florent ^{[4
]}

机构：

[1] INRIA, LaBRI, Bordeaux, France

[2] CNRS, IRIT, Toulouse, France

[3] Univ Bordeaux, LaBRI, Bordeaux, France

[4] RAL STFC, Didcot, Oxon, England

[5] Univ Tennessee, Knoxville, TN USA

来源：

EURO-PAR 2016: PARALLEL PROCESSING WORKSHOPS | 2017年 / 10104卷

关键词：

Multicore architectures; Programming models; Runtime system; Parametrized task graph; Numerical scientific library; Sparse direct solver; Multifrontal QR factorization; QR FACTORIZATION;

D O I：

10.1007/978-3-319-58943-5_14

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

The advent of multicore processors requires to reconsider the design of high performance computing libraries to embrace portable and effective techniques of parallel software engineering. One of the most promising approaches consists in abstracting an application as a directed acyclic graph (DAG) of tasks. While this approach has been popularized for shared memory environments by the OpenMP 4.0 standard where dependencies between tasks are automatically inferred, we investigate an alternative approach, capable of describing the DAG of task in a distributed setting, where task dependencies are explicitly encoded. So far this approach has been mostly used in the case of algorithms with a regular data access pattern and we show in this study that it can be efficiently applied to a higly irregular numerical algorithm such as a sparse multifrontal QR method. We present the resulting implementation and discuss the potential and limits of this approach in terms of productivity and effectiveness in comparison with more common parallelization techniques. Although at an early stage of development, preliminary results show the potential of the parallel programming model that we investigate in this work.

引用

页码：175 / 186

页数：12

共 22 条

[11] FINE-GRAINED MULTITHREADING FOR THE MULTIFRONTAL QR FACTORIZATION OF SPARSE MATRICES [J].

Buttari, Alfredo .

SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2013, 35 (04) :C323-C345

[12] A class of parallel tiled linear algebra algorithms for multicore architectures [J].

Buttari, Alfredo ;

Langou, Julien ;

Kurzak, Jakub ;

Dongarra, Jack .

PARALLEL COMPUTING, 2009, 35 (01) :38-53

[13]

Cosnard M., 1995, Proceedings of the Twenty-Eighth Hawaii International Conference on System Sciences, P113, DOI 10.1109/HICSS.1995.375471

[14]

Danalis, 2013, SCALABLE COMPUTING C, P699

[15] The University of Florida Sparse Matrix Collection [J].

Davis, Timothy A. ;

Hu, Yifan .

ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2011, 38 (01)

[16] Algorithm 915, SuiteSparseQR: Multifrontal Multithreaded Rank-Revealing Sparse QR Factorization [J].

Davis, Timothy A. .

ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2011, 38 (01)

[17] THE MULTIFRONTAL SOLUTION OF INDEFINITE SPARSE SYMMETRIC LINEAR-EQUATIONS [J].

DUFF, IS ;

REID, JK .

ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 1983, 9 (03) :302-325

[18]

Hadri B., 2010, Parallel Distributed Processing (IPDPS), 2010 IEEE International Symposium on, P1, DOI DOI 10.1109/IPDPS.2010.5470443

[19] The FLAME approach: From dense linear algebra algorithms to high-performance multi-accelerator implementations [J].

Igual, Francisco D. ;

Chan, Ernie ;

Quintana-Orti, Enrique S. ;

Quintana-Orti, Gregorio ;

van de Geijn, Robert A. ;

Van Zee, Field G. .

JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2012, 72 (09) :1134-1143

[20] A Parallel Sparse Direct Solver via Hierarchical DAG Scheduling [J].

Kim, Kyungjoo ;

Eijkhout, Victor .

ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2014, 41 (01)

← 1 2 3 →