Exploiting a Parametrized Task Graph Model for the Parallelization of a Sparse Direct Multifrontal Solver

被引:3
作者
Agullo, Emmanuel [1 ]
Bosilca, George [5 ]
Buttari, Alfredo [2 ]
Guermouche, Abdou [3 ]
Lopez, Florent [4 ]
机构
[1] INRIA, LaBRI, Bordeaux, France
[2] CNRS, IRIT, Toulouse, France
[3] Univ Bordeaux, LaBRI, Bordeaux, France
[4] RAL STFC, Didcot, Oxon, England
[5] Univ Tennessee, Knoxville, TN USA
来源
EURO-PAR 2016: PARALLEL PROCESSING WORKSHOPS | 2017年 / 10104卷
关键词
Multicore architectures; Programming models; Runtime system; Parametrized task graph; Numerical scientific library; Sparse direct solver; Multifrontal QR factorization; QR FACTORIZATION;
D O I
10.1007/978-3-319-58943-5_14
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The advent of multicore processors requires to reconsider the design of high performance computing libraries to embrace portable and effective techniques of parallel software engineering. One of the most promising approaches consists in abstracting an application as a directed acyclic graph (DAG) of tasks. While this approach has been popularized for shared memory environments by the OpenMP 4.0 standard where dependencies between tasks are automatically inferred, we investigate an alternative approach, capable of describing the DAG of task in a distributed setting, where task dependencies are explicitly encoded. So far this approach has been mostly used in the case of algorithms with a regular data access pattern and we show in this study that it can be efficiently applied to a higly irregular numerical algorithm such as a sparse multifrontal QR method. We present the resulting implementation and discuss the potential and limits of this approach in terms of productivity and effectiveness in comparison with more common parallelization techniques. Although at an early stage of development, preliminary results show the potential of the parallel programming model that we investigate in this work.
引用
收藏
页码:175 / 186
页数:12
相关论文
共 22 条
[1]  
Agullo E., 2011, GPU Comput Gems Jade Edition, V2, P473
[2]   Implementing Multifrontal Sparse Solvers for Multicore Architectures with Sequential Task Flow Runtime Systems [J].
Agullo, Emmanuel ;
Buttari, Alfredo ;
Guermouche, Abdou ;
Lopez, Florent .
ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2016, 43 (02)
[3]   Numerical linear algebra on emerging architectures: the PLASMA and MAGMA projects [J].
Agullo, Emmanuel ;
Demmel, Jim ;
Dongarra, Jack ;
Hadri, Bilel ;
Kurzak, Jakub ;
Langou, Julien ;
Ltaief, Hatem ;
Luszczek, Piotr ;
Tomov, Stanimire .
SCIDAC 2009: SCIENTIFIC DISCOVERY THROUGH ADVANCED COMPUTING, 2009, 180
[4]  
Allen Randy, 2002, OPTIMIZING COMPILERS
[5]  
Amestoy PR, 1996, NUMER LINEAR ALGEBR, V3, P275
[6]  
Ayguadé E, 2009, LECT NOTES COMPUT SC, V5704, P851, DOI 10.1007/978-3-642-03869-3_79
[7]   Parallelizing dense and banded linear algebra libraries using SMPSs [J].
Badia, Rosa M. ;
Herrero, Jose R. ;
Labarta, Jesus ;
Perez, Josep M. ;
Quintana-Orti, Enrique S. ;
Quintana-Orti, Gregorio .
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2009, 21 (18) :2438-2456
[8]  
Bosilca G., 2011, 2011 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum, P1432, DOI 10.1109/IPDPS.2011.299
[9]  
Bosilca G., 2011, 2011 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum, P1151, DOI 10.1109/IPDPS.2011.281
[10]   PaRSRC: Exploiting Heterogeneity to Enhance Scalability [J].
Bosilca, George ;
Bouteiller, Aurelien ;
Danalis, Anthony ;
Faverge, Mathieu ;
Herault, Thomas ;
Dongarra, Jack J. .
COMPUTING IN SCIENCE & ENGINEERING, 2013, 15 (06) :36-45