Implementation of a non-linear solver on heterogeneous architectures

被引:4
作者
Carracciuolo, Luisa [1 ]
Lapegna, Marco [2 ]
机构
[1] Italian Natl Res Council, I-00185 Rome, Italy
[2] Univ Naples Federico II, Naples, Italy
关键词
CUDA; heterogeneous architectures; MAGMA; performance metrics; solver for non-linear problem; QUASI-NEWTON METHODS; GPU; COMPONENT; ARM;
D O I
10.1002/cpe.4903
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Heterogeneous architectures seem to be not only the present but also the future of the HPC world (eg, see the Exascale Project of U.S. Department of Energy or the European Horizon 2020 FET Proactive - High Performance Computing Call). A lot of work has been done in developing software libraries useful to solve problem described by linear equations on such computing systems. Instead, not the same effort is spent in such context for the implementation of software modules to be used to solve non-linear problem. In this work, we present some experiences related with the implementation of a Quasi-Newton method able to exploit, using a combination of "Task Scheduling," "matrix-free," and "look-ahead" approaches, both the CPUs and the GP-GPUs components of a heterogeneous system.
引用
收藏
页数:19
相关论文
共 47 条
  • [1] Abadi M, 2016, P 21 USENIX S OP SYS
  • [2] [Anonymous], 2011, S APPL ACCELERATORS
  • [3] [Anonymous], 2016, LLNLTR700962 DOE CTR
  • [4] [Anonymous], 1995, Designing and Building Parallel Programs
  • [5] [Anonymous], 2015, ARXIV151201274 CORR
  • [6] A Scalable Numerical Algorithm for Solving Tikhonov Regularization Problems
    Arcucci, Rosella
    D'Amore, Luisa
    Celestino, Simone
    Laccetti, Giuliano
    Murli, Almerico
    [J]. PARALLEL PROCESSING AND APPLIED MATHEMATICS, PPAM 2015, PT II, 2016, 9574 : 45 - 54
  • [7] A Decomposition of the Tikhonov Regularization Functional Oriented to Exploit Hybrid Multilevel Parallelism
    Arcucci, Rossella
    D'Amore, Luisa
    Carracciuolo, Luisa
    Scotti, Giuseppe
    Laccetti, Giuliano
    [J]. INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2017, 45 (05) : 1214 - 1235
  • [8] Balay S., PETSc Web page
  • [9] SCoPE@Scuola: (In)-formative Paths on Topics Related with High Performance, Parallel and Distributed Computing
    Barone, Giovanni Battista
    Boccia, Vania
    Bottalico, Davide
    Carracciuolo, Luisa
    [J]. EURO-PAR 2017: PARALLEL PROCESSING WORKSHOPS, 2018, 10659 : 191 - 202
  • [10] Boccia V, 2012, LECT NOTES COMPUT SC, V7203, P700, DOI 10.1007/978-3-642-31464-3_71