A distributed-memory MPI parallelization scheme for multi-domain incompressible SPH

被引:5
|
作者
Monteleone A. [1 ]
Burriesci G. [1 ,2 ]
Napoli E. [3 ]
机构
[1] Bioengineering Unit, Ri.MED Foundation, Palermo
[2] UCL Mechanical Engineering, University College London, London
[3] Engineering Department, University of Palermo, Palermo
关键词
Load balancing; MPI; Multi-domain approach; Parallel distributed-memory computation; Smoothed particle hydrodynamics (SPH);
D O I
10.1016/j.jpdc.2022.08.004
中图分类号
学科分类号
摘要
A parallel scheme for a multi-domain truly incompressible smoothed particle hydrodynamics (SPH) approach is presented. The proposed method is developed for distributed-memory architectures through the Message Passing Interface (MPI) paradigm as communication between partitions. The proposal aims to overcome one of the main drawbacks of the SPH method, which is the high computational cost with respect to mesh-based methods, by coupling a multi-resolution approach with parallel computing techniques. The multi-domain approach aims to employ different resolutions by subdividing the computational domain into non-overlapping blocks separated by block interfaces. The particles belonging to different blocks are efficiently distributed among processors ensuring well balanced loads. The parallelization procedure handles particle exchanges both throughout the blocks and the competence domains of the processors. The matching of the velocity values between neighbouring blocks is obtained solving a system of interpolation equations at each block interfaces through a parallelized BiCGSTAB algorithm. Otherwise, a whole pseudo-pressure system is solved in parallel considering the Pressure Poisson equations of the fluid particles of all the blocks and the interpolation equations of all the block interfaces. The employed test cases show the strong reduction of the computational efforts of the SPH method thanks to the interaction of the employed multi-resolution approach and the proposed parallel algorithms. © 2022 Elsevier Inc.
引用
收藏
页码:53 / 67
页数:14
相关论文
共 11 条
  • [1] On distributed memory MPI-based parallelization of SPH codes in massive HPC context
    Oger, G.
    Le Touze, D.
    Guibert, D.
    de Leffe, M.
    Biddiscombe, J.
    Soumagne, J.
    Piccinali, J. -G.
    COMPUTER PHYSICS COMMUNICATIONS, 2016, 200 : 1 - 14
  • [2] Parallelization of Multilevel ILU Preconditioners on Distributed-Memory Multiprocessors
    Aliaga, Jose I.
    Bollhoefer, Matthias
    Martin, Alberto F.
    Quintana-Orti, Enrique S.
    APPLIED PARALLEL AND SCIENTIFIC COMPUTING, PT I, 2012, 7133 : 162 - 172
  • [3] Performing BMMC permutations efficiently on distributed-memory multiprocessors with MPI
    Cormen, TH
    Clippinger, JC
    ALGORITHMICA, 1999, 24 (3-4) : 349 - 370
  • [4] Hierarchical Distributed-Memory Multi-Leader MPI-Allreduce for Deep Learning Workloads
    Truong Thao Nguyen
    Wahib, Mohamed
    Takano, Ryousei
    2018 SIXTH INTERNATIONAL SYMPOSIUM ON COMPUTING AND NETWORKING WORKSHOPS (CANDARW 2018), 2018, : 216 - 222
  • [5] Shared- and distributed-memory parallelization of a Lagrangian atmospheric dispersion model
    Larson, DJ
    Nasstrom, JS
    ATMOSPHERIC ENVIRONMENT, 2002, 36 (09) : 1559 - 1564
  • [6] Hierarchical Dynamic Loop Self-Scheduling on Distributed-Memory Systems Using an MPI plus MPI Approach
    Eleliemy, Ahmed
    Ciorba, Florina M.
    2019 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2019, : 689 - 697
  • [7] 2D gas dynamics problem computation parallelization on unstructured grids on distributed-memory computer
    Barabanov, RA
    Butnev, OI
    Pronin, VA
    Sofronov, ID
    Volkov, SG
    Voronin, BL
    Zhogov, BM
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS I-V, 2000, : 2899 - 2906
  • [8] LBMDTE: Multi-Domain Traffic Engineering in Distributed Software-Defined Networks
    Wang, Kun
    Lv, Guanghong
    COMPUTER COMMUNICATIONS, 2025, 236
  • [9] An effective 3-D fast fourier transform framework for multi-GPU accelerated distributed-memory systems
    Binbin Zhou
    Lu Lu
    The Journal of Supercomputing, 2022, 78 : 17055 - 17073
  • [10] An effective 3-D fast fourier transform framework for multi-GPU accelerated distributed-memory systems
    Zhou, Binbin
    Lu, Lu
    JOURNAL OF SUPERCOMPUTING, 2022, 78 (15) : 17055 - 17073