A distributed-memory MPI parallelization scheme for multi-domain incompressible SPH

Cited by: 5

Authors:

Monteleone A. [1]

Burriesci G. [1,2]

Napoli E. [3]

Affiliations:

[1] Bioengineering Unit, Ri.MED Foundation, Palermo

[2] UCL Mechanical Engineering, University College London, London

[3] Engineering Department, University of Palermo, Palermo

Source:

Journal of Parallel and Distributed Computing | 2022 / Vol. 170

Keywords:

Load balancing; MPI; Multi-domain approach; Parallel distributed-memory computation; Smoothed particle hydrodynamics (SPH)

DOI:

10.1016/j.jpdc.2022.08.004

Abstract:

A parallel scheme for a multi-domain, truly incompressible smoothed particle hydrodynamics (SPH) approach is presented. The proposed method is developed for distributed-memory architectures, using the Message Passing Interface (MPI) paradigm for communication between partitions. The proposal aims to overcome one of the main drawbacks of the SPH method, its high computational cost compared with mesh-based methods, by coupling a multi-resolution approach with parallel computing techniques. The multi-domain approach allows different resolutions to be employed by subdividing the computational domain into non-overlapping blocks separated by block interfaces. The particles belonging to different blocks are distributed among processors so that the loads are well balanced. The parallelization procedure handles particle exchanges both between blocks and between the competence domains of the processors. The velocity values of neighbouring blocks are matched by solving a system of interpolation equations at each block interface with a parallelized BiCGSTAB algorithm. By contrast, a single global pseudo-pressure system is solved in parallel, comprising the pressure Poisson equations of the fluid particles of all the blocks and the interpolation equations of all the block interfaces. The test cases show a strong reduction in the computational effort of the SPH method, thanks to the combination of the multi-resolution approach and the proposed parallel algorithms. © 2022 Elsevier Inc.
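
As a rough illustration of the load-balancing idea mentioned in the abstract, the following Python sketch splits particles from two blocks of different resolution into contiguous slabs with near-equal particle counts, one slab per processor. The function name `balanced_slabs`, the one-dimensional slab decomposition, and the particle coordinates are assumptions made for illustration only; the abstract does not state how the paper partitions particles, and no MPI calls are made here.

```python
def balanced_slabs(x_coords, n_ranks):
    """Assign particle indices to n_ranks contiguous slabs along x
    so that slab sizes differ by at most one particle."""
    # Sort particle indices by their x coordinate.
    order = sorted(range(len(x_coords)), key=lambda i: x_coords[i])
    base, extra = divmod(len(order), n_ranks)
    slabs, start = [], 0
    for rank in range(n_ranks):
        # The first `extra` ranks take one additional particle.
        size = base + (1 if rank < extra else 0)
        slabs.append(order[start:start + size])
        start += size
    return slabs


# Hypothetical blocks: a coarse one (10 particles) and a fine one (40 particles).
coarse = [0.1 * i for i in range(10)]
fine = [1.0 + 0.025 * i for i in range(40)]

slabs = balanced_slabs(coarse + fine, 4)
counts = [len(s) for s in slabs]
print(counts)
```

Because the split is by particle count rather than by geometric width, the slabs covering the fine block come out narrower than the one covering the coarse block, which is the kind of behaviour a count-based balance is meant to produce.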


Pages: 53-67

Number of pages: 14
