Parallel FFT algorithms for high-order approximations on three-dimensional compact stencils

Cited by: 5

Authors:

Gonzales, Ronald [1]

Gryazin, Yury [1]

Lee, Yun Teck [1]

Affiliation:

[1] Idaho State Univ, Dept Math & Stat, 921 S 8th Ave, Stop 8085, Pocatello, ID 83209 USA

Source:

PARALLEL COMPUTING | 2021 / Vol. 103

Keywords:

Compact finite-difference schemes; FFT; Parallel algorithms; OpenMP; MPI; Hybrid

DOI:

10.1016/j.parco.2021.102757

Chinese Library Classification (CLC) number:

TP301 [Theory, methods]

Discipline classification code:

081202

Abstract:

The recent development of multicore technologies on modern desktop computers makes parallelization of proposed numerical approaches a priority in algorithmic research. The main performance improvement of personal computers in the upcoming years will come from the increasing number of cores on modern CPUs. This shifts the focus of algorithmic research from the development of sequential numerical methods to parallel methodology. This paper presents an efficient parallel direct algorithm with near-optimal complexity for the compact fourth- and sixth-order approximations of the three-dimensional Helmholtz equation (Turkel et al., 2013) with the problem coefficient depending on only one of the coordinate directions. The developed method is based on a combination of the separation of variables technique and a Fast Fourier Transform (FFT) type method. Similar direct solvers for lower-order approximations of the two- and three-dimensional Helmholtz equation were considered in several previous publications by the authors and other researchers (see, e.g., Gryazin et al. (2000); Gryazin (2014); Elman and O'Leary (1998); Elman and O'Leary (1999); Toivanen and Wolfmayr (2020)). The authors also consider a generalization of the presented algorithm to the solution of a wide class of linear systems obtained from approximations on compact 27-point three-dimensional stencils on rectangular grids with similar requirements on the stencil coefficients. The general restrictions on the coefficients in the considered class of compact schemes are developed and presented. This class includes the second-, fourth- and sixth-order compact approximation schemes for the three-dimensional Helmholtz equation considered in this paper and in the authors' previous publications (Gryazin et al., 2000; Gryazin, 2014; Gryazin, 2014).
As an example of the diversity of applications of the developed general method, the direct parallel implementation of a compact fourth-order approximation scheme for a convection-diffusion equation is considered. Another goal of this paper is to investigate the scalability of the proposed technique for large linear systems using different parallel programming extensions. The results of implementing this method in OpenMP, MPI and hybrid programming environments on multicore computers and multi-node clusters are presented and discussed. The results demonstrate the high efficiency of the proposed direct solvers for many important applications on structured grids, with the corresponding 27-diagonal matrices of sizes up to 10^11 by 10^11.
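
The combination of separation of variables and an FFT-type transform described in the abstract can be illustrated with a small sketch. This is not the authors' algorithm. It assumes the simpler second-order 7-point stencil (not the compact 27-point fourth- or sixth-order schemes), homogeneous Dirichlet boundary conditions, a uniform grid step h, a coefficient k^2 that depends on z only, and a serial NumPy implementation. All function names (dst1, apply_helmholtz, solve_helmholtz) are hypothetical. A sine transform in x and y decouples the system into independent tridiagonal systems in z, which is the step that a parallel code could distribute across threads or processes.

```python
import numpy as np

def dst1(x, axis):
    """Type-I discrete sine transform along `axis`, computed via a real FFT."""
    x = np.moveaxis(x, axis, -1)
    n = x.shape[-1]
    zeros = np.zeros(x.shape[:-1] + (1,))
    # Odd extension of length 2(n+1): [0, x, 0, -reversed(x)]
    ext = np.concatenate([zeros, x, zeros, -x[..., ::-1]], axis=-1)
    out = -0.5 * np.fft.rfft(ext, axis=-1).imag[..., 1:n + 1]
    return np.moveaxis(out, -1, axis)

def apply_helmholtz(u, k2, h):
    """Apply the 7-point operator (Laplacian + k2(z)) with zero Dirichlet BCs."""
    p = np.pad(u, 1)  # zero padding models the boundary values
    lap = (p[2:, 1:-1, 1:-1] + p[:-2, 1:-1, 1:-1]
           + p[1:-1, 2:, 1:-1] + p[1:-1, :-2, 1:-1]
           + p[1:-1, 1:-1, 2:] + p[1:-1, 1:-1, :-2] - 6.0 * u) / h**2
    return lap + k2[None, None, :] * u

def solve_helmholtz(f, k2, h):
    """Direct solve: sine transforms in x and y, tridiagonal solves in z."""
    nx, ny, nz = f.shape
    # Eigenvalues of the 1D second-difference operator with Dirichlet BCs
    lam_x = (2.0 * np.cos(np.pi * np.arange(1, nx + 1) / (nx + 1)) - 2.0) / h**2
    lam_y = (2.0 * np.cos(np.pi * np.arange(1, ny + 1) / (ny + 1)) - 2.0) / h**2
    fh = dst1(dst1(f, 0), 1)
    # One tridiagonal system in z per (i, j) mode; all modes are independent
    diag = (lam_x[:, None, None] + lam_y[None, :, None]
            + k2[None, None, :] - 2.0 / h**2)
    off = 1.0 / h**2
    # Thomas algorithm, vectorized over all modes (no pivoting: an assumption
    # that holds for the diagonally dominant demo below, not in general)
    c = np.empty_like(fh)
    d = np.empty_like(fh)
    c[..., 0] = off / diag[..., 0]
    d[..., 0] = fh[..., 0] / diag[..., 0]
    for l in range(1, nz):
        denom = diag[..., l] - off * c[..., l - 1]
        c[..., l] = off / denom
        d[..., l] = (fh[..., l] - off * d[..., l - 1]) / denom
    uh = np.empty_like(fh)
    uh[..., -1] = d[..., -1]
    for l in range(nz - 2, -1, -1):
        uh[..., l] = d[..., l] - c[..., l] * uh[..., l + 1]
    # DST-I is its own inverse up to the factor 2/(n+1) per transformed axis
    return dst1(dst1(uh, 0), 1) * 4.0 / ((nx + 1) * (ny + 1))

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    u_true = rng.standard_normal((7, 6, 5))
    k2 = np.linspace(1.0, 4.0, 5)      # coefficient varies with z only
    f = apply_helmholtz(u_true, k2, 1.0 / 8)
    u = solve_helmholtz(f, k2, 1.0 / 8)
    print("max error:", np.abs(u - u_true).max())
```

Because the transformed systems are independent for each pair of modes, the loop over z is the only sequential part in this sketch. How the paper itself partitions the work among OpenMP threads and MPI ranks is not shown here.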


Pages: 10

References: 23 in total

[1] Fast 3D frequency-domain full-waveform inversion with a parallel block low-rank multifrontal direct solver: Application to OBC data from the North Sea [J].

Amestoy, Patrick ;

Brossier, Romain ;

Buttari, Alfredo ;

L'Excellent, Jean-Yves ;

Mary, Theo ;

Metivier, Ludovic ;

Miniussi, Alain ;

Operto, Stephane .

GEOPHYSICS, 2016, 81 (06) :R363-R383

[2] Colton D, 2019, Inverse Acoustic and Electromagnetic Scattering Theory

[3] An optimal 13-point finite difference scheme for a 2D Helmholtz equation with a perfectly matched layer boundary condition [J].

Dastour, Hatef ;

Liao, Wenyuan .

NUMERICAL ALGORITHMS, 2021, 86 (03) :1109-1141

[4] Efficient iterative solution of the three-dimensional Helmholtz equation [J].

Elman, HC ;

O'Leary, DP .

JOURNAL OF COMPUTATIONAL PHYSICS, 1998, 142 (01) :163-181

[5] Eigenanalysis of some preconditioned Helmholtz problems [J].

Elman, HC ;

O'Leary, DP .

NUMERISCHE MATHEMATIK, 1999, 83 (02) :231-257

[6] Gordon D, 2009, CMES-COMP MODEL ENG, V53, P23

[7] STABILITY AND FINITE ELEMENT ERROR ANALYSIS FOR THE HELMHOLTZ EQUATION WITH VARIABLE COEFFICIENTS [J].

Graham, I. G. ;

Sauter, S. A. .

MATHEMATICS OF COMPUTATION, 2020, 89 (321) :105-138

[8] Gryazin Y., 2014, ISRN Computational Mathematics, DOI 10.1155/2014/745849

[9] GMRES computation of high frequency electrical field propagation in land mine detection [J].

Gryazin, YA ;

Klibanov, MV ;

Lucas, TR .

JOURNAL OF COMPUTATIONAL PHYSICS, 2000, 158 (01) :98-115

[10] High-order approximation compact schemes for forward subsurface scattering problems [J].

Gryazin, Yury A. .

RADAR SENSOR TECHNOLOGY XVIII, 2014, 9077
