EFFICIENT IMPLEMENTATION OF NONLINEAR COMPACT SCHEMES ON MASSIVELY PARALLEL PLATFORMS

被引:8
作者
Ghosh, Debojyoti [1 ]
Constantinescu, Emil M. [1 ]
Brown, Jed [1 ]
机构
[1] Argonne Natl Lab, Div Math & Comp Sci, Argonne, IL 60439 USA
关键词
compact schemes; WENO; CRWENO; high-performance computing; compressible flows; ESSENTIALLY NONOSCILLATORY SCHEMES; HYPERBOLIC CONSERVATION-LAWS; SHOCK-TURBULENCE INTERACTION; DIRECT NUMERICAL-SIMULATION; WENO SCHEME; ALGORITHM; EQUATIONS; SYSTEMS; DISCRETIZATIONS; FLOWS;
D O I
10.1137/140989261
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Weighted nonlinear compact schemes are ideal for simulating compressible, turbulent flows because of their nonoscillatory nature and high spectral resolution. However, they require the solution to banded systems of equations at each time-integration step or stage. We focus on tridiagonal compact schemes in this paper. We propose an efficient implementation of such schemes on massively parallel computing platforms through an iterative substructuring algorithm to solve the tridiagonal system of equations. The key features of our implementation are that it does not introduce any parallelization-based approximations or errors and it involves minimal neighbor-to-neighbor communications. We demonstrate the performance and scalability of our approach on the IBM Blue Gene/Q platform and show that the compact schemes are efficient and have performance comparable to that of standard noncompact finite-difference methods on large numbers of processors (similar to 500, 000) and small subdomain sizes (four points per dimension per processor).
引用
收藏
页码:C354 / C383
页数:30
相关论文
共 50 条
[41]   PARALLEL IMPLEMENTATION AND APPLICATION OF THE MRTD WITH AN EFFICIENT CFS-PML [J].
Liu, Yawen ;
Chen, Yiwang ;
Zhang, Pin .
PROGRESS IN ELECTROMAGNETICS RESEARCH-PIER, 2013, 143 :223-242
[42]   3D magnetotelluric modeling using high-order tetrahedral Nedelec elements on massively parallel computing platforms [J].
Castillo-Reyes, Octavio ;
Modesto, David ;
Queralt, Pilar ;
Marcuello, Alex ;
Ledo, Juanjo ;
Amor-Martin, Adrian ;
de la Puente, Josep ;
Emilio Garcia-Castillo, Luis .
COMPUTERS & GEOSCIENCES, 2022, 160
[43]   Improvement of Convergence to Steady State Solutions of Euler Equations with Weighted Compact Nonlinear Schemes [J].
Zhang, Shu-hai ;
Deng, Xiao-gang ;
Mao, Mei-liang ;
Shu, Chi-wang .
ACTA MATHEMATICAE APPLICATAE SINICA-ENGLISH SERIES, 2013, 29 (03) :449-464
[44]   Solution of nonlinear fractional-order models of nuclear reactor with parallel computing: Implementation on GPU platform [J].
Keluskar, Yugesh C. ;
Singhaniya, Navin G. ;
Vyawahare, Vishwesh A. ;
Jage, Chaitanya S. ;
Patil, Parag ;
Espinosa-Paredes, Gilberto .
ANNALS OF NUCLEAR ENERGY, 2024, 195
[45]   Efficient Implementation of ADER Discontinuous Galerkin Schemes for a Scalable Hyperbolic PDE Engine [J].
Dumbser, Michael ;
Fambri, Francesco ;
Tavelli, Maurizio ;
Bader, Michael ;
Weinzierl, Tobias .
AXIOMS, 2018, 7 (03)
[46]   High Order Well-Balanced Weighted Compact Nonlinear Schemes for Shallow Water Equations [J].
Gao, Zhen ;
Hu, Guanghui .
COMMUNICATIONS IN COMPUTATIONAL PHYSICS, 2017, 22 (04) :1049-1068
[47]   Fast high-accuracy compact conservative difference schemes for solving the nonlinear Schrodinger equation [J].
Almushaira, Mustafa .
JOURNAL OF DIFFERENCE EQUATIONS AND APPLICATIONS, 2022, 28 (01) :10-38
[48]   Efficient Parallel Video Processing Techniques on GPU: From Framework to Implementation [J].
Su, Huayou ;
Wen, Mei ;
Wu, Nan ;
Ren, Ju ;
Zhang, Chunyuan .
SCIENTIFIC WORLD JOURNAL, 2014,
[49]   FORCES NLP: an efficient implementation of interior-point methods for multistage nonlinear nonconvex programs [J].
Zanelli, A. ;
Domahidi, A. ;
Jerez, J. ;
Morari, M. .
INTERNATIONAL JOURNAL OF CONTROL, 2020, 93 (01) :13-29
[50]   Engineering efficient and massively parallel 3D self-reconfiguration using sandboxing, scaffolding and coating [J].
Thalamy, Pierre ;
Piranda, Benoit ;
Bourgeois, Julien .
ROBOTICS AND AUTONOMOUS SYSTEMS, 2021, 146