EFFICIENT IMPLEMENTATION OF NONLINEAR COMPACT SCHEMES ON MASSIVELY PARALLEL PLATFORMS

被引:8
作者
Ghosh, Debojyoti [1 ]
Constantinescu, Emil M. [1 ]
Brown, Jed [1 ]
机构
[1] Argonne Natl Lab, Div Math & Comp Sci, Argonne, IL 60439 USA
关键词
compact schemes; WENO; CRWENO; high-performance computing; compressible flows; ESSENTIALLY NONOSCILLATORY SCHEMES; HYPERBOLIC CONSERVATION-LAWS; SHOCK-TURBULENCE INTERACTION; DIRECT NUMERICAL-SIMULATION; WENO SCHEME; ALGORITHM; EQUATIONS; SYSTEMS; DISCRETIZATIONS; FLOWS;
D O I
10.1137/140989261
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Weighted nonlinear compact schemes are ideal for simulating compressible, turbulent flows because of their nonoscillatory nature and high spectral resolution. However, they require the solution to banded systems of equations at each time-integration step or stage. We focus on tridiagonal compact schemes in this paper. We propose an efficient implementation of such schemes on massively parallel computing platforms through an iterative substructuring algorithm to solve the tridiagonal system of equations. The key features of our implementation are that it does not introduce any parallelization-based approximations or errors and it involves minimal neighbor-to-neighbor communications. We demonstrate the performance and scalability of our approach on the IBM Blue Gene/Q platform and show that the compact schemes are efficient and have performance comparable to that of standard noncompact finite-difference methods on large numbers of processors (similar to 500, 000) and small subdomain sizes (four points per dimension per processor).
引用
收藏
页码:C354 / C383
页数:30
相关论文
共 50 条
[31]   Non-overlapping High-accuracy Parallel Closure for Compact Schemes: Application in Multiphysics and Complex Geometry [J].
Sundaram, Prasannabalaji ;
Sengupta, Aditi ;
Suman, Vajjala K. ;
Sengupta, Tapan K. .
ACM TRANSACTIONS ON PARALLEL COMPUTING, 2023, 10 (01)
[32]   Coupled-cluster singles, doubles and perturbative triples with density fitting approximation for massively parallel heterogeneous platforms [J].
Peng, Chong ;
Calvin, Justus A. ;
Valeev, Edward F. .
INTERNATIONAL JOURNAL OF QUANTUM CHEMISTRY, 2019, 119 (12)
[33]   Convergence and efficiency of high order parallel schemes for nonlinear engineering models [J].
Shams, Mudassir ;
Kausar, Nasreen ;
Akguel, Ali ;
El Maalouf, Joseph .
ALEXANDRIA ENGINEERING JOURNAL, 2025, 124 :80-109
[34]   Efficient implementation of high-order WENO schemes with sharing function for solving Euler equations [J].
Liu, Shengping ;
Shen, Yiqing ;
Guo, Shaodong ;
Yong, Heng ;
Ni, Guoxi .
COMPUTERS & FLUIDS, 2023, 251
[35]   Efficient parallel solution of large-scale nonlinear dynamic optimization problems [J].
Word, Daniel P. ;
Kang, Jia ;
Akesson, Johan ;
Laird, Carl D. .
COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2014, 59 (03) :667-688
[36]   A massively parallel GPU-accelerated model for analysis of fully nonlinear free surface waves [J].
Engsig-Karup, A. P. ;
Madsen, Morten G. ;
Glimberg, Stefan L. .
INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN FLUIDS, 2012, 70 (01) :20-36
[37]   High-order compact schemes for incompressible flows: A simple and efficient method with quasi-spectral accuracy [J].
Laizet, Sylvain ;
Lamballais, Eric .
JOURNAL OF COMPUTATIONAL PHYSICS, 2009, 228 (16) :5989-6015
[38]   Efficient and Scalable Functional Dependency Discovery on Distributed Data-Parallel Platforms [J].
Zhu, Guanghui ;
Wang, Qian ;
Tang, Qiwei ;
Gu, Rong ;
Yuan, Chunfeng ;
Huang, Yihua .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2019, 30 (12) :2663-2676
[39]   Efficient implementation of the improved quasi-minimal residual method on massively distributed memory computers [J].
Yang, TR ;
Lin, HX .
SOLVING IRREGULARLY STRUCTURED PROBLEMS IN PARALLEL, 1997, 1253 :80-92
[40]   Fourth-order compact difference schemes for the two-dimensional nonlinear fractional mobile/immobile transport models [J].
Chai, Li ;
Liu, Yang ;
Li, Hong .
COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2021, 100 :1-10