An MPI-CUDA approach for hypersonic flows with detailed state-to-state air kinetics using a GPU cluster

被引:49
作者
Bonelli, Francesco [1 ,2 ]
Tuttafesta, Michele [3 ]
Colonna, Gianpiero [4 ]
Cutrone, Luigi [5 ]
Pascazio, Giuseppe [1 ,2 ]
机构
[1] Politecn Bari, DMMM, Via Re David 200, I-70125 Bari, Italy
[2] Politecn Bari, CEMeC, Via Re David 200, I-70125 Bari, Italy
[3] Liceo Sci Statale L da Vinci, Via Cala Arciprete 1, I-76011 Bisceglie, BT, Italy
[4] CNR IMIP, Via Amendola 122-D, I-70126 Bari, Italy
[5] CIRA, I-81043 Capua, Italy
关键词
Multi-GPU; GPU cluster; MPI-CUDA; Hypersonic flows; Air state-to-state chemical kinetics; Multi-temperature; DIRECT NUMERICAL-SIMULATION; THERMAL RATE CONSTANTS; DISSOCIATION RATES; VIBRATIONAL-RELAXATION; TRANSPORT-EQUATION; NO FORMATION; IMPLEMENTATION; ACCELERATION; EXCITATION; MODEL;
D O I
10.1016/j.cpc.2017.05.019
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper describes the most advanced results obtained in the context of fluid dynamic simulations of high-enthalpy flows using detailed state-to-state air kinetics. Thermochemical non-equilibrium, typical of supersonic and hypersonic flows, was modeled by using both the accurate state-to-state approach and the multi-temperature model proposed by Park. The accuracy of the two thermochemical non-equilibrium models was assessed by comparing the results with experimental findings, showing better predictions provided by the state-to-state approach. To overcome the huge computational cost of the state-to-state model, a multiple-nodes GPU implementation, based on an MPI-CUDA approach, was employed and a comprehensive code performance analysis is presented. Both the pure MPI-CPU and the MPI-CUDA implementations exhibit excellent scalability performance. GPUs outperform CPUs computing especially when the state-to-state approach is employed, showing speed-ups, of the single GPU with respect to the single-core CPU, larger than 100 in both the case of one MPI process and multiple MPI process. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:178 / 195
页数:18
相关论文
共 65 条
[1]  
Amdahl G., 1967, Proceedings of AFIPS Spring Join Computer Conference, P483
[2]  
[Anonymous], CUDA BY EXAMPLE
[3]  
Bonelli F., 2011, 17 AIAA INT SPAC PLA
[4]   Thermal rate constants of the O-2+N->NO+O reaction based on the (2)A' and (4)A' potential-energy surfaces [J].
Bose, D ;
Candler, GV .
JOURNAL OF CHEMICAL PHYSICS, 1997, 107 (16) :6136-6145
[5]   Thermal rate constants of the N-2+O->NO+N reaction using ab initio (3)A'' and (3)A' potential energy surfaces [J].
Bose, D ;
Candler, GV .
JOURNAL OF CHEMICAL PHYSICS, 1996, 104 (08) :2825-2833
[6]   Explicit integration with GPU acceleration for large kinetic networks [J].
Brock, Benjamin ;
Belt, Andrew ;
Billings, Jay Jay ;
Guidry, Mike .
JOURNAL OF COMPUTATIONAL PHYSICS, 2015, 302 :591-602
[7]   Massively parallel lattice-Boltzmann codes on large GPU clusters [J].
Calore, E. ;
Gabbana, A. ;
Kraus, J. ;
Pellegrini, E. ;
Schifano, S. F. ;
Tripiccione, R. .
PARALLEL COMPUTING, 2016, 58 :1-24
[8]  
Capitelli M, 2016, Fundamentals aspects of plasma chemical physics: Kinetics
[9]  
Capitelli M., 2015, Fundamental aspects of plasma chemical physics: kinetics, V85
[10]  
Capitelli M., 2011, ATOMIC OPTICAL PLASM, V66