Large-scale simulations on multiple Graphics Processing Units (GPUs) for the direct simulation Monte Carlo method

被引:26
|
作者
Su, C. -C. [1 ]
Smith, M. R. [2 ]
Kuo, F. -A. [1 ,3 ]
Wu, J. -S. [1 ,3 ]
Hsieh, C. -W. [3 ]
Tseng, K. -C. [4 ]
机构
[1] Natl Chiao Tung Univ, Dept Mech Engn, Hsinchu, Taiwan
[2] Natl Cheng Kung Univ, Dept Mech Engn, Tainan 70101, Taiwan
[3] Natl Ctr High Performance Comp, Natl Appl Res Labs, Hsinchu, Taiwan
[4] Natl Space Org, Natl Appl Res Labs, Hsinchu, Taiwan
关键词
Rarefied gas dynamics; Parallel direct simulation Monte Carlo; Graphics Processing Unit (GPU); MPI-CUDA; Very large-scale simulation; IMPLEMENTATION;
D O I
10.1016/j.jcp.2012.07.038
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this study, the application of the two-dimensional direct simulation Monte Carlo (DSMC) method using an MPI-CUDA parallelization paradigm on Graphics Processing Units (GPUs) clusters is presented. An all-device (i.e. GPU) computational approach is adopted where the entire computation is performed on the GPU device, leaving the CPU idle during all stages of the computation, including particle moving, indexing, particle collisions and state sampling. Communication between the GPU and host is only performed to enable multiple-GPU computation. Results show that the computational expense can be reduced by 15 and 185 times when using a single GPU and 16 GPUs respectively when compared to a single core of an Intel Xeon X5670 CPU. The demonstrated parallel efficiency is 75% when using 16 GPUs as compared to a single GPU for simulations using 30 million simulated particles. Finally, several very large-scale simulations in the near-continuum regime are employed to demonstrate the excellent capability of the current parallel DSMC method. (C) 2012 Elsevier Inc. All rights reserved.
引用
收藏
页码:7932 / 7958
页数:27
相关论文
共 19 条
  • [1] Graphics processing unit based direct simulation Monte Carlo
    Gladkov, Denis
    Tapia, Jose-Juan
    Alberts, Samuel
    D'Souza, Roshan M.
    SIMULATION-TRANSACTIONS OF THE SOCIETY FOR MODELING AND SIMULATION INTERNATIONAL, 2012, 88 (06): : 680 - 693
  • [2] Parallel Direct Simulation Monte Carlo Computation Using CUDA on GPUs
    Su, C. -C.
    Hsieh, C. -W.
    Smith, M. R.
    Jermy, M. C.
    Wu, J. -S.
    27TH INTERNATIONAL SYMPOSIUM ON RAREFIED GAS DYNAMICS, 2010, PTS ONE AND TWO, 2011, 1333 : 343 - +
  • [3] Graphics Processing Unit Acceleration and Parallelization of GENESIS for Large-Scale Molecular Dynamics Simulations
    Jung, Jaewoon
    Naurse, Akira
    Kobayashi, Chigusa
    Sugita, Yuji
    JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2016, 12 (10) : 4947 - 4958
  • [4] An augmented Lagrangian interior-point approach for large-scale NLP problems on graphics processing units
    Cao, Yankai
    Seth, Arpan
    Laird, Carl D.
    COMPUTERS & CHEMICAL ENGINEERING, 2016, 85 : 76 - 83
  • [5] The Metropolis Monte Carlo method with CUDA enabled Graphic Processing Units
    Hall, Clifford
    Ji, Weixiao
    Blaisten-Barojas, Estela
    JOURNAL OF COMPUTATIONAL PHYSICS, 2014, 258 : 871 - 879
  • [6] Estimation of the Statistical Error of the Direct Simulation Monte Carlo Method
    Plotnikov, M. Yu.
    Shkarupa, E. V.
    COMPUTATIONAL MATHEMATICS AND MATHEMATICAL PHYSICS, 2010, 50 (02) : 335 - 344
  • [7] Estimation of the statistical error of the direct simulation Monte Carlo method
    M. Yu. Plotnikov
    E. V. Shkarupa
    Computational Mathematics and Mathematical Physics, 2010, 50 : 335 - 344
  • [8] Large-Scale Transient Stability Simulation of Electrical Power Systems on Parallel GPUs
    Jalili-Marandi, Vahid
    Zhou, Zhiyin
    Dinavahi, Venkata
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2012, 23 (07) : 1255 - 1266
  • [9] Large-Scale Transient Stability Simulation of Electrical Power Systems on Parallel GPUs
    Jalili-Marandi, Vahid
    Zhou, Zhiyin
    Dinavahi, Venkata
    2012 IEEE POWER AND ENERGY SOCIETY GENERAL MEETING, 2012,
  • [10] A parallel lattice Boltzmann method for large eddy simulation on multiple GPUs
    Li, Qinjian
    Zhong, Chengwen
    Li, Kai
    Zhang, Guangyong
    Lu, Xiaowei
    Zhang, Qing
    Zhao, Kaiyong
    Chu, Xiaowen
    COMPUTING, 2014, 96 (06) : 479 - 501