Performance improvements of differential operators code for MPS method on GPU

Cited by: 0
Authors
Kohei Murotani
Issei Masaie
Takuya Matsunaga
Seiichi Koshizuka
Ryuji Shioya
Masao Ogino
Toshimitsu Fujisawa
Affiliations
[1] The University of Tokyo, Graduate School of Engineering
[2] Prometech Software, Inc.
[3] Toyo University
[4] Nagoya University
Source
Computational Particle Mechanics | 2015 / Vol. 2
Keywords
MPS method; CUDA; Gradient; Laplacian; Differential operators;
DOI
Not available
CLC classification number
Subject classification number
Abstract
In the present study, the particle search and particle interaction calculation steps, which constitute the performance bottleneck of the moving particle simulation (MPS) method, are accelerated by developing GPU-compatible algorithms for many-core processor architectures. For the particle search, the bucket loops of the cell-linked list are restructured to use fewer local variables, and two algorithms for searching particles within a bucket, the linked list and the forward star, are compared. For the particle interaction calculation, the very low ratio of particles actually inside the interaction domain to neighboring particle candidates is improved. With these improvements, a performance efficiency of 24.7 % is achieved for the first-order polynomial approximation scheme using an NVIDIA Tesla K20, CUDA 6.5, and double-precision floating-point operations.
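The two in-bucket storage schemes named in the abstract can be illustrated with a minimal CPU-side sketch. This is not the authors' GPU code: the function names, the 2D setting, and the choice of bucket edge length equal to the interaction radius are assumptions made here for illustration only.

```python
from itertools import product

def bucket_id(p, h, nx):
    """Flat id of the bucket (edge length h) containing 2D point p."""
    return int(p[1] // h) * nx + int(p[0] // h)

def build_linked_list(pos, h, nx, ny):
    """Linked list: head[b] is one particle of bucket b, nxt[i] the next one (-1 ends)."""
    head, nxt = [-1] * (nx * ny), [-1] * len(pos)
    for i, p in enumerate(pos):
        b = bucket_id(p, h, nx)
        nxt[i], head[b] = head[b], i
    return head, nxt

def build_forward_star(pos, h, nx, ny):
    """Forward star: bucket b owns the contiguous slice items[start[b]:start[b+1]]."""
    start = [0] * (nx * ny + 1)
    for p in pos:                       # count particles per bucket
        start[bucket_id(p, h, nx) + 1] += 1
    for b in range(nx * ny):            # prefix sum gives slice offsets
        start[b + 1] += start[b]
    fill, items = start[:-1].copy(), [0] * len(pos)
    for i, p in enumerate(pos):         # scatter particle ids into slices
        b = bucket_id(p, h, nx)
        items[fill[b]] = i
        fill[b] += 1
    return start, items

def adjacent_buckets(b, nx, ny):
    """Ids of the up to 3x3 buckets around bucket b (including b itself)."""
    bx, by = b % nx, b // nx
    return [(by + dy) * nx + (bx + dx)
            for dy, dx in product((-1, 0, 1), repeat=2)
            if 0 <= bx + dx < nx and 0 <= by + dy < ny]

def candidates_linked_list(buckets, head, nxt):
    for b in buckets:
        j = head[b]
        while j != -1:                  # pointer chasing through memory
            yield j
            j = nxt[j]

def candidates_forward_star(buckets, start, items):
    for b in buckets:
        for k in range(start[b], start[b + 1]):   # contiguous reads
            yield items[k]

def neighbors(i, pos, re, cand):
    """Keep only candidates strictly inside the interaction radius re."""
    xi, yi = pos[i]
    return sorted(j for j in cand
                  if j != i and (pos[j][0] - xi) ** 2 + (pos[j][1] - yi) ** 2 < re * re)

# Hypothetical test data: 5 particles in a 3x3 grid of unit buckets.
pos = [(0.1, 0.1), (0.5, 0.1), (1.4, 0.1), (2.9, 2.9), (0.9, 0.9)]
h = re = 1.0
nx = ny = 3
head, nxt = build_linked_list(pos, h, nx, ny)
start, items = build_forward_star(pos, h, nx, ny)

def search_ll(i):
    bs = adjacent_buckets(bucket_id(pos[i], h, nx), nx, ny)
    return neighbors(i, pos, re, candidates_linked_list(bs, head, nxt))

def search_fs(i):
    bs = adjacent_buckets(bucket_id(pos[i], h, nx), nx, ny)
    return neighbors(i, pos, re, candidates_forward_star(bs, start, items))
```

Both layouts return the same neighbors; they differ only in how the candidates are reached in memory. The sketch also shows why the candidate-to-neighbor ratio mentioned in the abstract is low: assuming uniformly distributed particles and a bucket edge equal to the interaction radius, only about 4π/3 / 27 ≈ 15.5 % of the candidates in a 3×3×3 bucket block (π/9 ≈ 35 % in this 2D sketch) lie inside the interaction domain.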
Pages: 261-272
Page count: 11