Kd-tree Based N-Body Simulations with Volume-Mass Heuristic on the GPU

被引:8
作者
Kofler, Klaus [1 ]
Steinhauser, Dominik [2 ]
Cosenza, Biagio [1 ]
Grasso, Ivan [1 ]
Schindler, Sabine [2 ]
Fahringer, Thomas [1 ]
机构
[1] Univ Innsbruck, DPS, Inst Comp Sci, Innsbruck, Austria
[2] Univ Innsbruck, Inst Astro & Particle Phys, Innsbruck, Austria
来源
PROCEEDINGS OF 2014 IEEE INTERNATIONAL PARALLEL & DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW) | 2014年
关键词
N-body; GPGPU; Kd-tree; CODE;
D O I
10.1109/IPDPSW.2014.141
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
N-body simulations represent an important class of numerical simulations in order to study a wide range of physical phenomena for which researchers demand fast and accurate implementations. Due to the computational complexity, simple brute-force methods to solve the long-distance interaction between bodies can only be used for small-scale simulations. Smarter approaches utilize neighbor lists, tree methods or other hierarchical data structures to reduce the complexity of the force calculations. However, such data structures have complex building algorithms which hamper their parallelization for GPUs. In this paper, we introduce a novel method to effectively parallelize N-body simulations for GPU architectures. Our method is based on an efficient, three-phase, parallel Kd-tree building algorithm and a novel volume-mass heuristic to reduce the simulation time and increase accuracy. Experiments demonstrate that our approach is the fastest monopole implementation with an accuracy that is comparable with state of the art implementations (GADGET-2). In particular, we are able to reach a simulation speed of up to 3 Mparticles/s on a single GPU for the force calculation, while still having a relative force error below 0.4% for 99% of the particles. We also show competitive performance with existing GPU implementations, while our competitor shows worse accuracy behavior as well as a higher energy error during time integration.
引用
收藏
页码:1257 / 1266
页数:10
相关论文
共 26 条
  • [1] [Anonymous], THESIS
  • [2] [Anonymous], 2013, P 27 INT ACM C INT C, DOI [DOI 10.1145/2464996.2465008.2,21, DOI 10.1145/2464996.2465008]
  • [3] [Anonymous], 2007, GPU GEMS
  • [4] A HIERARCHICAL O(N-LOG-N) FORCE-CALCULATION ALGORITHM
    BARNES, J
    HUT, P
    [J]. NATURE, 1986, 324 (6096) : 446 - 449
  • [5] A sparse octree gravitational N-body code that runs entirely on the GPU processor
    Bedorf, Jeroen
    Gaburov, Evghenii
    Zwart, Simon Portegies
    [J]. JOURNAL OF COMPUTATIONAL PHYSICS, 2012, 231 (07) : 2825 - 2839
  • [6] Brook for GPUs: Stream computing on graphics hardware
    Buck, I
    Foley, T
    Horn, D
    Sugerman, J
    Fatahalian, K
    Houston, M
    Hanrahan, P
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2004, 23 (03): : 777 - 786
  • [7] Towards optimal softening in three-dimensional N-body codes -: I.: Minimizing the force error
    Dehnen, W
    [J]. MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2001, 324 (02) : 273 - 291
  • [8] Elsen Erich., 2006, SC 06, P188, DOI DOI 10.1145/1188455.1188649
  • [9] Hamada T., 2009, SC 09
  • [10] Hamada T., 2010, Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis (SC '10), P1, DOI DOI 10.1109/SC.2010.1