Accuracy and performance of graphics processors: A Quantum Monte Carlo application case study

被引:20
作者
Meredith, Jeremy S. [1 ]
Alvarez, Gonzalo [1 ]
Maier, Thomas A. [1 ]
Schulthess, Thomas C. [1 ]
Vetter, Jeffrey S. [1 ]
机构
[1] Oak Ridge Natl Lab, Oak Ridge, TN 37831 USA
关键词
Graphics processors; Quantum Monte Carlo; Accuracy; Performance; GPU; Parallel computing;
D O I
10.1016/j.parco.2008.12.004
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The tradeoffs of accuracy and performance are as yet an unsolved problem when dealing with Graphics Processing Units (GPUs) as a general-purpose computation device. Their high performance and low cost makes them a desirable target for scientific computation, and new language efforts help address the programming challenges of data parallel algorithms and memory management. But the original task of GPUs - real-time rendering has traditionally kept accuracy as a secondary goal, and sacrifices have sometimes been made as a result. In fact, the widely deployed hardware is generally capable of only single precision arithmetic, and even this accuracy is not necessarily equivalent to that of a commodity CPU. In this paper, we investigate the accuracy and performance characteristics of GPUs, including results from a preproduction double precision-capable GPU. We then accelerate the full Quantum Monte Carlo simulation code DCA++, similarly investigating its tolerance to the precision of arithmetic delivered by GPUs. The results show that while DCA++ has some sensitivity to the arithmetic precision, the single-precision GPU results were comparable to single-precision CPU results. Acceleration of the code on a fully GPU-enabled cluster showed that any remaining inaccuracy in GPU precision was negligible; sufficient accuracy was retained for scientifically meaningful results while still showing significant speedups. (C) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:151 / 163
页数:13
相关论文
共 23 条
  • [1] *AMD, 2008, AMD ATI CLOS MET CTM
  • [2] Quantum Monte Carlo on graphical processing units
    Anderson, Amos G.
    Goddard, William A., III
    Schroeder, Peter
    [J]. COMPUTER PHYSICS COMMUNICATIONS, 2007, 177 (03) : 298 - 306
  • [3] Brook for GPUs: Stream computing on graphics hardware
    Buck, I
    Foley, T
    Horn, D
    Sugerman, J
    Fatahalian, K
    Houston, M
    Hanrahan, P
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2004, 23 (03): : 777 - 786
  • [4] FATAHALIAN K, 2006, 2006 ACM IEEE C SUP
  • [5] Fatahalian Kayvon, 2004, Proceedings of the ACM SIGGRAPH/EUROGRAPHICS conference on Graphics hardware, P133
  • [6] GALOPPO N, 2005, P ACM IEEE SUPERCOMP
  • [7] Govindaraju N., 2006, ACM SIGMOD INT C MAN, P325
  • [8] GOVINDARAJU NK, 2005, INT C COMP GRAPH INT
  • [9] HILLESLAND K, ACM WORKSH GEN PURP, pC8
  • [10] MONTE-CARLO METHOD FOR MAGNETIC-IMPURITIES IN METALS
    HIRSCH, JE
    FYE, RM
    [J]. PHYSICAL REVIEW LETTERS, 1986, 56 (23) : 2521 - 2524