Accuracy and performance of graphics processors: A Quantum Monte Carlo application case study

被引:20
作者
Meredith, Jeremy S. [1 ]
Alvarez, Gonzalo [1 ]
Maier, Thomas A. [1 ]
Schulthess, Thomas C. [1 ]
Vetter, Jeffrey S. [1 ]
机构
[1] Oak Ridge Natl Lab, Oak Ridge, TN 37831 USA
关键词
Graphics processors; Quantum Monte Carlo; Accuracy; Performance; GPU; Parallel computing;
D O I
10.1016/j.parco.2008.12.004
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The tradeoffs of accuracy and performance are as yet an unsolved problem when dealing with Graphics Processing Units (GPUs) as a general-purpose computation device. Their high performance and low cost makes them a desirable target for scientific computation, and new language efforts help address the programming challenges of data parallel algorithms and memory management. But the original task of GPUs - real-time rendering has traditionally kept accuracy as a secondary goal, and sacrifices have sometimes been made as a result. In fact, the widely deployed hardware is generally capable of only single precision arithmetic, and even this accuracy is not necessarily equivalent to that of a commodity CPU. In this paper, we investigate the accuracy and performance characteristics of GPUs, including results from a preproduction double precision-capable GPU. We then accelerate the full Quantum Monte Carlo simulation code DCA++, similarly investigating its tolerance to the precision of arithmetic delivered by GPUs. The results show that while DCA++ has some sensitivity to the arithmetic precision, the single-precision GPU results were comparable to single-precision CPU results. Acceleration of the code on a fully GPU-enabled cluster showed that any remaining inaccuracy in GPU precision was negligible; sufficient accuracy was retained for scientifically meaningful results while still showing significant speedups. (C) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:151 / 163
页数:13
相关论文
共 23 条
  • [11] HOUSTON M, 2008, 2008 ACM SIGPLAN S P
  • [13] LINDERMAN MD, 2008, 13 INT C ARCH SUPP P
  • [14] Quantum cluster theories
    Maier, T
    Jarrell, M
    Pruschke, T
    Hettler, MH
    [J]. REVIEWS OF MODERN PHYSICS, 2005, 77 (03) : 1027 - 1080
  • [15] Structure of the pairing interaction in the two-dimensional Hubbard model
    Maier, TA
    Jarrell, MS
    Scalapino, DJ
    [J]. PHYSICAL REVIEW LETTERS, 2006, 96 (04)
  • [16] MARK WR, 2003, ACM SIGGRAPH 2003 PA
  • [17] *NVIDIA, NVIDIA CUD HOM
  • [18] Owens J.D., 2005, P EUROGRAPHICS 2005, P21, DOI [10.2312/egst. 20051043, DOI 10.2312/EGST.20051043]
  • [19] Overview of the architecture, circuit design, and physical implementation of a first-generation cell processor
    Pham, DC
    Aipperspach, T
    Boerstler, D
    Bolliger, M
    Chaudhry, R
    Cox, D
    Harvey, P
    Harvey, PM
    Hofstee, HP
    Johns, C
    Kahle, J
    Kameyama, A
    Keaty, J
    Masubuchi, Y
    Pham, M
    Pille, J
    Posluszny, S
    Riley, M
    Stasiak, DL
    Suzuoki, M
    Takahashi, O
    Warnock, J
    Weitzel, S
    Wendel, D
    Yazawa, K
    [J]. IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2006, 41 (01) : 179 - 196
  • [20] Ryoo S., 2008, 13 ACM SIGPLAN S PRI, P73