Fast quantum Monte Carlo on a GPU

被引:14
|
作者
Lutsyshyn, Y. [1 ]
机构
[1] Univ Rostock, Inst Phys, D-18051 Rostock, Germany
关键词
Quantum Monte Carlo; QMC; VMC; GPU; CUDA; Quantum liquid; Liquid He-4; GROUND-STATE; LIQUID-HE-4; SIMULATION;
D O I
10.1016/j.cpc.2014.09.016
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We present a scheme for the parallelization of quantum Monte Carlo method on graphical processing units, focusing on variational Monte Carlo simulation of bosonic systems. We use asynchronous execution schemes with shared memory persistence, and obtain an excellent utilization of the accelerator. The CUDA code is provided along with a package that simulates liquid helium-4. The program was benchmarked on several models of Nvidia CPU, including Fermi GTX560 and M2090, and the Kepler architecture 1(20 GPU. Special optimization was developed for the Kepler cards, including placement of data structures in the register space of the Kepler GPUs. Kepler-specific optimization is discussed. Program Summary Program title: QL Catalogue identifier: AEUP_v1_0 Program summary URL: http://cpc.cs.qub.ac.uk/summaries/AEUP_v1_0.html Program obtainable from: CPC Program Library, Queen's University, Belfast, N. Ireland Licensing provisions: Standard CPC licence, http://cpc.cs.qub.ac.uk/licence/licence.html No. of lines in,distributed program, including test data, etc.: 40170 No. of bytes in distributed program, including test data, etc.: 1223080 Distribution format: tar.gz Programming language: CUDA-C, C, Fortran. Computer: Any computer with a CUDA-enabled GPU. Operating system: Linux. RAM: Typical execution uses as much RAM as is available on the CPU; usually between 1 GB and 12 GB. Minimal requirement is I MB. Classification: 4.12, 7.7. Nature ofproblem: QL package executes variational Monte Carlo for liquid helium-4 with Aziz II interaction potential and a Jastrow pair product trial wavefunction. Sampling is performed with a Metropolis scheme applied to single-particle updates. With minimal changes, the package can be applied to other bosonic fluids, given a pairwise interaction potential and a wavefunction in the form of a product of one- and two-body correlation factors. Solution method: The program is parallelized for execution with Nvidia GPU. By design, the generation of new configurations is performed with shared memory persistence and the asynchronous execution allows for the CPU load masking. Restrictions: Code is limited to variational Monte Carlo. Due to the limitation of the shared memory of GPU, only systems under 2000 particles can be treated on the Fermi generation cards, and up to 10000 on Kepler cards. Running time: Because of the statistical nature of Monte Carlo calculations, computations may be chained indefinitely to improve statistical accuracy. As an example, using the QL package, the energy of a liquid helium system with 1952 atoms can be computed to within 1 mK per atom in less than 20 min. This corresponds to the relative error of 10(-4). It is unlikely that a higher accuracy may be needed. (C) 2014 Published by Elsevier B.V.
引用
收藏
页码:162 / 174
页数:13
相关论文
共 50 条
  • [1] Fast quantum Monte Carlo on a GPU
    Institut für Physik, Universität Rostock, Rostock
    18051, Germany
    Comput Phys Commun, 1600, (162-174):
  • [2] Fast Hamiltonian Monte Carlo Using GPU Computing
    Beam, Andrew L.
    Ghosh, Sujit K.
    Doyle, Jon
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2016, 25 (02) : 536 - 548
  • [3] A Fast Monte Carlo GPU Based Algorithm for Particle Breakage
    Devi, Jherna
    Kruis, F. Einar
    2017 4TH INTERNATIONAL CONFERENCE ON CONTROL, DECISION AND INFORMATION TECHNOLOGIES (CODIT), 2017, : 784 - 789
  • [4] Fast Monte Carlo Dose Calculation on GPU for Proton Therapy
    Jia, X.
    Schuemann, J.
    Paganetti, H.
    Jiang, S.
    INTERNATIONAL JOURNAL OF RADIATION ONCOLOGY BIOLOGY PHYSICS, 2012, 84 (03): : S841 - S841
  • [5] Delayed Update Algorithms for Quantum Monte Carlo Simulation on GPU
    McDaniel, Tyler
    D'Azevedo, Ed
    Li, Ying Wai
    Kent, Paul
    Wong, Ming
    Wong, Kwai
    PROCEEDINGS OF XSEDE16: DIVERSITY, BIG DATA, AND SCIENCE AT SCALE, 2016,
  • [6] Full acceleration of quantum Monte Carlo simulations on GPU clusters
    Esler, Kenneth
    Kim, Jeongnim
    Ceperley, David
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2010, 240
  • [7] Fast and stable determinant quantum Monte Carlo
    Bauer, Carsten
    SCIPOST PHYSICS CORE, 2020, 2 (02):
  • [8] GPU-Monte Carlo Based Fast IMRT Plan Optimization
    Li, Y.
    Tian, Z.
    Shi, F.
    Jiang, S.
    Jia, X.
    MEDICAL PHYSICS, 2014, 41 (06) : 567 - 567
  • [9] A fast GPU code for full Monte Carlo based SPECT reconstruction
    Magnander, T.
    Lagerlof, J. Heydorn
    Hemmingsson, J.
    Bath, M.
    Svensson, J.
    Gjertsson, P.
    Bernhardt, P.
    EUROPEAN JOURNAL OF NUCLEAR MEDICINE AND MOLECULAR IMAGING, 2016, 43 : S43 - S44
  • [10] Monte Carlo integration on GPU
    Kanzaki, J.
    EUROPEAN PHYSICAL JOURNAL C, 2011, 71 (02):