High-performance GPU-accelerated evaluation of electron repulsion integrals

被引:16
作者
Vallejo, Jorge Luis Galvez [1 ]
Barca, Giuseppe M. J. [2 ]
Gordon, Mark S. [1 ]
机构
[1] Iowa State Univ, Dept Chem, Ames, IA 50011 USA
[2] Australian Natl Univ, Dept Comp Sci, Canberra, ACT, Australia
关键词
gpu; quantum chemistry; high performance computing; electronic structure theory; two electron integrals; GRAPHICAL PROCESSING UNITS; 2-ELECTRON INTEGRALS; QUANTUM-CHEMISTRY; IMPLEMENTATION; COMPUTATION; PRECISION; ALGORITHM;
D O I
10.1080/00268976.2022.2112987
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
A novel methodology for the evaluation of two electron integrals up to f functions using Graphics Processing Units (GPUs) is presented. The Head-Gordon-Pople recursion relationships are solved via a simple heuristic methodology to minimize the number of evaluated intermediates in the recursion trees. Automatic code generation is used to generate highly optimized CUDA kernels. A novel approach for f functions is presented in which integral classes are split into smaller subclasses to minimize register pressure and exploit additional parallelism at the cost of recomputing a small number of intermediates. Alongside optimized kernels, the ERI evaluation scheme works in conjunction with an efficient work distribution scheme which guarantees load-balancing during computation. The new HGP scheme shows excellent speedups of 2x to above 60x against existing GPU code. Additionally, when coupled with digestion into the Fock matrix, the scaling is excellent on up to 7 GPUs with an 85% parallel efficiency for the 6-31G(d) basis set. [GRAPHICS] .
引用
收藏
页数:15
相关论文
共 38 条
[1]  
[Anonymous], 2021, JAPAN CAPTURES TOP50
[2]  
[Anonymous], 2011, NVIDIA DEVELOPER
[3]   New Multithreaded Hybrid CPU/GPU Approach to Hartree-Fock [J].
Asadchev, Andrey ;
Gordon, Mark S. .
JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2012, 8 (11) :4166-4176
[4]   Mixed-precision evaluation of two-electron integrals by Rys quadrature [J].
Asadchev, Andrey ;
Gordon, Mark S. .
COMPUTER PHYSICS COMMUNICATIONS, 2012, 183 (08) :1563-1567
[5]   Uncontracted Rys Quadrature Implementation of up to G Functions on Graphical Processing Units [J].
Asadchev, Andrey ;
Allada, Veerendra ;
Felder, Jacob ;
Bode, Brett M. ;
Gordon, Mark S. ;
Windus, Theresa L. .
JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2010, 6 (03) :696-704
[6]  
Barca G.M.J., 2021, P INT C HIGH PERF CO, P1
[7]  
Barca G.M. J., 2020, 2020 SC20 INT C HIGH, P1, DOI DOI 10.1109/SC41405.2020.00085
[8]   Faster Self-Consistent Field (SCF) Calculations on GPU Clusters [J].
Barca, Giuseppe M. J. ;
Alkan, Melisa ;
Galvez-Vallejo, Jorge L. ;
Poole, David L. ;
Rendell, Alistair P. ;
Gordon, Mark S. .
JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2021, 17 (12) :7486-7503
[9]   High-Performance, Graphics Processing Unit-Accelerated Fock Build Algorithm [J].
Barca, Giuseppe M. J. ;
Galvez-Vallejo, Jorge L. ;
Poole, David L. ;
Rendell, Alistair P. ;
Gordon, Mark S. .
JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2020, 16 (12) :7232-7238
[10]   Recent developments in the general atomic and molecular electronic structure system [J].
Barca, Giuseppe M. J. ;
Bertoni, Colleen ;
Carrington, Laura ;
Datta, Dipayan ;
De Silva, Nuwan ;
Deustua, J. Emiliano ;
Fedorov, Dmitri G. ;
Gour, Jeffrey R. ;
Gunina, Anastasia O. ;
Guidez, Emilie ;
Harville, Taylor ;
Irle, Stephan ;
Ivanic, Joe ;
Kowalski, Karol ;
Leang, Sarom S. ;
Li, Hui ;
Li, Wei ;
Lutz, Jesse J. ;
Magoulas, Ilias ;
Mato, Joani ;
Mironov, Vladimir ;
Nakata, Hiroya ;
Pham, Buu Q. ;
Piecuch, Piotr ;
Poole, David ;
Pruitt, Spencer R. ;
Rendell, Alistair P. ;
Roskop, Luke B. ;
Ruedenberg, Klaus ;
Sattasathuchana, Tosaporn ;
Schmidt, Michael W. ;
Shen, Jun ;
Slipchenko, Lyudmila ;
Sosonkina, Masha ;
Sundriyal, Vaibhav ;
Tiwari, Ananta ;
Vallejo, Jorge L. Galvez ;
Westheimer, Bryce ;
Wloch, Marta ;
Xu, Peng ;
Zahariev, Federico ;
Gordon, Mark S. .
JOURNAL OF CHEMICAL PHYSICS, 2020, 152 (15)