Double-buffered, heterogeneous CPU plus GPU integral digestion algorithm for single-excitation calculations involving a large number of excited states

被引:6
|
作者
Morrison, Adrian F. [1 ,2 ]
Epifanovsky, Evgeny [2 ]
Herbert, John M. [1 ]
机构
[1] Ohio State Univ, Dept Chem & Biochem, Columbus, OH 43210 USA
[2] Q Chem Inc, Pleasanton, CA USA
关键词
quantum chemistry; excited states; graphics processing units; TDDFT; GRAPHICAL PROCESSING UNITS; DENSITY-FUNCTIONAL THEORY; QUANTUM-CHEMISTRY CALCULATIONS; MAGNETIC CIRCULAR-DICHROISM; IDENTITY APPROXIMATION; CONFIGURATION-INTERACTION; EFFICIENT IMPLEMENTATION; NONADIABATIC DYNAMICS; ELECTRONIC-STRUCTURE; CHARGE-TRANSFER;
D O I
10.1002/jcc.25531
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The most widely used quantum-chemical models for excited states are single-excitation theories, a category that includes configuration interaction with single substitutions, time-dependent density functional theory, and also a recently developed ab initio exciton model. When a large number of excited states are desired, these calculations incur a significant bottleneck in the digestion step in which two-electron integrals are contracted with density or density-like matrices. We present an implementation that moves this step onto graphical processing units (GPUs), and introduce a double-buffer scheme that minimizes latency by computing integrals on the central processing units (CPUs) concurrently with their digestion on the GPUs. An automatic code generation scheme simplifies the implementation of high-performance GPU kernels. For the exciton model, which requires separate excited-state calculations on each electronically coupled chromophore, the heterogeneous implementation described here results in speedups of 2-6x versus a CPU-only implementation. For traditional time-dependent density functional theory calculations, we obtain speedups of up to 5x when a large number of excited states is computed. (c) 2018 Wiley Periodicals, Inc.
引用
收藏
页码:2173 / 2182
页数:10
相关论文
empty
未找到相关数据