Accelerating hartree-fock self-consistent field calculation on C86/DCU heterogenous computing platform

被引:0
作者
Qi, Ji [1 ]
Zhang, Huimin [1 ,2 ]
Shan, Dezun [1 ,2 ]
Yang, Minghui [1 ,3 ]
机构
[1] Chinese Acad Sci, Wuhan Inst Phys & Math, Innovat Acad Precis Measurement Sci & Technol, State Key Lab Magnet Resonance Spect & Imaging, Beijing 430071, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Huazhong Univ Sci & Technol, Wuhan Natl Lab Optoelect, Wuhan 430000, Peoples R China
基金
中国国家自然科学基金;
关键词
Quantum chemistry; Self-consistent field; Hartree-Fock; Electron repulsion integrals; Heterogenous parallel computing; C86/deep computing unit; GRAPHICAL PROCESSING UNITS; EXTENDED HUCKEL THEORY; DENSITY-FUNCTIONAL THEORY; QUANTUM-CHEMISTRY CALCULATIONS; REPULSION INTEGRAL EVALUATION; CONFIGURATION-INTERACTION; MASSIVELY-PARALLEL; HIGH-PERFORMANCE; GPU; PRECISION;
D O I
10.1063/1674-0068/cjcp2403028
中图分类号
O64 [物理化学(理论化学)、化学物理学]; O56 [分子物理学、原子物理学];
学科分类号
070203 ; 070304 ; 081704 ; 1406 ;
摘要
In this study, we investigate the efficacy of a hybrid parallel algorithm aiming at enhancing the speed of evaluation of two-electron repulsion integrals (ERI) and Fock matrix generation on the Hygon C86/DCU (deep computing unit) heterogeneous computing platform. Multiple hybrid parallel schemes are assessed using a range of model systems, including those with up to 1200 atoms and 10000 basis functions. The findings of our research reveal that, during Hartree-Fock (HF) calculations, a single DCU exhibits 33.6 speedups over 32 C86 CPU cores. Compared with the efficiency of Wuhan Electronic Structure Package on Intel X86 and NVIDIA A100 computing platform, the Hygon platform exhibits good cost-effectiveness, showing great potential in quantum chemistry calculation and other high-performance scientific computations.
引用
收藏
页码:81 / 94
页数:14
相关论文
共 135 条
[51]   Multinode Multi-GPU Two-Electron Integrals: Code Generation Using the Regent Language [J].
Johnson, K. Grace ;
Mirchandaney, Seema ;
Hoag, Ellis ;
Heirich, Alan ;
Aiken, Alex ;
Martinez, Todd J. .
JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2022, 18 (11) :6522-6536
[52]   nGIA: A novel Greedy Incremental Alignment based algorithm for gene sequence clustering [J].
Ju, Zhen ;
Zhang, Huiling ;
Meng, Jintao ;
Zhang, Jingjing ;
Fan, Jianping ;
Pan, Yi ;
Liu, Weiguo ;
Li, Xuelei ;
Wei, Yanjie .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2022, 136 :221-230
[53]   New Algorithm for Tensor Contractions on Multi-Core CPUs, GPUs, and Accelerators Enables CCSD and EOM-CCSD Calculations with over 1000 Basis Functions on a Single Compute Node [J].
Kaliman, Ilya A. ;
Krylov, Anna I. .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 2017, 38 (11) :842-853
[54]   Arbitrary Angular Momentum Electron Repulsion Integrals with Graphical Processing Units: Application to the Resolution of Identity Hartree-Fock Method [J].
Kalinowski, Jaroslaw ;
Wennmohs, Frank ;
Neese, Frank .
JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2017, 13 (07) :3160-3170
[55]   Massively Parallel Algorithm and Implementation of RI-MP2 Energy Calculation for Peta-Scale Many-Core Supercomputers [J].
Katouda, Michio ;
Naruse, Akira ;
Hirano, Yukihiko ;
Nakajima, Takahito .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 2016, 37 (30) :2623-2633
[56]   Optimizing Tensor Contractions in CCSD(T) for Efficient Execution on GPUs [J].
Kim, Jinsung ;
Sukumaran-Rajam, Aravind ;
Hong, Changwan ;
Panyala, Ajay ;
Srivastava, Rohit Kumar ;
Krishnamoorthy, Sriram ;
Sadayappan, P. .
INTERNATIONAL CONFERENCE ON SUPERCOMPUTING (ICS 2018), 2018, :96-106
[57]   Massively parallel and linear-scaling algorithm for second-order Moller-Plesset perturbation theory applied to the study of supramolecular wires [J].
Kjaergaard, Thomas ;
Baudin, Pablo ;
Bykov, Dmytro ;
Eriksen, Janus Juul ;
Ettenhuber, Patrick ;
Kristensen, Kasper ;
Larkin, Jeff ;
Liakh, Dmitry ;
Pawlowski, Filip ;
Vose, Aaron ;
Wang, Yang Min ;
Jorgensen, Poul .
COMPUTER PHYSICS COMMUNICATIONS, 2017, 212 :152-160
[58]   Hybrid CPU/GPU Integral Engine for Strong-Scaling Ab Initio Methods [J].
Kussmann, Joerg ;
Ochsenfeld, Christian .
JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2017, 13 (07) :3153-3159
[59]   Employing OpenCL to Accelerate Ab Initio Calculations on Graphics Processing Units [J].
Kussmann, Joerg ;
Ochsenfeld, Christian .
JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2017, 13 (06) :2712-2716
[60]   Preselective Screening for Linear-Scaling Exact Exchange-Gradient Calculations for Graphics Processing Units and General Strong-Scaling Massively Parallel Calculations [J].
Kussmann, Joerg ;
Ochsenfeld, Christian .
JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2015, 11 (03) :918-922