Parallel scalability of Hartree-Fock calculations

被引:32
作者
Chow, Edmond [1 ]
Liu, Xing [1 ]
Smelyanskiy, Mikhail [2 ]
Hammond, Jeff R. [2 ]
机构
[1] Georgia Inst Technol, Sch Computat Sci & Engn, Atlanta, GA 30332 USA
[2] Intel Corp, Parallel Comp Lab, Santa Clara, CA 95054 USA
基金
美国国家科学基金会;
关键词
PERFORMANCE COMPUTATIONAL CHEMISTRY; DENSITY-MATRIX; ALGORITHM; CONSTRUCTION; PURIFICATION; MULTIPLICATION;
D O I
10.1063/1.4913961
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
Quantum chemistry is increasingly performed using large cluster computers consisting of multiple interconnected nodes. For a fixed molecular problem, the efficiency of a calculation usually decreases as more nodes are used, due to the cost of communication between the nodes. This paper empirically investigates the parallel scalability of Hartree-Fock calculations. The construction of the Fock matrix and the density matrix calculation are analyzed separately. For the former, we use a parallelization of Fock matrix construction based on a static partitioning of work followed by a work stealing phase. For the latter, we use density matrix purification from the linear scaling methods literature, but without using sparsity. When using large numbers of nodes for moderately sized problems, density matrix computations are network-bandwidth bound, making purification methods potentially faster than eigendecomposition methods. (C) 2015 AIP Publishing LLC.
引用
收藏
页数:11
相关论文
共 47 条
[1]   A three-dimensional approach to parallel matrix multiplication [J].
Agarwal, RC ;
Balle, SM ;
Gustavson, FG ;
Joshi, M ;
Palkar, P .
IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 1995, 39 (05) :575-582
[2]   COMMUNICATION COMPLEXITY OF PRAMS [J].
AGGARWAL, A ;
CHANDRA, AK ;
SNIR, M .
THEORETICAL COMPUTER SCIENCE, 1990, 71 (01) :3-28
[3]   The distributed data SCF [J].
Alexeev, Y ;
Kendall, RA ;
Gordon, MS .
COMPUTER PHYSICS COMMUNICATIONS, 2002, 143 (01) :69-82
[4]  
Anadakrishnan R., 2012, NUCLEIC ACIDS RES, V40, pW537
[5]   Parallel solution of partial symmetric eigenvalue problems from electronic structure calculations [J].
Auckenthaler, T. ;
Blum, V. ;
Bungartz, H. -J. ;
Huckle, T. ;
Johanni, R. ;
Kraemer, L. ;
Lang, B. ;
Lederer, H. ;
Willems, P. R. .
PARALLEL COMPUTING, 2011, 37 (12) :783-794
[6]   COMMUNICATION EFFICIENT MATRIX MULTIPLICATION ON HYPERCUBES [J].
BERNTSEN, J .
PARALLEL COMPUTING, 1989, 12 (03) :335-342
[7]   Scheduling multithreaded computations by work stealing [J].
Blumofe, RD ;
Leiserson, CE .
JOURNAL OF THE ACM, 1999, 46 (05) :720-748
[8]   O(N) methods in electronic structure calculations [J].
Bowler, D. R. ;
Miyazaki, T. .
REPORTS ON PROGRESS IN PHYSICS, 2012, 75 (03)
[9]  
Cannon L. E, 1969, THESIS MONTANA STATE
[10]   PARALLEL MATRIX AND GRAPH ALGORITHMS [J].
DEKEL, E ;
NASSIMI, D ;
SAHNI, S .
SIAM JOURNAL ON COMPUTING, 1981, 10 (04) :657-675