The ELPA library: scalable parallel eigenvalue solutions for electronic structure theory and computational science

被引:211
作者
Marek, A. [1 ]
Blum, V. [2 ,3 ]
Johanni, R. [1 ,2 ]
Havu, V. [4 ]
Lang, B. [5 ]
Auckenthaler, T. [6 ]
Heinecke, A. [6 ]
Bungartz, H-J [6 ]
Lederer, H. [1 ]
机构
[1] Max Planck Inst Plasma Phys, Rechenzentrum Garching Max Planck Gesell, D-85748 Garching, Germany
[2] Fritz Haber Inst, Max Planck Soc, D-14195 Berlin, Germany
[3] Duke Univ, Dept Mech Engn & Mat Sci, Durham, NC 27708 USA
[4] Aalto Univ, COMP, Dept Appl Phys, FI-00076 Aalto, Finland
[5] Berg Univ Wuppertal, Fachbereich C, D-42097 Wuppertal, Germany
[6] Tech Univ Munich, Fak Informat, D-85748 Garching, Germany
关键词
electronic structure theory; density functional theory; eigenvalue solver; linear algebra; TOTAL-ENERGY CALCULATIONS; CONQUER ALGORITHM; DIVIDE; EIGENVECTORS; EIGENSOLVER; FRAMEWORK; SOFTWARE;
D O I
10.1088/0953-8984/26/21/213201
中图分类号
O469 [凝聚态物理学];
学科分类号
070205 ;
摘要
Obtaining the eigenvalues and eigenvectors of large matrices is a key problem in electronic structure theory and many other areas of computational science. The computational effort formally scales as O(N-3) with the size of the investigated problem, N (e.g. the electron count in electronic structure theory), and thus often defines the system size limit that practical calculations cannot overcome. In many cases, more than just a small fraction of the possible eigenvalue/eigenvector pairs is needed, so that iterative solution strategies that focus only on a few eigenvalues become ineffective. Likewise, it is not always desirable or practical to circumvent the eigenvalue solution entirely. We here review some current developments regarding dense eigenvalue solvers and then focus on the Eigenvalue soLvers for Petascale Applications (ELPA) library, which facilitates the efficient algebraic solution of symmetric and Hermitian eigenvalue problems for dense matrices that have real-valued and complex-valued matrix entries, respectively, on parallel computer platforms. ELPA addresses standard as well as generalized eigenvalue problems, relying on the well documented matrix layout of the Scalable Linear Algebra PACKage (ScaLAPACK) library but replacing all actual parallel solution steps with subroutines of its own. For these steps, ELPA significantly outperforms the corresponding ScaLAPACK routines and proprietary libraries that implement the ScaLAPACK interface (e.g. Intel's MKL). The most time-critical step is the reduction of the matrix to tridiagonal form and the corresponding backtransformation of the eigenvectors. ELPA offers both a one-step tridiagonalization (successive Householder transformations) and a two-step transformation that is more efficient especially towards larger matrices and larger numbers of CPU cores. ELPA is based on the MPI standard, with an early hybrid MPI-OpenMPI implementation available as well. Scalability beyond 10 000 CPU cores for problem sizes arising in the field of electronic structure theory is demonstrated for current highperformance computer architectures such as Cray or Intel/Infiniband. For a matrix of dimension 260 000, scalability up to 295 000 CPU cores has been shown on BlueGene/P.
引用
收藏
页数:15
相关论文
共 67 条
  • [1] ALPATOV P, 1997, P C SUP SC97 VIENN J
  • [2] ANDERSON E., 1999, LAPACK USERSGUIDE, V3rd
  • [3] [Anonymous], 1997, Applied numerical linear algebra
  • [4] [Anonymous], 2010, REVISION
  • [5] [Anonymous], 2012, MATRIX COMPUTATIONS
  • [6] [Anonymous], UCBEECS2013131
  • [7] Parallel solution of partial symmetric eigenvalue problems from electronic structure calculations
    Auckenthaler, T.
    Blum, V.
    Bungartz, H. -J.
    Huckle, T.
    Johanni, R.
    Kraemer, L.
    Lang, B.
    Lederer, H.
    Willems, P. R.
    [J]. PARALLEL COMPUTING, 2011, 37 (12) : 783 - 794
  • [8] AUCKENTHALER T, 2014, PARALLEL CO IN PRESS
  • [9] Anasazi Software for the Numerical Solution of Large-Scale Eigenvalue Problems
    Baker, C. G.
    Hetmaniuk, U. L.
    Lehoucq, R. B.
    Thornquist, H. K.
    [J]. ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2009, 36 (03):
  • [10] BAKER G, 2013, ICPP 98, P98