Improving the scalability of a symmetric iterative eigensolver for multi-core platforms

Cited by: 63
Authors
Aktulga, Hasan Metin [1]
Yang, Chao [1]
Ng, Esmond G. [1]
Maris, Pieter [2]
Vary, James P. [2]
Affiliations
[1] Univ Calif Berkeley, Lawrence Berkeley Natl Lab, Computat Res Div, Berkeley, CA 94720 USA
[2] Iowa State Univ, Dept Phys & Astron, Ames, IA 50011 USA
Funding
US National Science Foundation
Keywords
symmetric iterative eigensolvers; topology-aware mapping; hybrid MPI/OpenMP parallelism
DOI
10.1002/cpe.3129
Chinese Library Classification
TP31 [Computer Software]
Discipline classification codes
081202; 0835
Abstract
We describe an efficient and scalable symmetric iterative eigensolver developed for distributed memory multi-core platforms. We achieve over 80% parallel efficiency by major reductions in communication overheads for the sparse matrix-vector multiplication and basis orthogonalization tasks. We show that the scalability of the solver is significantly improved compared to an earlier version, after we carefully reorganize the computational tasks and map them to processing units in a way that exploits the network topology. We discuss the advantage of using a hybrid OpenMP/MPI programming model to implement such a solver. We also present strategies for hiding communication on a multi-core platform. We demonstrate the effectiveness of these techniques by reporting the performance improvements achieved when we apply our solver to large-scale eigenvalue problems arising in nuclear structure calculations. Because sparse matrix-vector multiplication and inner product computation constitute the main kernels in most iterative methods, our ideas are applicable in general to the solution of problems involving large-scale symmetric sparse matrices with irregular sparsity patterns. Copyright (c) 2013 John Wiley & Sons, Ltd.
Pages: 2631-2651
Page count: 21