A fast algorithm for sparse matrix computations related to inversion

被引:15
作者
Li, S. [1 ]
Wu, W. [2 ]
Darve, E. [1 ,3 ]
机构
[1] Stanford Univ, Inst Computat & Math Engn, Stanford, CA 94305 USA
[2] Stanford Univ, Dept Elect Engn, Stanford, CA 94305 USA
[3] Stanford Univ, Dept Mech Engn, Stanford, CA 94305 USA
关键词
Nested dissection; Green's function; NEGF; Decomposition; Gaussian elimination; Sparse matrix; Inverse; Elimination tree; NESTED DISSECTION; EIGENVECTORS;
D O I
10.1016/j.jcp.2013.01.036
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We have developed a fast algorithm for computing certain entries of the inverse of a sparse matrix. Such computations are critical to many applications, such as the calculation of non-equilibrium Green's functions G(r) and G(<) for nano-devices. The FIND (Fast Inverse using Nested Dissection) algorithm is optimal in the big-O sense. However, in practice, FIND suffers from two problems due to the width-2 separators used by its partitioning scheme. One problem is the presence of a large constant factor in the computational cost of FIND. The other problem is that the partitioning scheme used by FIND is incompatible with most existing partitioning methods and libraries for nested dissection, which all use width-1 separators. Our new algorithm resolves these problems by thoroughly decomposing the computation process such that width-1 separators can be used, resulting in a significant speedup over FIND for realistic devices -up to twelve-fold in simulation. The new algorithm also has the added advantage that desired off-diagonal entries can be computed for free. Consequently, our algorithm is faster than the current state-of-the-art recursive methods for meshes of any size. Furthermore, the framework used in the analysis of our algorithm is the first attempt to explicitly apply the widely-used relationship between mesh nodes and matrix computations to the problem of multiple eliminations with reuse of intermediate results. This framework makes our algorithm easier to generalize, and also easier to compare against other methods related to elimination trees. Finally, our accuracy analysis shows that the algorithms that require back-substitution are subject to significant extra round-off errors, which become extremely large even for some well-conditioned matrices or matrices with only moderately large condition numbers. When compared to these back-substitution algorithms, our algorithm is generally a few orders of magnitude more accurate, and our produced round-off errors stay at a reasonable level. (C) 2013 Elsevier Inc. All rights reserved.
引用
收藏
页码:915 / 945
页数:31
相关论文
共 35 条
  • [1] Amestoy P., 2010, TRPA1059 CERFACS
  • [2] Amestoy P. R., 2009, SIAM LIN ALG C
  • [3] Campbell Y. E., INVERSE MULTIFRONTAL
  • [4] Campbell Y. E., 1995, THESIS U FLORIDA
  • [5] A two-dimensional domain decomposition technique for the simulation of quantum-scale devices
    Cauley, Stephen
    Balakrishnan, Venkataramanan
    Klimeck, Gerhard
    Koh, Cheng-Kok
    [J]. JOURNAL OF COMPUTATIONAL PHYSICS, 2012, 231 (04) : 1293 - 1313
  • [6] A scalable distributed method for quantum-scale device simulation
    Cauley, Stephen
    Jain, Jitesh
    Koh, Cheng-Kok
    Balakrishnan, Venkataramanan
    [J]. JOURNAL OF APPLIED PHYSICS, 2007, 101 (12)
  • [7] Nanoscale device modeling: the Green's function method
    Datta, S
    [J]. SUPERLATTICES AND MICROSTRUCTURES, 2000, 28 (04) : 253 - 278
  • [8] Davis T. A., 2006, DIRECT METHODS SPARS
  • [9] GEORGES NESTED DISSECTION METHOD
    DUFF, IS
    ERISMAN, AM
    REID, JK
    [J]. SIAM JOURNAL ON NUMERICAL ANALYSIS, 1976, 13 (05) : 686 - 695
  • [10] THE MULTIFRONTAL SOLUTION OF INDEFINITE SPARSE SYMMETRIC LINEAR-EQUATIONS
    DUFF, IS
    REID, JK
    [J]. ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 1983, 9 (03): : 302 - 325