Parallel Incomplete LU Factorization Based Iterative Solver for Fixed-Structure Linear Equations in Circuit Simulation

被引:4
作者
Li, Lingjie [1 ]
Liu, Zhiqiang [1 ]
Liu, Kan [1 ]
Shen, Shan [1 ]
Yu, Wenjian [1 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Tech, BNRist, Beijing, Peoples R China
来源
2023 28TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE, ASP-DAC | 2023年
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Circuit simulation; incomplete LU factorization; iterative equation solver; parallel computing; SHARED-MEMORY; ALGORITHM;
D O I
10.1145/3566097.3567882
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A series of fixed-structure sparse linear equations are solved in a circuit simulation process. We propose a parallel incomplete LU (ILU) preconditioned GMRES solver for those equations. A new subtree-based scheduling algorithm for ILU factorization and forward/backward substitution is adopted to overcome the load-balancing and data locality problem of the conventional levelizationbased scheduling. Experimental results show that the proposed scheduling algorithm can achieve up to 2.6X speedup for ILU factorization and 3.1X speedup for forward/backward substitution compared to the levelization-based scheduling. The proposed ILU-GMRES solver achieves around 4X parallel speedup with 8 threads, which is up to 2.1X faster than that based on the levelization-based scheme. The proposed parallel solver also shows remarkable advantage over existing methods (including HSPICE) on transient simulation of linear and nonlinear circuits.
引用
收藏
页码:339 / 345
页数:7
相关论文
共 28 条
[21]   ILUT: a Dual Threshold Incomplete LU Factorization [J].
Saad, Yousef .
NUMERICAL LINEAR ALGEBRA WITH APPLICATIONS, 1994, 1 (04) :387-402
[22]  
Shen K., 1998, SPAA '98. Tenth Annual ACM Symposium on Parallel Algorithms and Architectures, P5, DOI 10.1145/277651.277658
[23]  
Synopsys Inc, PrimeSim HSPICE
[24]  
Thornquist Heidi K., 2009, Proceedings of the 2009 IEEE/ACM International Conference on Computer-Aided Design (ICCAD 2009), P410, DOI 10.1145/1687399.1687477
[25]  
Vladimirescu A., 1994, THE SPICE BOOK
[26]   OpenMP parallel implementation of stiffly stable time-stepping projection/GMRES(ILU(0)) implicit simulation of incompressible fluid flows on shared-memory, multicore architecture [J].
Xu, Xiao .
APPLIED MATHEMATICS AND COMPUTATION, 2019, 355 :238-252
[27]   SFLU: Synchronization-Free Sparse LU Factorization for Fast Circuit Simulation on GPUs [J].
Zhao, Jianqi ;
Wen, Yao ;
Luo, Yuchen ;
Jin, Zhou ;
Liu, Weifeng ;
Zhou, Zhenya .
2021 58TH ACM/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2021, :37-42
[28]  
Zhao XQ, 2012, ICCAD-IEEE ACM INT, P429