Parallel Sparse Matrix Solution for Circuit Simulation on FPGAs

被引:11
作者
Nechma, Tarek [1 ]
Zwolinski, Mark [1 ]
机构
[1] Univ Southampton, Fac Phys Sci & Engn, Elect & Comp Sci, Southampton SO17 1BJ, Hants, England
关键词
Hardware acceleration; sparse matrices; SPICE; FPGA arithmetic; pipeline and parallel arithmetic and logic structures; FACTORIZATION;
D O I
10.1109/TC.2014.2308202
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
SPICE is the de facto standard for circuit simulation. However, accurate SPICE simulations of today's sub-micron circuits can often take days or weeks on conventional processors. A SPICE simulation is an iterative process that consists of two phases per iteration: model evaluation followed by a matrix solution. The model evaluation phase has been found to be easily parallelizable, unlike the subsequent phase, which involves the solution of highly sparse and asymmetric matrices. In this paper, we present an FPGA implementation of a sparse matrix solver, geared towards matrices that arise in SPICE circuit simulations. Our approach combines static pivoting with symbolic analysis to compute an accurate task flow-graph which efficiently exploits parallelism at multiple granularities and sustains high floating-point data rates. We also present a quantitative comparison between the performance of our hardware prototype and state-of-the-art software packages running on a general-purpose PC. We report average speed-ups of 9.65x, 11.83x, and 17.21x against UMFPACK, KLU, and Kundert Sparse matrix packages, respectively.
引用
收藏
页码:1090 / 1103
页数:14
相关论文
共 33 条
[21]  
Kapre Nachiket, 2009, Proceedings of the 2009 International Conference on Field-Programmable Technology (FPT 2009), P190, DOI 10.1109/FPT.2009.5377665
[22]  
Kundert K.S., 1988, Sparse User's Guide - A Sparse Linear Equation Solver Version 1.3a
[23]  
Li X.S., 1998, SUPERCOMPUTING 98, P1
[24]  
Liao Y. Z., 1983, ACM IEEE 20th Design Automation Conference Proceedings, P107, DOI 10.1109/DAC.1983.1585634
[25]   THE MULTIFRONTAL METHOD FOR SPARSE-MATRIX SOLUTION - THEORY AND PRACTICE [J].
LIU, JWH .
SIAM REVIEW, 1992, 34 (01) :82-109
[26]   A parallel block LU decomposition method for distributed finite element matrices [J].
Maurer, Daniel ;
Wieners, Christian .
PARALLEL COMPUTING, 2011, 37 (12) :742-758
[27]   Efficient reordering for direct methods in analog circuit simulation [J].
Naumann, Ingo ;
Dirks, Heinz K. .
ELECTRICAL ENGINEERING, 2007, 89 (04) :333-337
[28]  
Venetis IE, 2009, CF'09: CONFERENCE ON COMPUTING FRONTIERS & WORKSHOPS, P71
[29]  
Wang X., 2006, THESIS NEW JERSEY I
[30]   Parallel LU factorization of sparse matrices on FPGA-based configurable computing engines [J].
Wang, XF ;
Ziavras, SG .
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2004, 16 (04) :319-343