Parallel Sparse Matrix Solution for Circuit Simulation on FPGAs

被引:11
作者
Nechma, Tarek [1 ]
Zwolinski, Mark [1 ]
机构
[1] Univ Southampton, Fac Phys Sci & Engn, Elect & Comp Sci, Southampton SO17 1BJ, Hants, England
关键词
Hardware acceleration; sparse matrices; SPICE; FPGA arithmetic; pipeline and parallel arithmetic and logic structures; FACTORIZATION;
D O I
10.1109/TC.2014.2308202
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
SPICE is the de facto standard for circuit simulation. However, accurate SPICE simulations of today's sub-micron circuits can often take days or weeks on conventional processors. A SPICE simulation is an iterative process that consists of two phases per iteration: model evaluation followed by a matrix solution. The model evaluation phase has been found to be easily parallelizable, unlike the subsequent phase, which involves the solution of highly sparse and asymmetric matrices. In this paper, we present an FPGA implementation of a sparse matrix solver, geared towards matrices that arise in SPICE circuit simulations. Our approach combines static pivoting with symbolic analysis to compute an accurate task flow-graph which efficiently exploits parallelism at multiple granularities and sustains high floating-point data rates. We also present a quantitative comparison between the performance of our hardware prototype and state-of-the-art software packages running on a general-purpose PC. We report average speed-ups of 9.65x, 11.83x, and 17.21x against UMFPACK, KLU, and Kundert Sparse matrix packages, respectively.
引用
收藏
页码:1090 / 1103
页数:14
相关论文
共 33 条
[1]   SPARSE GAUSSIAN-ELIMINATION WITH CONTROLLED FILL-IN ON A SHARED MEMORY MULTIPROCESSOR [J].
ALAGHBAND, G ;
JORDAN, HF .
IEEE TRANSACTIONS ON COMPUTERS, 1989, 38 (11) :1539-1557
[2]  
[Anonymous], P INT WORKSH STAT OF
[3]  
[Anonymous], 1994, Synthesis and optimization of digital circuits
[4]  
ASHCRAFT CC, 1987, INT J SUPERCOMPUT AP, V1, P10
[5]  
Davis T., 2009, ACM T MS, V5, P1
[6]  
Davis T., 1989, TECHNICAL REPORT
[7]  
Davis T., 2013, SUITE SPARSE SUITE S
[8]   Algorithm 832: UMFPACK V4.3 - An unsymmetric-pattern multifrontal method [J].
Davis, TA .
ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2004, 30 (02) :196-199
[9]  
Davis TA, 2006, FUND ALGORITHMS, V2, P1, DOI 10.1137/1.9780898718881
[10]   COMMUNICATION-OPTIMAL PARALLEL AND SEQUENTIAL QR AND LU FACTORIZATIONS [J].
Demmel, James ;
Grigori, Laura ;
Hoemmen, Mark ;
Langou, Julien .
SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2012, 34 (01) :A206-A239