FPGA-Based High-Performance and Scalable Block LU Decomposition Architecture

Cited by: 42
Authors
Jaiswal, Manish Kumar [1]
Chandrachoodan, Nitin [2]
Affiliations
[1] ICFAI Univ, Dehra Dun, India
[2] Indian Inst Technol, Dept Elect Engn, Madras 600036, Tamil Nadu, India
Keywords
LU decomposition; block LU; FPGA; hardware acceleration; floating point arithmetics; single/double precision; scaling; ATLAS; Intel-MKL; GPU; LINEAR ALGEBRA; STABILITY;
DOI
10.1109/TC.2011.24
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Decomposition of a matrix into lower and upper triangular matrices (LU decomposition) is a vital part of many scientific and engineering applications, and the block LU decomposition algorithm is an approach well suited to parallel hardware implementation. This paper presents an approach to speed up implementation of the block LU decomposition algorithm using FPGA hardware. Unlike most previous approaches reported in the literature, the approach does not assume the matrix can be stored entirely on chip. The memory accesses are studied for various FPGA configurations, and a schedule of operations for scaling well is shown. The design has been synthesized for FPGA targets and can be easily retargeted. The design outperforms previous hardware implementations, as well as tuned software implementations including the ATLAS and MKL libraries on workstations.
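For readers unfamiliar with the algorithm named in the abstract, the following is a minimal software sketch of a generic right-looking block LU factorization without pivoting. It is not the paper's FPGA architecture or its operation schedule; the names `block_lu` and `matmul`, the block size, and the test matrix are all hypothetical choices made for illustration, and the matrix is chosen diagonally dominant so that no pivoting is needed.

```python
def block_lu(a, b):
    """Factor square matrix a into unit-lower L and upper U using block size b.

    Illustrative only: no pivoting, so it assumes every pivot is nonzero.
    """
    n = len(a)
    w = [list(map(float, row)) for row in a]  # working copy, overwritten with L and U
    for k in range(0, n, b):
        e = min(k + b, n)  # end of the current diagonal block (handles a ragged last block)
        # Step 1: unblocked LU of the diagonal block, A11 = L11 * U11.
        for j in range(k, e):
            for i in range(j + 1, e):
                w[i][j] /= w[j][j]
                for c in range(j + 1, e):
                    w[i][c] -= w[i][j] * w[j][c]
        # Step 2: U12 = L11^{-1} * A12 by forward substitution (L11 has unit diagonal).
        for j in range(k, e):
            for i in range(j + 1, e):
                for c in range(e, n):
                    w[i][c] -= w[i][j] * w[j][c]
        # Step 3: L21 = A21 * U11^{-1}, solved one column at a time.
        for j in range(k, e):
            for i in range(e, n):
                for c in range(k, j):
                    w[i][j] -= w[i][c] * w[c][j]
                w[i][j] /= w[j][j]
        # Step 4: trailing update A22 = A22 - L21 * U12 (a matrix-matrix multiply).
        for i in range(e, n):
            for j in range(e, n):
                w[i][j] -= sum(w[i][c] * w[c][j] for c in range(k, e))
    lower = [[w[i][j] if j < i else (1.0 if i == j else 0.0) for j in range(n)]
             for i in range(n)]
    upper = [[w[i][j] if j >= i else 0.0 for j in range(n)] for i in range(n)]
    return lower, upper


def matmul(x, y):
    """Plain triple-loop matrix product, used only to check the factorization."""
    return [[sum(x[i][k] * y[k][j] for k in range(len(y))) for j in range(len(y[0]))]
            for i in range(len(x))]


# Hypothetical diagonally dominant test matrix.
A = [[4.0, 1.0, 2.0, 0.5],
     [1.0, 5.0, 1.0, 2.0],
     [2.0, 1.0, 6.0, 1.0],
     [0.5, 2.0, 1.0, 7.0]]

L, U = block_lu(A, 2)
residual = max(abs(p - q) for rp, rq in zip(matmul(L, U), A) for p, q in zip(rp, rq))
print("max |L*U - A| =", residual)
```

In this formulation most of the arithmetic falls in Step 4, the trailing matrix-matrix update, and each step touches only a few blocks at a time, which is consistent with the abstract's point that the whole matrix need not be held on chip. How the paper maps these steps onto FPGA resources is not shown here.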
Pages: 60-72
Number of pages: 13