FPGA-Based High-Performance and Scalable Block LU Decomposition Architecture

被引：42

作者：

Jaiswal, Manish Kumar ^{[1
]}

Chandrachoodan, Nitin ^{[2
]}

机构：

[1] ICFAI Univ, Dehra Dun, India

[2] Indian Inst Technol, Dept Elect Engn, Madras 600036, Tamil Nadu, India

来源：

IEEE TRANSACTIONS ON COMPUTERS | 2012年 / 61卷 / 01期

关键词：

LU decomposition; block LU; FPGA; hardware acceleration; floating point arithmetics; single/double precision; scaling; ATLAS; Intel-MKL; GPU; LINEAR ALGEBRA; STABILITY;

D O I：

10.1109/TC.2011.24

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Decomposition of a matrix into lower and upper triangular matrices (LU decomposition) is a vital part of many scientific and engineering applications, and the block LU decomposition algorithm is an approach well suited to parallel hardware implementation. This paper presents an approach to speed up implementation of the block LU decomposition algorithm using FPGA hardware. Unlike most previous approaches reported in the literature, the approach does not assume the matrix can be stored entirely on chip. The memory accesses are studied for various FPGA configurations, and a schedule of operations for scaling well is shown. The design has been synthesized for FPGA targets and can be easily retargeted. The design outperforms previous hardware implementations, as well as tuned software implementations including the ATLAS and MKL libraries on workstations.

引用

页码：60 / 72

页数：13

共 32 条

[1] Numerical linear algebra on emerging architectures: the PLASMA and MAGMA projects
Agullo, Emmanuel
Demmel, Jim
Dongarra, Jack
Hadri, Bilel
Kurzak, Jakub
Langou, Julien
Ltaief, Hatem
Luszczek, Piotr
Tomov, Stanimire
[J]. SCIDAC 2009: SCIENTIFIC DISCOVERY THROUGH ADVANCED COMPUTING, 2009, 180
[2] CHOI S, 2003, P INT C FIELD PROGR, V2278, P507
[3] STABILITY OF BLOCK LU FACTORIZATION
DEMMEL, JW
HIGHAM, NJ
SCHREIBER, RS
[J]. NUMERICAL LINEAR ALGEBRA WITH APPLICATIONS, 1995, 2 (02) : 173 - 190
[4] STABILITY OF BLOCK ALGORITHMS WITH FAST LEVEL-3 BLAS
DEMMEL, JW
HIGHAM, NJ
[J]. ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 1992, 18 (03): : 274 - 291
[5] DONGARRA J, 2010, LINPACK BENCHMARKING
[6] SOFTWARE LIBRARIES FOR LINEAR ALGEBRA COMPUTATIONS ON HIGH-PERFORMANCE COMPUTERS
DONGARRA, JJ
WALKER, DW
[J]. SIAM REVIEW, 1995, 37 (02) : 151 - 180
[7] LARGE DENSE NUMERICAL LINEAR ALGEBRA IN 1993 - THE PARALLEL COMPUTING INFLUENCE
EDELMAN, A
[J]. INTERNATIONAL JOURNAL OF SUPERCOMPUTER APPLICATIONS AND HIGH PERFORMANCE COMPUTING, 1993, 7 (02): : 113 - 128
[8] Galoppo N., 2005, Proceedings of the 2005 ACM/IEEE conference on Supercomputing, IEEE Computer Society, P3, DOI DOI 10.1109/SC.2005.42
[9] GOPALAKRISHNAN L, 2008, XAPP853
[10] Govindu G., 2004, Proceedings. 18th International Parallel and Distributed Processing Symposium

← 1 2 3 4 →