A Low Complexity Scaling Method for the Lanczos Kernel in Fixed-Point Arithmetic

被引:11
作者
Jerez, Juan Luis [1 ]
Constantinides, George A. [1 ]
Kerrigan, Eric C. [1 ,2 ]
机构
[1] Univ London Imperial Coll Sci Technol & Med, Dept Elect & Elect Engn, London SW7 2AZ, England
[2] Univ London Imperial Coll Sci Technol & Med, Dept Aeronaut, London SW7 2AZ, England
基金
英国工程与自然科学研究理事会;
关键词
Computer arithmetic; computations on matrices; numerical algorithms; design aids; PERFORMANCE; ALGORITHMS; SYSTEMS;
D O I
10.1109/TC.2013.162
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We consider the problem of enabling fixed-point implementation of linear algebra kernels on low-cost embedded systems, as well as motivating more efficient computational architectures for scientific applications. Fixed-point arithmetic presents additional design challenges compared to floating-point arithmetic, such as having to bound peak values of variables and control their dynamic ranges. Algorithms for solving linear equations or finding eigenvalues are typically nonlinear and iterative, making solving these design challenges a nontrivial task. For these types of algorithms, the bounding problem cannot be automated by current tools. We focus on the Lanczos iteration, the heart of well-known methods such as conjugate gradient and minimum residual. We show how one can modify the algorithm with a low-complexity scaling procedure to allow us to apply standard linear algebra to derive tight analytical bounds on all variables of the process, regardless of the properties of the original matrix. It is shown that the numerical behavior of fixed-point implementations of the modified problem can be chosen to be at least as good as a floating-point implementation, if necessary. The approach is evaluated on field-programmable gate array (FPGA) platforms, highlighting orders of magnitude potential performance and efficiency improvements by moving form floating-point to fixed-point computation.
引用
收藏
页码:303 / 315
页数:13
相关论文
共 48 条
[1]  
[Anonymous], 2011, LOGICORE IP FLOAT PO
[3]  
Benedetti A, 2000, CONF REC ASILOMAR C, P355, DOI 10.1109/ACSSC.2000.910977
[4]  
Boland D, 2012, FPGA 12: PROCEEDINGS OF THE 2012 ACM-SIGDA INTERNATIONAL SYMPOSIUM ON FIELD PROGRAMMABLE GATE ARRAYS, P185
[5]   Analysis of conjugate gradient algorithms for adaptive filtering [J].
Chang, PS ;
Willson, AN .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2000, 48 (02) :409-418
[6]  
Constantinides George A., 2009, 2009 European Control Conference (ECC), P138
[7]   Numerical Data Representations for FPGA-Based Scientific Computing [J].
Constantinides, George A. ;
Kinsman, Adam B. ;
Nicolici, Nicola .
IEEE DESIGN & TEST OF COMPUTERS, 2011, 28 (04) :8-17
[8]   Certification of Bounds on Expressions Involving Rounded Operators [J].
Daumas, Marc ;
Melquiond, Guillaume .
ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2010, 37 (01)
[9]   The University of Florida Sparse Matrix Collection [J].
Davis, Timothy A. ;
Hu, Yifan .
ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2011, 38 (01)
[10]   Designing Custom Arithmetic Data Paths with FloPoCo [J].
de Dinechin, Florent ;
Pasca, Bogdan .
IEEE DESIGN & TEST OF COMPUTERS, 2011, 28 (04) :18-27