Accurate Parallel Floating-Point Accumulation

被引:4
作者
Kadric, Edin [1 ]
Gurniak, Paul [1 ]
DeHon, Andre [1 ]
机构
[1] Univ Penn, Dept Elect & Syst Engn, Philadelphia, PA 19104 USA
来源
2013 21ST IEEE SYMPOSIUM ON COMPUTER ARITHMETIC (ARITH) | 2013年
关键词
Floating-Point Arithmetic; IEEE-754; Parallel; Accumulation; Accurate; Rounding; FPGA;
D O I
10.1109/ARITH.2013.19
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Using parallel associative reduction, iterative refinement, and conservative termination detection, we show how to use tree reduce parallelism to compute correctly rounded floating-point sums in O(log N) depth at arbitrary throughput. Our parallel solution shows how we can continue to exploit Moore's Law scaling in transistor count to accelerate floating-point performance even when clock rates remain flat. Empirical evidence suggests our iterative algorithm only requires two tree reduce passes to converge to the accurate sum in virtually all cases. Furthermore, we develop the hardware implementation of a 250 MHz pipelined, native, residue-preserving IEEE-754 double-precision, floating-point adder on a Virtex 6 FPGA that requires only 48% more area than a standard adder without residue. Finally, we show how this module can be used as the base of a streaming accurate floating-point accumulation unit that can be tuned to consume m summands every cycle.
引用
收藏
页码:153 / 162
页数:10
相关论文
共 20 条
[1]  
Agarwal V, 2000, PROCEEDING OF THE 27TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, P248, DOI [10.1145/342001.339691, 10.1109/ISCA.2000.854395]
[2]   A distillation algorithm for floating-point summation [J].
Anderson, IJ .
SIAM JOURNAL ON SCIENTIFIC COMPUTING, 1999, 20 (05) :1797-1806
[3]  
Bluespec Inc, BLUESP SYSTEMVERILOG
[4]  
de Dinechin F., 2007, ENSL00174627
[5]   FLOATING-POINT TECHNIQUE FOR EXTENDING AVAILABLE PRECISION [J].
DEKKER, TJ .
NUMERISCHE MATHEMATIK, 1971, 18 (03) :224-+
[6]   Accurate and efficient floating point summation [J].
Demmel, J ;
Hida, Y .
SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2003, 25 (04) :1214-1248
[7]  
Dieter William R., 2007, IEEE Computer Architecture Letters, V6, P13, DOI 10.1109/L-CA.2007.1
[8]  
Ercegovac M. D., 2004, DIGITAL ARITHMETIC, P608
[9]  
Fuller SH, 2011, FUTURE OF COMPUTING PERFORMANCE: GAME OVER OR NEXT LEVEL?, P1
[10]   THE ACCURACY OF FLOATING-POINT SUMMATION [J].
HIGHAM, NJ .
SIAM JOURNAL ON SCIENTIFIC COMPUTING, 1993, 14 (04) :783-799