Optimistic parallelization of floating-point accumulation

被引:21
作者
Kapre, Nachiket [1 ]
Dehon, Andre [2 ]
机构
[1] CALTECH, Pasadena, CA 91125 USA
[2] Univ Penn, Elect Syst Engn, Philadelphia, PA 19104 USA
来源
18TH IEEE SYMPOSIUM ON COMPUTER ARITHMETIC, PROCEEDINGS | 2007年
关键词
D O I
10.1109/ARITH.2007.25
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Floating-point arithmetic is notoriously non-associative due to the limited precision representation which demands intermediate values be rounded to fit in the available precision. The resulting cyclic dependency in floating-point accumulation inhibits parallelization of the computation, including efficient use of pipelining. In practice, however, we observe that floating-point operations are "mostly" associative. This observation can be exploited to parallelize floating-point accumulation using a form of optimistic concurrency. In this scheme, we first compute an optimistic associative approximation to the sum and then relax the computation by iteratively propagating errors until the correct sum is obtained. We map this computation to a network of 16 statically-scheduled, pipelined, double-precision floating-point adders on the Virtex-4 LX160 (-12) device where each floating-point adder runs at 296 MHz and has a pipeline depth of 10. On this 16 PE design, we demonstrate an average speedup of 6x with randomly generated data and 3-7x with summations extracted from Conjugate Gradient benchmarks.
引用
收藏
页码:205 / +
页数:2
相关论文
共 17 条
[1]  
[Anonymous], P ACM SIGDA 13 INT S
[2]  
[Anonymous], 2004, P 12 ACM INT S FIELD
[3]   A REGULAR LAYOUT FOR PARALLEL ADDERS [J].
BRENT, RP ;
KUNG, HT .
IEEE TRANSACTIONS ON COMPUTERS, 1982, 31 (03) :260-264
[4]  
Han T., 1987, Proceedings of the 8th Symposium on Computer Arithmetic (Cat. No.87CH2419-0), P49, DOI 10.1109/ARITH.1987.6158699
[5]  
HEMMERT KS, 2006, P IOEEE S FIELD PROG
[6]   DATA PARALLEL ALGORITHMS [J].
HILLIS, WD ;
STEELE, GL .
COMMUNICATIONS OF THE ACM, 1986, 29 (12) :1170-1183
[7]  
*I S COMM, 1985, 7541985 ANSI IEEE
[8]  
Kapre N, 2006, ANN IEEE SYM FIELD P, P205
[9]  
KELSEY R, 1998, HGHER ORDER SYMBOLIC, V11
[10]   PARALLEL ALGORITHM FOR EFFICIENT SOLUTION OF A GENERAL CLASS OF RECURRENCE EQUATIONS [J].
KOGGE, PM ;
STONE, HS .
IEEE TRANSACTIONS ON COMPUTERS, 1973, C-22 (08) :786-793