Using accurate arithmetics to improve numerical reproducibility and stability in parallel applications

被引:66
作者
He, Y [1 ]
Ding, CHQ [1 ]
机构
[1] Univ Calif Berkeley, Lawrence Berkeley Lab, NERSC Div, Berkeley, CA 94720 USA
关键词
reproducibility; climate models; double-double precision arithmetic; self-compensated summation; distributed memory architecture;
D O I
10.1023/A:1008153532043
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Numerical reproducibility and stability of large scale scientific simulations, especially climate modeling, on distributed memory parallel computers are becoming critical issues. In particular, global summation of distributed arrays is most susceptible to rounding errors, and their propagation and accumulation cause uncertainty in final simulation results. We analyzed several accurate summation methods and found that two methods are particularly effective to improve (ensure) reproducibility and stability: Kahan's self-compensated summation and Bailey's double-double precision summation. We provide an MPI operator MPI_SUMDD to work with MPI collective operations to ensure a scalable implementation on large number of processors. The final methods are particularly simple to adopt in practical codes: not only global summations, but also vector-vector dot products and matrix-vector or matrix-matrix operations.
引用
收藏
页码:259 / 277
页数:19
相关论文
共 26 条
[1]   ALGORITHM 719 - MULTIPRECISION TRANSLATION AND EXECUTION OF FORTRAN PROGRAMS [J].
BAILEY, DH .
ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 1993, 19 (03) :288-319
[2]  
BAILEY DH, FORTRAN 90 SUITE DOU
[3]  
Brent R. P., 1978, ACM Transactions on Mathematical Software, V4, P57, DOI 10.1145/355769.355775
[4]  
Ding CHQ, 1998, LECT NOTES COMPUT SC, V1401, P115, DOI 10.1007/BFb0037138
[5]  
DING CHQ, 1996, SIAM NEWS NOV, P1
[6]  
DING CHQ, 1999, P SUPERCOMPUTING 99
[7]   DESIGN AND PERFORMANCE OF A SCALABLE PARALLEL COMMUNITY CLIMATE MODEL [J].
DRAKE, J ;
FOSTER, I ;
MICHALAKES, J ;
TOONEN, B ;
WORLEY, P .
PARALLEL COMPUTING, 1995, 21 (10) :1571-1591
[8]  
Fox G., 1988, SOLVING PROBLEMS CON
[9]  
GOLDBERG D, 1991, ACM COMPUTING SU MAR
[10]  
Greenbaum Anne, 1997, FRONTIERS APPL MATH, V17