An efficient parallel algorithm for O(N2) direct summation method and its variations on distributed-memory parallel machines

被引:27
作者
Makino, J [1 ]
机构
[1] Univ Tokyo, Sch Sci, Dept Astron, Bunkyo Ku, Tokyo 1130033, Japan
基金
日本学术振兴会;
关键词
celestial mechanics; stellar dynamics; methods : numerical;
D O I
10.1016/S1384-1076(02)00143-4
中图分类号
P1 [天文学];
学科分类号
0704 ;
摘要
We present a novel, highly efficient algorithm to parallelize O(N-2) direct summation method for N-body problems with individual timesteps on distributed-memory parallel machines such as Beowulf clusters. Previously known algorithms, in which all processors have complete copies of the N-body system, has the serious problem that the communication-computation ratio increases as we increase the number of processors, since the communication cost is independent of the number of processors. In the new algorithm, p processors are organized as a rootp x rootp two-dimensional array. Each processor has N/rootp particles, but the data are distributed in such a way that complete system is presented if we look at any row or column consisting of rootp processors. In this algorithm, the communication cost scales as N/rootp, while the calculation cost scales as N-2/p. Thus, we can use a much larger number of processors without losing efficiency compared to what was practical with previously known algorithms. (C) 2002 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:373 / 384
页数:12
相关论文
共 13 条
[1]   DYNAMICAL EVOLUTION OF CLUSTERS OF GALAXIES .1. [J].
AARSETH, SJ .
MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 1963, 126 (03) :223-255
[2]   From NBODY1 to NBODY6: The growth of an industry [J].
Aarseth, SJ .
PUBLICATIONS OF THE ASTRONOMICAL SOCIETY OF THE PACIFIC, 1999, 111 (765) :1333-1346
[3]   NUMERICAL-INTEGRATION SCHEME FOR N-BODY GRAVITATIONAL PROBLEM [J].
AHMAD, A ;
COHEN, L .
JOURNAL OF COMPUTATIONAL PHYSICS, 1973, 12 (03) :389-402
[4]  
[Anonymous], 1999, BUILD BEOWULF GUIDE
[5]  
BAUMGARDT H, 2002, UNPUB MNRAS
[6]  
DORBAND EN, 2002, ASTROPH0112092
[7]  
Fox G.C., 1994, PARALLEL COMPUTING W
[8]   Hyper-systolic parallel computing [J].
Lippert, T ;
Seyfried, A ;
Bode, A ;
Schilling, K .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1998, 9 (02) :97-108
[9]   PERFORMANCE ANALYSIS OF DIRECT N-BODY CALCULATIONS [J].
MAKINO, J ;
HUT, P .
ASTROPHYSICAL JOURNAL SUPPLEMENT SERIES, 1988, 68 (04) :833-856
[10]  
MAKINO J, 1991, PUBL ASTRON SOC JPN, V43, P859