Supporting efficient tree structures for distributed scientific computation

被引:0
作者
Liu, PF [1 ]
Wu, JJ
机构
[1] Natl Chung Cheng Univ, Dept Comp Sci & Comp Engn, Chiayi 621, Taiwan
[2] Acad Sinica, Inst Informat Sci, Taipei 115, Taiwan
关键词
distributed scientific computation; data structures; parallel trees; n-body simulations; class library;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper describes an implementation of a platform-independent parallel C++ framework that can support various scientific simulations involving tree structures, such as astrophysics, semiconductor device simulation, molecular dynamics, plasma physics, and fluid mechanics. Within the framework, users are able to concentrate on the computation kernels that differentiate different tree-structured scientific simulation problems and let the framework take care of tedious and error-prone details that are common among these applications. This framework was developed based on the techniques we learned from previous CM-5 C implementations, which have been rigorously justified both experimentally and mathematically. This gives us confidence that our framework will allow fast prototyping of different scientific simulation applications that run on different parallel platforms and deliver good performance as well. We used MPI to implement the communication routines within the framework for better portability; therefore, the parallel library, as well as the applications developed within the framework, can run on every parallel machine where MPI is available. The applications remain portable across different platforms, and the communication library within the framework can be customized to explore possible performance gains based on individual characteristics of different parallel machines. To demonstrate the flexibility and performance of this framework, we implemented a gravitational force field computation code and a multi-filament vortex method on a SPARC Ultra workstation cluster, and report very competitive speedup even compared with a highly optimized sequential C implementation.
引用
收藏
页码:79 / 105
页数:27
相关论文
共 44 条
  • [1] ANDERSON C, 1989, COMMUNICATIONS PURE, V42, P1103
  • [2] [Anonymous], 1992, SUPERCOMPUTING 92, DOI DOI 10.1109/SUPERC.1992.236647
  • [3] AN EFFICIENT PROGRAM FOR MANY-BODY SIMULATION
    APPEL, AW
    [J]. SIAM JOURNAL ON SCIENTIFIC AND STATISTICAL COMPUTING, 1985, 6 (01): : 85 - 103
  • [4] ATLAS S, 1995, P SUPERCOMPUTING 95, P76
  • [5] BAIN WL, 1990, 5 DISTR MEM COMP C C, V2, P1050
  • [6] A HIERARCHICAL O(N-LOG-N) FORCE-CALCULATION ALGORITHM
    BARNES, J
    HUT, P
    [J]. NATURE, 1986, 324 (6096) : 446 - 449
  • [7] BHATT S, 1992, SOLABLE HIGH PERFORM, P38
  • [8] BHATT S, 1995, INT PAR PROC S, P31
  • [9] CHASE CM, 1991, 1991 INT C PAR PROC, V2, P211
  • [10] CHIEN AA, 1990, 2 ACM SIGPLAN S PRIN, P187