Analysis and performance of a distributed memory multilevel fast multipole algorithm

被引:118
作者
Velamparambil, S [1 ]
Chew, WC
机构
[1] Ansoft Corp, Boulder, CO 80303 USA
[2] Univ Illinois, Dept Elect & Comp Engn, Urbana, IL 61801 USA
关键词
electromagnetic scattering; fast multipole method (FMM); integral equations; parallel algorithms;
D O I
10.1109/TAP.2005.851859
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we analyze the communication pattern and study the scalability of a distributed memory implementation of the multilevel fast multipole algorithm (MLFMA) called ScaleME. ScaleME uses the message passing interface (MPI) for communication between processors. The parallelization of MLFMA uses a novel a hybrid scheme for distributing the workload across the processors. We study the communication and computational behavior and demonstrate the effectiveness of the parallelization scheme using realistic problems.
引用
收藏
页码:2719 / 2727
页数:9
相关论文
共 25 条
[1]  
[Anonymous], THESIS U ILLINOIS UR
[2]   OPTIMAL INTERPOLATION OF RADIATED FIELDS OVER A SPHERE [J].
BUCCI, OM ;
GENNARELLI, C ;
SAVARESE, C .
IEEE TRANSACTIONS ON ANTENNAS AND PROPAGATION, 1991, 39 (11) :1633-1643
[3]   A DECOMPOSITION OF MULTIDIMENSIONAL POINT SETS WITH APPLICATIONS TO K-NEAREST-NEIGHBORS AND N-BODY POTENTIAL FIELDS [J].
CALLAHAN, PB ;
KOSARAJU, SR .
JOURNAL OF THE ASSOCIATION FOR COMPUTING MACHINERY, 1995, 42 (01) :67-90
[4]  
Chew W. C., 2001, FAST EFFICIENT ALGOR
[5]  
CHEW WC, 2001, FAST EFFICIENT ALGOR, pCH4
[6]   The accuracy of fast multipole methods for Maxwell's equations [J].
Dembart, B ;
Yip, E .
IEEE COMPUTATIONAL SCIENCE & ENGINEERING, 1998, 5 (03) :48-56
[7]  
Grama A. Y., 1994, Proceedings Supercomputing '94 (Cat. No.94CH34819), P439, DOI 10.1109/SUPERC.1994.344307
[8]   A massively parallel fast multipole algorithm in three dimensions [J].
Lu, EJL ;
Okunbor, DI .
PROCEEDINGS OF THE FIFTH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE DISTRIBUTED COMPUTING, 1996, :40-48
[9]   Parallel performance of two applications in the Boeing high performance computing benchmark suite [J].
Manke, JW ;
Kerlick, GD ;
Levine, D ;
Banerjee, S ;
Dillon, E .
PARALLEL COMPUTING, 2001, 27 (04) :457-475
[10]  
OTTUSCH JJ, 2000, SUPERCOMPUTING 2000, P54