Analysis and performance of a distributed memory multilevel fast multipole algorithm

被引：118

作者：

Velamparambil, S ^{[1
]}

Chew, WC

机构：

[1] Ansoft Corp, Boulder, CO 80303 USA

[2] Univ Illinois, Dept Elect & Comp Engn, Urbana, IL 61801 USA

来源：

IEEE TRANSACTIONS ON ANTENNAS AND PROPAGATION | 2005年 / 53卷 / 08期

关键词：

electromagnetic scattering; fast multipole method (FMM); integral equations; parallel algorithms;

D O I：

10.1109/TAP.2005.851859

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In this paper, we analyze the communication pattern and study the scalability of a distributed memory implementation of the multilevel fast multipole algorithm (MLFMA) called ScaleME. ScaleME uses the message passing interface (MPI) for communication between processors. The parallelization of MLFMA uses a novel a hybrid scheme for distributing the workload across the processors. We study the communication and computational behavior and demonstrate the effectiveness of the parallelization scheme using realistic problems.

引用

页码：2719 / 2727

页数：9

共 25 条

[1]

[Anonymous], THESIS U ILLINOIS UR

[2] OPTIMAL INTERPOLATION OF RADIATED FIELDS OVER A SPHERE [J].

BUCCI, OM ;

GENNARELLI, C ;

SAVARESE, C .

IEEE TRANSACTIONS ON ANTENNAS AND PROPAGATION, 1991, 39 (11) :1633-1643

[3] A DECOMPOSITION OF MULTIDIMENSIONAL POINT SETS WITH APPLICATIONS TO K-NEAREST-NEIGHBORS AND N-BODY POTENTIAL FIELDS [J].

CALLAHAN, PB ;

KOSARAJU, SR .

JOURNAL OF THE ASSOCIATION FOR COMPUTING MACHINERY, 1995, 42 (01) :67-90

[4]

Chew W. C., 2001, FAST EFFICIENT ALGOR

[5]

CHEW WC, 2001, FAST EFFICIENT ALGOR, pCH4

[6] The accuracy of fast multipole methods for Maxwell's equations [J].

Dembart, B ;

Yip, E .

IEEE COMPUTATIONAL SCIENCE & ENGINEERING, 1998, 5 (03) :48-56

[7]

Grama A. Y., 1994, Proceedings Supercomputing '94 (Cat. No.94CH34819), P439, DOI 10.1109/SUPERC.1994.344307

[8] A massively parallel fast multipole algorithm in three dimensions [J].

Lu, EJL ;

Okunbor, DI .

PROCEEDINGS OF THE FIFTH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE DISTRIBUTED COMPUTING, 1996, :40-48

[9] Parallel performance of two applications in the Boeing high performance computing benchmark suite [J].

Manke, JW ;

Kerlick, GD ;

Levine, D ;

Banerjee, S ;

Dillon, E .

PARALLEL COMPUTING, 2001, 27 (04) :457-475

[10]

OTTUSCH JJ, 2000, SUPERCOMPUTING 2000, P54

← 1 2 3 →