Finite-difference time-domain on the Cell/BE processor

被引:0
作者
Xu, Meilian [1 ]
Thulasiraman, Parimala [1 ]
机构
[1] Univ Manitoba, Dept Comp Sci, Winnipeg, MB R3T 2N2, Canada
来源
2008 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-8 | 2008年
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Finite-Difference Time-Domain (FDTD) is a kernel used to solve problems in electromagnetics applications such as microwave tomography. It is a data-intensive and computation-intensive problem. However, its computation scheme indicates that an architecture with SIMD support has the potential to bring performance improvement over traditional architectures without SIMD support. The Cell Broadband Engine (Cell/B. E.) processor is an implementation of a heterogeneous multicore architecture. It consists of one conventional microprocessor, PowerPC Processor Element (PPE), and eight SIMD co-processor elements, Synergistic Processor Elements (SPEs). One unique feature of an SPE is that it has 128-entry 128-bit uniform registers which support SIMD. Therefore, FDTD may be mapped well on Cell/B.E. processor However, each SPE can directly access only 256KB local store (LS) both for instructions and data. The size of LS is much less than what is needed for an accurate simulation of FDTD which requires large number of fine-grained Yee cells. In this paper, we design the algorithm on Cell/B.E. by efficiently using the asynchronous DMA (direct memory access) mechanism available on an SPE transferring data between its LS and the main memory via the high bandwidth bus on-chip EIB (Element Interconnect Bus). The new algorithm was run on an IBM Blade QS20 blades running at 3.2GHz. For a computation domain of 600 x 600 Yee cells, we achieve an overall speedup of 14.14 over AMD Athlon and 7.05 over AMD Opteron at the processor level.
引用
收藏
页码:3025 / 3032
页数:8
相关论文
共 17 条
[1]  
[Anonymous], 2006, Proceedings of the 3rd conference on Computing frontiers, CF '06, DOI DOI 10.1145/1128022.1128027
[2]  
BROKENSHIRE D, 2006, MAXIMIZING POWER CEL
[3]   MODIFIED BERENGER PML ABSORBING BOUNDARY-CONDITION FOR FD-TD MESHES [J].
CHEN, B ;
FANG, DG ;
ZHOU, BH .
IEEE MICROWAVE AND GUIDED WAVE LETTERS, 1995, 5 (11) :399-401
[4]   Cell broadband engine architecture and its first implementation - A performance view [J].
Chen, T. ;
Raghavan, R. ;
Dale, J. N. ;
Iwata, E. .
IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 2007, 51 (05) :559-572
[5]  
Courant R, 1967, IBM J, P215
[6]   A parallel FDTD algorithm using the MPI library [J].
Guiffaut, C ;
Mahdjoubi, K .
IEEE ANTENNAS AND PROPAGATION MAGAZINE, 2001, 43 (02) :94-103
[7]   Introduction to the cell multiprocessor [J].
Kahle, JA ;
Day, MN ;
Hofstee, HP ;
Johns, CR ;
Maeurer, TR ;
Shippy, D .
IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 2005, 49 (4-5) :589-604
[8]   Cell multiprocessor communication network: Built for speed [J].
Kistler, Michael ;
Perrone, Michael ;
Petrini, Fabrizio .
IEEE MICRO, 2006, 26 (03) :10-23
[9]   TECHNIQUES FOR IMPLEMENTATION OF THE FDTD METHOD ON A CM-5 PARALLEL COMPUTER [J].
LIU, ZM ;
MOHAN, AS ;
AUBREY, TA ;
BELCHER, WR .
IEEE ANTENNAS AND PROPAGATION MAGAZINE, 1995, 37 (05) :64-71
[10]  
RODOHAN D, 1993, IEEE T ANTENN PROPAG, V14, P302