Optimizing a conjugate gradient solver with non-blocking collective operations

被引:0
作者
Hoefler, Torsten [1 ]
Gottschling, Peter
Rehm, Wolfgang
Lumsdaine, Andrew
机构
[1] Indiana Univ, Open Syst Lab, Bloomington, IN 47404 USA
[2] Tech Univ Chemnitz, Dept Comp Sci, D-09107 Chemnitz, Germany
来源
RECENT ADVANCES IN PARALLEL VIRTUAL MACHINE AND MESSAGE PASSING INTERFACE | 2006年 / 4192卷
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper presents a case study about the applicability and usage of non-blocking collective operations. These operations provide the ability to overlap communication with computation and to avoid unnecessary synchronization. We introduce our NBC library, a portable low-overhead implementation of non-blocking collectives on top of MPI-1. We demonstrate the easy usage of the NBC library with the optimization of a conjugate gradient solver with only minor changes to the traditional parallel implementation of the program. The optimized solver runs up to 34% faster and is able to overlap most of the communication. We show that there is, due to the overlap, no performance difference between Gi-gabit Ethernet and InfiniBand (TM) for our calculation.
引用
收藏
页码:374 / 382
页数:9
相关论文
共 16 条
  • [1] [Anonymous], P 11 EUR PVM MPI US
  • [2] Send-receive considered harmful: Myths and realities of message passing
    Gorlatch, S
    [J]. ACM TRANSACTIONS ON PROGRAMMING LANGUAGES AND SYSTEMS, 2004, 26 (01): : 47 - 56
  • [3] GOTTSCHLING P, 2000, LNCS, V1900
  • [4] Hackbusch W, 1994, ITERATIVE SOLUTION L
  • [5] METHODS OF CONJUGATE GRADIENTS FOR SOLVING LINEAR SYSTEMS
    HESTENES, MR
    STIEFEL, E
    [J]. JOURNAL OF RESEARCH OF THE NATIONAL BUREAU OF STANDARDS, 1952, 49 (06): : 409 - 436
  • [6] HOEFLER T, 2006, UNPUB ISPA
  • [7] HOEFLER T, 2006, NONBLOCKING COLLECTI
  • [8] HOEFLER T, 2006, 19 INT C ARCH COMP S, P343
  • [9] Kale L. V., 2003, P IPDPS 03 NIC FRANC
  • [10] KANEVSKY A, 1998, HICSS, P157